Overview

Dataset statistics

Number of variables: 5
Number of observations: 12663
Missing cells: 2833
Missing cells (%): 4.5%
Total size in memory: 640.1 KiB
Average record size in memory: 51.8 B
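
The derived figures above follow from the raw counts. A minimal sketch of the arithmetic, assuming that the missing-cell percentage is taken over rows times columns and that 1 KiB is 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Raw figures copied from the dataset statistics.
n_variables = 5
n_observations = 12663
missing_cells = 2833
total_size_kib = 640.1

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both values round to the figures reported above.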

Variable types

Text: 2
Numeric: 1
DateTime: 1
Boolean: 1

Dataset

Description: [0/1] Returns the phone's battery level. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
Creator: Andrea Bontempelli, Matteo Busso, Roy Alia Asiku
Author: Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL:
Copyright: (c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentid: Experiment ID
userid: User ID
timestamp: Shows month (2 digits), day (2), hour (2), minute (2), second (2), decimals (3)
source: The charge source name
status: Whether the battery is charging

Alerts

experimentid has constant value "wenetIndia" (Constant)
source has 2833 (22.4%) missing values (Missing)
timestamp has unique values (Unique)
userid has 3375 (26.7%) zeros (Zeros)
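
The four alert types listed above can be illustrated with simple per-column checks. This is a hedged sketch, not ydata-profiling's implementation: the function `find_alerts`, its rules, and the toy data are assumptions made for illustration only.

```python
def find_alerts(columns):
    """Return (column, alert) pairs for a dict of column name -> list of values.

    Hypothetical rules, chosen to mirror the alert names in the report:
    Constant = one distinct non-missing value, Missing = any None,
    Unique = all values distinct, Zeros = any numeric zero.
    """
    alerts = []
    for name, values in columns.items():
        n = len(values)
        present = [v for v in values if v is not None]
        distinct = set(present)
        if present and len(distinct) == 1:
            alerts.append((name, "Constant"))
        if len(present) < n:
            alerts.append((name, "Missing"))
        if n > 1 and len(present) == n and len(distinct) == n:
            alerts.append((name, "Unique"))
        # Booleans are excluded so that False is not counted as a zero.
        if any(isinstance(v, (int, float)) and not isinstance(v, bool) and v == 0
               for v in present):
            alerts.append((name, "Zeros"))
    return alerts


# Toy data shaped like the profiled dataset (values are made up).
toy = {
    "experimentid": ["wenetIndia"] * 4,
    "userid": [0, 0, 9, 12],
    "timestamp": ["t1", "t2", "t3", "t4"],
    "source": ["charging_ac", None, "charging_ac", "charging_usb"],
    "status": [True, False, True, True],
}
alerts = find_alerts(toy)
for column, alert in alerts:
    print(column, alert)
```

On the toy data this flags the same four columns as the report, with `status` raising no alert.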

Reproduction

Analysis started: 2024-11-22 12:32:40.667719
Analysis finished: 2024-11-22 12:32:40.918150
Duration: 0.25 seconds
Software version: ydata-profiling v4.8.3
Download configuration: config.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 222.7 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 126630
Distinct characters: 8
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: wenetIndia
2nd row: wenetIndia
3rd row: wenetIndia
4th row: wenetIndia
5th row: wenetIndia

Value        Count   Frequency (%)
wenetindia   12663   100.0%

Most occurring characters

Value   Count   Frequency (%)
e       25326   20.0%
n       25326   20.0%
w       12663   10.0%
t       12663   10.0%
I       12663   10.0%
d       12663   10.0%
i       12663   10.0%
a       12663   10.0%
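
Because the column holds a single constant value, the character counts can be reproduced from that one string. A small sketch using the standard library, assuming the counts are simply per-string character counts multiplied by the number of rows:

```python
from collections import Counter

value = "wenetIndia"   # the constant value of experimentid
n_rows = 12663         # number of observations

# Count characters in one value, then scale by the number of rows.
per_value = Counter(value)
totals = {ch: count * n_rows for ch, count in per_value.items()}
total_chars = sum(totals.values())

for ch, count in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(ch, count, f"{100 * count / total_chars:.1f}%")
```

Upper-case `I` and lower-case `i` are counted separately, which is why the string has 8 distinct characters.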

Most occurring categories

Value       Count    Frequency (%)
(unknown)   126630   100.0%

Most occurring scripts

Value       Count    Frequency (%)
(unknown)   126630   100.0%

Most occurring blocks

Value       Count    Frequency (%)
(unknown)   126630   100.0%

userid
Real number (ℝ)

ZEROS 

User id

Distinct: 20
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.56408434
Minimum: 0
Maximum: 62
Zeros: 3375
Zeros (%): 26.7%
Negative: 0
Negative (%): 0.0%
Memory size: 99.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 12
Q3: 17
95-th percentile: 35
Maximum: 62
Range: 62
Interquartile range (IQR): 17

Descriptive statistics

Standard deviation: 12.00430919
Coefficient of variation (CV): 0.9554464028
Kurtosis: 3.416559072
Mean: 12.56408434
Median Absolute Deviation (MAD): 5
Skewness: 1.581726873
Sum: 159099
Variance: 144.1034391
Monotonicity: Increasing
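
Several of these statistics are related by simple identities, which gives a quick consistency check. A minimal sketch, assuming the usual definitions (CV as standard deviation over mean, variance as the squared standard deviation, sum as mean times the number of observations):

```python
# Figures copied from the userid statistics.
n_observations = 12663
mean = 12.56408434
std = 12.00430919

cv = std / mean                # coefficient of variation
variance = std ** 2            # variance is the squared standard deviation
total = mean * n_observations  # sum of all values

print(f"CV: {cv:.10f}")
print(f"Variance: {variance:.7f}")
print(f"Sum: {total:.0f}")
```

The results agree with the reported CV, variance, and sum up to rounding of the inputs.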
Histogram with fixed size bins (bins=20)
Value               Count   Frequency (%)
12                  3800    30.0%
0                   3375    26.7%
9                   1914    15.1%
26                  1439    11.4%
17                  1126    8.9%
43                  203     1.6%
35                  156     1.2%
44                  153     1.2%
62                  122     1.0%
57                  100     0.8%
Other values (10)   275     2.2%

Value   Count   Frequency (%)
0       3375    26.7%
4       45      0.4%
8       50      0.4%
9       1914    15.1%
12      3800    30.0%

Value   Count   Frequency (%)
62      122     1.0%
57      100     0.8%
49      37      0.3%
46      2       < 0.1%
44      153     1.2%

timestamp
Date

UNIQUE 

Shows month (2 digits), day (2), hour (2), minute (2), second (2), decimals (3)

Distinct: 12663
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 99.1 KiB
Minimum: 2021-07-12 09:04:15.578000
Maximum: 2021-08-12 14:33:21.483000
Histogram with fixed size bins (bins=50)
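
The minimum and maximum timestamps give the length of the collection window. A short sketch with the standard library, assuming the timestamps are naive values in the format printed in the report:

```python
from datetime import datetime

# Format of the timestamps as printed in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

start = datetime.strptime("2021-07-12 09:04:15.578000", FMT)
end = datetime.strptime("2021-08-12 14:33:21.483000", FMT)

# timedelta stores days, seconds and microseconds separately.
span = end - start
print(f"Collection window: {span.days} days, {span.seconds} s, {span.microseconds} us")
```

The window covers 31 full days plus roughly five and a half hours.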

source
Text

MISSING 

The charge source name

Distinct: 3
Distinct (%): < 0.1%
Missing: 2833
Missing (%): 22.4%
Memory size: 207.2 KiB

Length

Max length: 16
Median length: 11
Mean length: 11.10844354
Min length: 11

Characters and Unicode

Total characters: 109196
Distinct characters: 14
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: charging_ac
2nd row: charging_ac
3rd row: charging_ac
4th row: charging_ac
5th row: charging_ac

Value              Count   Frequency (%)
charging_ac        9456    96.2%
charging_usb       201     2.0%
charging_unknown   173     1.8%
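
The length and character totals for `source` can be recovered from the three value counts. A minimal sketch, assuming missing values are excluded from the length statistics (the dictionary below simply restates the counts from the table):

```python
# Value counts copied from the table above.
counts = {
    "charging_ac": 9456,
    "charging_usb": 201,
    "charging_unknown": 173,
}
n_observations = 12663

n_present = sum(counts.values())
n_missing = n_observations - n_present

# Total characters = string length times number of occurrences, summed.
total_chars = sum(len(value) * count for value, count in counts.items())
mean_length = total_chars / n_present

print(f"Non-missing: {n_present}, missing: {n_missing}")
print(f"Total characters: {total_chars}")
print(f"Mean length: {mean_length:.8f}")
```

The results match the reported missing count, total characters, and mean length.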

Most occurring characters

Value              Count   Frequency (%)
g                  19660   18.0%
c                  19286   17.7%
a                  19286   17.7%
n                  10349   9.5%
h                  9830    9.0%
r                  9830    9.0%
i                  9830    9.0%
_                  9830    9.0%
u                  374     0.3%
s                  201     0.2%
Other values (4)   720     0.7%

Most occurring categories

Value       Count    Frequency (%)
(unknown)   109196   100.0%

Most occurring scripts

Value       Count    Frequency (%)
(unknown)   109196   100.0%

Most occurring blocks

Value       Count    Frequency (%)
(unknown)   109196   100.0%

status
Boolean

Whether the battery is charging

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.5 KiB

Value   Count   Frequency (%)
True    9830    77.6%
False   2833    22.4%

Correlations

         status   userid
status    1.000   -0.091
userid   -0.091    1.000