Overview

Dataset statistics

Number of variables5
Number of observations256875
Missing cells74713
Missing cells (%)5.8%
Total size in memory12.6 MiB
Average record size in memory51.3 B

Variable types

Text2
Numeric1
DateTime1
Boolean1

Dataset

Description[0/1] Returns the phone's battery level. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
sourceThe charge source name
statusReturn if the battery is charging

Alerts

experimentid has constant value "wenetItaly"Constant
source has 74713 (29.1%) missing valuesMissing

Reproduction

Analysis started2024-11-23 06:05:17.978546
Analysis finished2024-11-23 06:05:19.205048
Duration1.23 second
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size4.4 MiB
2024-11-23T07:05:19.299146image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters2568750
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetItaly
2nd rowwenetItaly
3rd rowwenetItaly
4th rowwenetItaly
5th rowwenetItaly
ValueCountFrequency (%)
wenetitaly 256875
100.0%
2024-11-23T07:05:19.523271image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 513750
20.0%
t 513750
20.0%
w 256875
10.0%
n 256875
10.0%
I 256875
10.0%
a 256875
10.0%
l 256875
10.0%
y 256875
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2568750
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 513750
20.0%
t 513750
20.0%
w 256875
10.0%
n 256875
10.0%
I 256875
10.0%
a 256875
10.0%
l 256875
10.0%
y 256875
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2568750
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 513750
20.0%
t 513750
20.0%
w 256875
10.0%
n 256875
10.0%
I 256875
10.0%
a 256875
10.0%
l 256875
10.0%
y 256875
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2568750
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 513750
20.0%
t 513750
20.0%
w 256875
10.0%
n 256875
10.0%
I 256875
10.0%
a 256875
10.0%
l 256875
10.0%
y 256875
10.0%

userid
Real number (ℝ)

User id

Distinct219
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean128.0284925
Minimum0
Maximum265
Zeros45
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size2.0 MiB
2024-11-23T07:05:19.646312image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q159
median126
Q3212
95-th percentile254
Maximum265
Range265
Interquartile range (IQR)153

Descriptive statistics

Standard deviation82.18000846
Coefficient of variation (CV)0.6418884335
Kurtosis-1.279889334
Mean128.0284925
Median Absolute Deviation (MAD)72
Skewness0.09631016219
Sum32887319
Variance6753.553791
MonotonicityIncreasing
2024-11-23T07:05:19.813825image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
75 21808
 
8.5%
217 20598
 
8.0%
126 19847
 
7.7%
2 15422
 
6.0%
252 11052
 
4.3%
65 5384
 
2.1%
18 4960
 
1.9%
258 4537
 
1.8%
103 4288
 
1.7%
163 3675
 
1.4%
Other values (209) 145304
56.6%
ValueCountFrequency (%)
0 45
 
< 0.1%
1 725
 
0.3%
2 15422
6.0%
3 152
 
0.1%
4 367
 
0.1%
ValueCountFrequency (%)
265 1670
0.7%
264 8
 
< 0.1%
263 1383
0.5%
262 254
 
0.1%
260 51
 
< 0.1%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct256859
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size2.0 MiB
Minimum2020-11-16 07:01:50.738000
Maximum2020-12-11 21:58:01.238000
2024-11-23T07:05:19.936666image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T07:05:20.055797image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

source
Text

MISSING 

The charge source name

Distinct4
Distinct (%)< 0.1%
Missing74713
Missing (%)29.1%
Memory size4.0 MiB
2024-11-23T07:05:20.139487image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length16
Median length11
Mean length11.59010661
Min length11

Characters and Unicode

Total characters2111277
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowcharging_unknown
2nd rowcharging_ac
3rd rowcharging_ac
4th rowcharging_ac
5th rowcharging_ac
ValueCountFrequency (%)
charging_ac 142381
78.2%
charging_usb 14106
 
7.7%
charging_unknown 14013
 
7.7%
charging_wifi 11662
 
6.4%
2024-11-23T07:05:20.328037image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
g 364324
17.3%
c 324543
15.4%
a 324543
15.4%
n 224201
10.6%
i 205486
9.7%
h 182162
8.6%
r 182162
8.6%
_ 182162
8.6%
u 28119
 
1.3%
w 25675
 
1.2%
Other values (5) 67900
 
3.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2111277
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
g 364324
17.3%
c 324543
15.4%
a 324543
15.4%
n 224201
10.6%
i 205486
9.7%
h 182162
8.6%
r 182162
8.6%
_ 182162
8.6%
u 28119
 
1.3%
w 25675
 
1.2%
Other values (5) 67900
 
3.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2111277
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
g 364324
17.3%
c 324543
15.4%
a 324543
15.4%
n 224201
10.6%
i 205486
9.7%
h 182162
8.6%
r 182162
8.6%
_ 182162
8.6%
u 28119
 
1.3%
w 25675
 
1.2%
Other values (5) 67900
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2111277
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
g 364324
17.3%
c 324543
15.4%
a 324543
15.4%
n 224201
10.6%
i 205486
9.7%
h 182162
8.6%
r 182162
8.6%
_ 182162
8.6%
u 28119
 
1.3%
w 25675
 
1.2%
Other values (5) 67900
 
3.2%

status
Boolean

Return if the battery is charging

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size251.0 KiB
True
182162 
False
74713 
ValueCountFrequency (%)
True 182162
70.9%
False 74713
29.1%
2024-11-23T07:05:20.441142image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Correlations

2024-11-23T07:05:20.494955image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
statususerid
status1.000-0.023
userid-0.0231.000