Overview

Dataset statistics

Number of variables5
Number of observations26078
Missing cells6765
Missing cells (%)5.2%
Total size in memory1.3 MiB
Average record size in memory53.8 B

Variable types

Text2
Numeric1
DateTime1
Boolean1

Dataset

Description[0/1] Returns the phone's battery level. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
sourceThe charge source name
statusReturn if the battery is charging

Alerts

experimentid has constant value "wenetDenmark"Constant
source has 6765 (25.9%) missing valuesMissing
timestamp has unique valuesUnique
userid has 1079 (4.1%) zerosZeros

Reproduction

Analysis started2024-11-23 01:51:32.174614
Analysis finished2024-11-23 01:51:32.434187
Duration0.26 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size509.5 KiB
2024-11-23T02:51:32.534026image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters312936
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetDenmark
2nd rowwenetDenmark
3rd rowwenetDenmark
4th rowwenetDenmark
5th rowwenetDenmark
ValueCountFrequency (%)
wenetdenmark 26078
100.0%
2024-11-23T02:51:32.754165image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 78234
25.0%
n 52156
16.7%
w 26078
 
8.3%
t 26078
 
8.3%
D 26078
 
8.3%
m 26078
 
8.3%
a 26078
 
8.3%
r 26078
 
8.3%
k 26078
 
8.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 312936
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 78234
25.0%
n 52156
16.7%
w 26078
 
8.3%
t 26078
 
8.3%
D 26078
 
8.3%
m 26078
 
8.3%
a 26078
 
8.3%
r 26078
 
8.3%
k 26078
 
8.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 312936
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 78234
25.0%
n 52156
16.7%
w 26078
 
8.3%
t 26078
 
8.3%
D 26078
 
8.3%
m 26078
 
8.3%
a 26078
 
8.3%
r 26078
 
8.3%
k 26078
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 312936
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 78234
25.0%
n 52156
16.7%
w 26078
 
8.3%
t 26078
 
8.3%
D 26078
 
8.3%
m 26078
 
8.3%
a 26078
 
8.3%
r 26078
 
8.3%
k 26078
 
8.3%

userid
Real number (ℝ)

ZEROS 

User id

Distinct17
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.10230846
Minimum0
Maximum27
Zeros1079
Zeros (%)4.1%
Negative0
Negative (%)0.0%
Memory size203.9 KiB
2024-11-23T02:51:32.859187image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q13
median21
Q323
95-th percentile26
Maximum27
Range27
Interquartile range (IQR)20

Descriptive statistics

Standard deviation9.332258015
Coefficient of variation (CV)0.6179358632
Kurtosis-1.470904492
Mean15.10230846
Median Absolute Deviation (MAD)4
Skewness-0.528338024
Sum393838
Variance87.09103965
MonotonicityIncreasing
2024-11-23T02:51:32.955761image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
23 5950
22.8%
21 5516
21.2%
2 4400
16.9%
17 3111
11.9%
3 2101
 
8.1%
26 1978
 
7.6%
6 1229
 
4.7%
0 1079
 
4.1%
25 218
 
0.8%
27 143
 
0.5%
Other values (7) 353
 
1.4%
ValueCountFrequency (%)
0 1079
 
4.1%
2 4400
16.9%
3 2101
8.1%
6 1229
 
4.7%
8 124
 
0.5%
ValueCountFrequency (%)
27 143
 
0.5%
26 1978
 
7.6%
25 218
 
0.8%
23 5950
22.8%
22 25
 
0.1%

timestamp
Date

UNIQUE 

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct26078
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size203.9 KiB
Minimum2020-11-16 07:00:11.548000
Maximum2020-12-11 21:57:17.497000
2024-11-23T02:51:33.074948image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T02:51:33.196921image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

source
Text

MISSING 

The charge source name

Distinct4
Distinct (%)< 0.1%
Missing6765
Missing (%)25.9%
Memory size428.2 KiB
2024-11-23T02:51:33.284632image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length16
Median length11
Mean length11.72334697
Min length11

Characters and Unicode

Total characters226413
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowcharging_ac
2nd rowcharging_ac
3rd rowcharging_ac
4th rowcharging_ac
5th rowcharging_ac
ValueCountFrequency (%)
charging_ac 14390
74.5%
charging_wifi 3103
 
16.1%
charging_unknown 1486
 
7.7%
charging_usb 334
 
1.7%
2024-11-23T02:51:33.558390image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
g 38626
17.1%
c 33703
14.9%
a 33703
14.9%
i 25519
11.3%
n 23771
10.5%
h 19313
8.5%
r 19313
8.5%
_ 19313
8.5%
w 4589
 
2.0%
f 3103
 
1.4%
Other values (5) 5460
 
2.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 226413
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
g 38626
17.1%
c 33703
14.9%
a 33703
14.9%
i 25519
11.3%
n 23771
10.5%
h 19313
8.5%
r 19313
8.5%
_ 19313
8.5%
w 4589
 
2.0%
f 3103
 
1.4%
Other values (5) 5460
 
2.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 226413
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
g 38626
17.1%
c 33703
14.9%
a 33703
14.9%
i 25519
11.3%
n 23771
10.5%
h 19313
8.5%
r 19313
8.5%
_ 19313
8.5%
w 4589
 
2.0%
f 3103
 
1.4%
Other values (5) 5460
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 226413
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
g 38626
17.1%
c 33703
14.9%
a 33703
14.9%
i 25519
11.3%
n 23771
10.5%
h 19313
8.5%
r 19313
8.5%
_ 19313
8.5%
w 4589
 
2.0%
f 3103
 
1.4%
Other values (5) 5460
 
2.4%

status
Boolean

Return if the battery is charging

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size25.6 KiB
True
19313 
False
6765 
ValueCountFrequency (%)
True 19313
74.1%
False 6765
 
25.9%
2024-11-23T02:51:33.658527image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Correlations

2024-11-23T02:51:33.708532image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
statususerid
status1.000-0.007
userid-0.0071.000