Overview

Dataset statistics

Number of variables4
Number of observations46915
Missing cells0
Missing cells (%)0.0%
Total size in memory1.7 MiB
Average record size in memory37.0 B

Variable types

Text1
Numeric1
DateTime1
Boolean1

Dataset

Description[0/1] Returns whether the phone's doze mode is on or off. Doze mode is a low battery consumption state in which the phone enters after some time of not being used. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
statusThe status of whether the mode is on or off

Alerts

experimentid has constant value "wenetDenmark"Constant
timestamp has unique valuesUnique
userid has 862 (1.8%) zerosZeros

Reproduction

Analysis started2024-11-23 01:50:58.893817
Analysis finished2024-11-23 01:50:59.156831
Duration0.26 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size916.4 KiB
2024-11-23T02:50:59.262705image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters562980
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetDenmark
2nd rowwenetDenmark
3rd rowwenetDenmark
4th rowwenetDenmark
5th rowwenetDenmark
ValueCountFrequency (%)
wenetdenmark 46915
100.0%
2024-11-23T02:50:59.628314image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 140745
25.0%
n 93830
16.7%
w 46915
 
8.3%
t 46915
 
8.3%
D 46915
 
8.3%
m 46915
 
8.3%
a 46915
 
8.3%
r 46915
 
8.3%
k 46915
 
8.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 562980
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 140745
25.0%
n 93830
16.7%
w 46915
 
8.3%
t 46915
 
8.3%
D 46915
 
8.3%
m 46915
 
8.3%
a 46915
 
8.3%
r 46915
 
8.3%
k 46915
 
8.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 562980
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 140745
25.0%
n 93830
16.7%
w 46915
 
8.3%
t 46915
 
8.3%
D 46915
 
8.3%
m 46915
 
8.3%
a 46915
 
8.3%
r 46915
 
8.3%
k 46915
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 562980
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 140745
25.0%
n 93830
16.7%
w 46915
 
8.3%
t 46915
 
8.3%
D 46915
 
8.3%
m 46915
 
8.3%
a 46915
 
8.3%
r 46915
 
8.3%
k 46915
 
8.3%

userid
Real number (ℝ)

ZEROS 

User id

Distinct16
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.95135884
Minimum0
Maximum27
Zeros862
Zeros (%)1.8%
Negative0
Negative (%)0.0%
Memory size366.7 KiB
2024-11-23T02:50:59.736238image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q16
median6
Q317
95-th percentile23
Maximum27
Range27
Interquartile range (IQR)11

Descriptive statistics

Standard deviation6.736733201
Coefficient of variation (CV)0.7525933571
Kurtosis-0.3337426433
Mean8.95135884
Median Absolute Deviation (MAD)0
Skewness1.044069494
Sum419953
Variance45.38357422
MonotonicityIncreasing
2024-11-23T02:50:59.831846image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
6 23706
50.5%
3 8584
 
18.3%
17 5370
 
11.4%
20 2976
 
6.3%
23 1840
 
3.9%
16 879
 
1.9%
0 862
 
1.8%
2 825
 
1.8%
25 580
 
1.2%
19 350
 
0.7%
Other values (6) 943
 
2.0%
ValueCountFrequency (%)
0 862
 
1.8%
2 825
 
1.8%
3 8584
 
18.3%
6 23706
50.5%
12 14
 
< 0.1%
ValueCountFrequency (%)
27 336
 
0.7%
26 134
 
0.3%
25 580
 
1.2%
23 1840
3.9%
22 16
 
< 0.1%

timestamp
Date

UNIQUE 

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct46915
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size366.7 KiB
Minimum2020-11-16 07:28:41.046000
Maximum2020-12-11 21:40:38.835000
2024-11-23T02:50:59.952430image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T02:51:00.076809image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

status
Boolean

The status of whether the mode is on or off

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size45.9 KiB
False
28764 
True
18151 
ValueCountFrequency (%)
False 28764
61.3%
True 18151
38.7%
2024-11-23T02:51:00.167197image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Correlations

2024-11-23T02:51:00.217808image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
statususerid
status1.0000.094
userid0.0941.000