Overview

Dataset statistics

Number of variables: 4
Number of observations: 8505491
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 405.6 MiB
Average record size in memory: 50.0 B
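
The average record size follows from the total size and the number of observations. The short check below is a sketch that assumes 1 MiB = 1024² bytes; the function name is illustrative and not part of the profiling tool.

```python
MIB = 1024 ** 2  # bytes per mebibyte (assumed unit of "MiB" in the report)


def average_record_size(total_mib: float, n_rows: int) -> float:
    """Return the mean number of bytes per observation."""
    return total_mib * MIB / n_rows


# Figures taken from the dataset statistics above.
avg_bytes = average_record_size(405.6, 8_505_491)
print(f"{avg_bytes:.1f} B")
```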

Variable types

Text: 1
Numeric: 2
DateTime: 1

Dataset

Description: [Steps] The step counter sensor is used to get the total number of steps taken by the user since the last reboot (power on) of the phone. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
Creator: Andrea Bontempelli, Matteo Busso, Roy Alia Asiku
Author: Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL:
Copyright: (c) University of Trento - Knowledge Diversity 2023
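
Because the sensor reports a running total since the last reboot, per-interval step counts have to be derived from differences between consecutive readings. The sketch below shows one way to do that. The rule for handling a reboot (treat any drop in the counter as a restart from zero) is an assumption, not something the dataset description specifies, and `step_increments` is a hypothetical helper name.

```python
def step_increments(readings):
    """Convert cumulative step-counter readings into per-interval steps.

    Assumption: a reading lower than the previous one means the phone
    rebooted and the counter restarted from zero, so the new reading is
    itself the number of steps since the restart.
    """
    increments = []
    previous = None
    for current in readings:
        if previous is None:
            increments.append(0)  # no baseline for the first reading
        elif current >= previous:
            increments.append(current - previous)
        else:
            increments.append(current)  # counter reset (assumed reboot)
        previous = current
    return increments


# Illustrative readings, not taken from the dataset.
example = step_increments([1000, 1040, 1040, 1100, 30, 90])
print(example)  # [0, 40, 0, 60, 30, 60]
```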

Variable descriptions

experimentid: Experiment Id
userid: User id
timestamp: show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
value: The number of steps
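
The description mentions that the sampling frequency was reduced to one minute, but not how. A minimal sketch of one possible reduction is shown below: it floors each timestamp to the minute and keeps the last reading per user and minute. Both the "keep last" rule and the timestamp layout (copied from how the report renders the timestamp minimum and maximum) are assumptions, and `downsample_to_minute` is a hypothetical name.

```python
from datetime import datetime

# Layout assumed from the rendered values, e.g. "2020-11-16 07:00:00.140000".
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def downsample_to_minute(rows):
    """Keep the last value per (userid, minute).

    `rows` is an iterable of (userid, timestamp_string, value) tuples,
    assumed to be sorted by time within each user.
    """
    latest = {}
    for userid, ts, value in rows:
        parsed = datetime.strptime(ts, TIMESTAMP_FORMAT)
        minute = parsed.replace(second=0, microsecond=0)
        latest[(userid, minute)] = value  # later rows overwrite earlier ones
    return latest


# Illustrative rows, not taken from the dataset.
rows = [
    (229, "2020-11-16 07:00:00.140000", 100),
    (229, "2020-11-16 07:00:45.000000", 160),
    (229, "2020-11-16 07:01:10.500000", 200),
]
per_minute = downsample_to_minute(rows)
print(len(per_minute))
```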

Alerts

experimentid has constant value "wenetItaly" (Constant)

Reproduction

Analysis started: 2024-11-23 08:32:43.078876
Analysis finished: 2024-11-23 08:33:14.095140
Duration: 31.02 seconds
Software version: ydata-profiling v4.8.3
Download configuration: config.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 210.9 MiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 85054910
Distinct characters: 8
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: wenetItaly
2nd row: wenetItaly
3rd row: wenetItaly
4th row: wenetItaly
5th row: wenetItaly
Value       Count    Frequency (%)
wenetitaly  8505491  100.0%

Most occurring characters

Value  Count     Frequency (%)
e      17010982  20.0%
t      17010982  20.0%
w      8505491   10.0%
n      8505491   10.0%
I      8505491   10.0%
a      8505491   10.0%
l      8505491   10.0%
y      8505491   10.0%
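
The character shares above can be reproduced from the single constant value. The sketch below counts characters with `collections.Counter` and looks up each character's Unicode general category with `unicodedata.category`. It illustrates the kind of tally being reported and does not claim to match the profiler's internal implementation; `char_frequencies` is a hypothetical name.

```python
from collections import Counter
import unicodedata


def char_frequencies(text):
    """Return {character: (count, percentage)} ordered by descending count."""
    counts = Counter(text)
    total = sum(counts.values())
    return {ch: (n, 100.0 * n / total) for ch, n in counts.most_common()}


# Every row holds the same string, so one copy gives the same percentages.
freqs = char_frequencies("wenetItaly")
for ch, (n, pct) in freqs.items():
    print(ch, n, f"{pct:.1f}%", unicodedata.category(ch))
```

Multiplying each count by the number of observations (8505491) gives the totals in the table, for example 2 × 8505491 = 17010982 for `e`.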

Most occurring categories

Value      Count     Frequency (%)
(unknown)  85054910  100.0%

Most frequent character per category

(unknown)
Value  Count     Frequency (%)
e      17010982  20.0%
t      17010982  20.0%
w      8505491   10.0%
n      8505491   10.0%
I      8505491   10.0%
a      8505491   10.0%
l      8505491   10.0%
y      8505491   10.0%

Most occurring scripts

Value      Count     Frequency (%)
(unknown)  85054910  100.0%

Most frequent character per script

(unknown)
Value  Count     Frequency (%)
e      17010982  20.0%
t      17010982  20.0%
w      8505491   10.0%
n      8505491   10.0%
I      8505491   10.0%
a      8505491   10.0%
l      8505491   10.0%
y      8505491   10.0%

Most occurring blocks

Value      Count     Frequency (%)
(unknown)  85054910  100.0%

Most frequent character per block

(unknown)
Value  Count     Frequency (%)
e      17010982  20.0%
t      17010982  20.0%
w      8505491   10.0%
n      8505491   10.0%
I      8505491   10.0%
a      8505491   10.0%
l      8505491   10.0%
y      8505491   10.0%

userid
Real number (ℝ)

User id

Distinct: 175
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 177.9748758
Minimum: 0
Maximum: 264
Zeros: 2389
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 129.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 26
Q1: 126
median: 229
Q3: 229
95-th percentile: 236
Maximum: 264
Range: 264
Interquartile range (IQR): 103

Descriptive statistics

Standard deviation: 72.83348447
Coefficient of variation (CV): 0.4092346415
Kurtosis: -0.2688841676
Mean: 177.9748758
Median Absolute Deviation (MAD): 13
Skewness: -1.071219847
Sum: 1513763704
Variance: 5304.71646
Monotonicity: Increasing
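
Two of the entries above are less familiar than the rest: the median absolute deviation (the median of the absolute distances from the median) and the monotonicity flag. The sketch below computes both on a small made-up sample. It assumes "Increasing" means non-strictly increasing (ties allowed); the function names are illustrative.

```python
from statistics import median


def median_absolute_deviation(values):
    """Median of |x - median(values)| over all x in values."""
    center = median(values)
    return median([abs(v - center) for v in values])


def is_monotonic_increasing(values):
    """True if every element is >= its predecessor (ties allowed)."""
    return all(a <= b for a, b in zip(values, values[1:]))


# Illustrative sample only; these are not rows from the dataset.
sample_ids = [26, 126, 229, 229, 229, 236, 264]
mad = median_absolute_deviation(sample_ids)
increasing = is_monotonic_increasing(sample_ids)
print(mad, increasing)  # 7 True
```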
Histogram with fixed size bins (bins=50)
Common values
Value               Count    Frequency (%)
229                 3968860  46.7%
191                 270904   3.2%
27                  173227   2.0%
60                  120748   1.4%
148                 113939   1.3%
163                 102664   1.2%
19                  95382    1.1%
42                  85060    1.0%
109                 84711    1.0%
78                  84297    1.0%
Other values (165)  3405699  40.0%

Minimum 5 values
Value  Count  Frequency (%)
0      2389   < 0.1%
2      40022  0.5%
3      2930   < 0.1%
4      19232  0.2%
6      19508  0.2%

Maximum 5 values
Value  Count  Frequency (%)
264    7985   0.1%
263    29977  0.4%
262    38886  0.5%
259    7158   0.1%
257    32219  0.4%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct: 8488514
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Memory size: 129.8 MiB
Minimum: 2020-11-16 07:00:00.140000
Maximum: 2020-12-11 21:59:57.281000
Histogram with fixed size bins (bins=50)
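
The histograms in this report use 50 fixed-size (equal-width) bins. As a rough sketch of that binning, assuming the bins span the minimum to the maximum and the maximum falls into the last bin, the counts could be computed as follows; `fixed_width_histogram` is a hypothetical helper, not the profiler's own code.

```python
def fixed_width_histogram(values, bins=50):
    """Count values in `bins` equal-width intervals spanning [min, max].

    Returns (counts, bin_width). The maximum value is placed in the last bin.
    """
    lo, hi = min(values), max(values)
    counts = [0] * bins
    if lo == hi:  # constant data: everything lands in the first bin
        counts[0] = len(values)
        return counts, 0.0
    width = (hi - lo) / bins
    for v in values:
        index = min(int((v - lo) / width), bins - 1)
        counts[index] += 1
    return counts, width


# Illustrative data: the integers 0..100.
sample = list(range(101))
counts, width = fixed_width_histogram(sample, bins=50)
print(width, counts[0], counts[-1], sum(counts))  # 2.0 2 3 101
```

For timestamps, the same approach applies after converting each value to a number, for example seconds since the first observation.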

value
Real number (ℝ)

The number of steps

Distinct: 176968
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1832296.271
Minimum: 0
Maximum: 25594200
Zeros: 30873
Zeros (%): 0.4%
Negative: 0
Negative (%): 0.0%
Memory size: 129.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 43300
Q1: 278300
median: 671900
Q3: 1835500
95-th percentile: 8615800
Maximum: 25594200
Range: 25594200
Interquartile range (IQR): 1557200

Descriptive statistics

Standard deviation: 2919406.219
Coefficient of variation (CV): 1.593304678
Kurtosis: 10.46052885
Mean: 1832296.271
Median Absolute Deviation (MAD): 557200
Skewness: 2.970513869
Sum: 1.558457944 × 10^13
Variance: 8.522932674 × 10^12
Monotonicity: Not monotonic
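
Several of these statistics are derived from one another, which gives a quick consistency check on the reported figures: IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation², and sum = mean × number of observations. The sketch below recomputes them from the rounded values printed in this report, so agreement is only expected up to rounding.

```python
# Values copied from the report for the `value` variable.
n = 8_505_491
mean = 1_832_296.271
std = 2_919_406.219
q1, q3 = 278_300, 1_835_500

iqr = q3 - q1          # interquartile range
cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation
total = mean * n       # sum of all values

print(iqr)
print(f"{cv:.6f}")
print(f"{variance:.4e}")
print(f"{total:.4e}")
```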
Histogram with fixed size bins (bins=50)
Common values
Value                  Count    Frequency (%)
71900                  135745   1.6%
3711400                112150   1.3%
637700                 111525   1.3%
671900                 94918    1.1%
1149900                91787    1.1%
300100                 88164    1.0%
667800                 86088    1.0%
411000                 84249    1.0%
1175700                81653    1.0%
338500                 78244    0.9%
Other values (176958)  7540968  88.7%

Minimum 5 values
Value  Count  Frequency (%)
0      30873  0.4%
100    272    < 0.1%
200    364    < 0.1%
300    486    < 0.1%
400    234    < 0.1%

Maximum 5 values
Value     Count  Frequency (%)
25594200  11     < 0.1%
25593500  1      < 0.1%
25593200  1      < 0.1%
25591200  1      < 0.1%
25587700  3      < 0.1%

Correlations

        userid  value
userid  1.000   -0.322
value   -0.322  1.000
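
The report does not state which correlation coefficient produced this matrix. As an illustration of how a pairwise coefficient is computed, the sketch below implements Pearson's r in plain Python. The choice of Pearson is an assumption made for the example, and the profiler may use a different measure.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return covariance / (spread_x * spread_y)


# Illustrative data: a perfectly decreasing linear relationship.
r = pearson_r([1, 2, 3, 4], [8, 6, 4, 2])
print(round(r, 3))  # -1.0
```

Since userid is an identifier rather than a measurement, a coefficient between userid and value mostly reflects how the identifiers happen to be assigned.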