Overview

Dataset statistics

Number of variables3
Number of observations410508
Missing cells0
Missing cells (%)0.0%
Total size in memory14.1 MiB
Average record size in memory36.0 B

Variable types

Text1
Numeric1
DateTime1

Dataset

Description[Step] The step detector sensor collects an event each time a step is taken by the user. The value reported by the sensor is always one, the fractional part being always zero, and the event timestamp is the time when the user’s foot hit the ground. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Alerts

experimentid has constant value "wenetDenmark"Constant
userid has 36640 (8.9%) zerosZeros

Reproduction

Analysis started2024-11-23 01:51:53.115910
Analysis finished2024-11-23 01:51:54.491538
Duration1.38 second
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size7.8 MiB
2024-11-23T02:51:54.567075image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters4926096
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetDenmark
2nd rowwenetDenmark
3rd rowwenetDenmark
4th rowwenetDenmark
5th rowwenetDenmark
ValueCountFrequency (%)
wenetdenmark 410508
100.0%
2024-11-23T02:51:54.817426image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 1231524
25.0%
n 821016
16.7%
w 410508
 
8.3%
t 410508
 
8.3%
D 410508
 
8.3%
m 410508
 
8.3%
a 410508
 
8.3%
r 410508
 
8.3%
k 410508
 
8.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 4926096
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 1231524
25.0%
n 821016
16.7%
w 410508
 
8.3%
t 410508
 
8.3%
D 410508
 
8.3%
m 410508
 
8.3%
a 410508
 
8.3%
r 410508
 
8.3%
k 410508
 
8.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 4926096
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 1231524
25.0%
n 821016
16.7%
w 410508
 
8.3%
t 410508
 
8.3%
D 410508
 
8.3%
m 410508
 
8.3%
a 410508
 
8.3%
r 410508
 
8.3%
k 410508
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 4926096
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 1231524
25.0%
n 821016
16.7%
w 410508
 
8.3%
t 410508
 
8.3%
D 410508
 
8.3%
m 410508
 
8.3%
a 410508
 
8.3%
r 410508
 
8.3%
k 410508
 
8.3%

userid
Real number (ℝ)

ZEROS 

User id

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.01571224
Minimum0
Maximum27
Zeros36640
Zeros (%)8.9%
Negative0
Negative (%)0.0%
Memory size3.1 MiB
2024-11-23T02:51:54.918041image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16
median17
Q317
95-th percentile27
Maximum27
Range27
Interquartile range (IQR)11

Descriptive statistics

Standard deviation8.050025539
Coefficient of variation (CV)0.6184852116
Kurtosis-1.116572444
Mean13.01571224
Median Absolute Deviation (MAD)4
Skewness-0.3071962849
Sum5343054
Variance64.80291117
MonotonicityIncreasing
2024-11-23T02:51:55.007585image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
17 197845
48.2%
6 52359
 
12.8%
2 47569
 
11.6%
0 36640
 
8.9%
21 33116
 
8.1%
27 27343
 
6.7%
3 10738
 
2.6%
22 4571
 
1.1%
12 327
 
0.1%
ValueCountFrequency (%)
0 36640
8.9%
2 47569
11.6%
3 10738
 
2.6%
6 52359
12.8%
12 327
 
0.1%
ValueCountFrequency (%)
27 27343
 
6.7%
22 4571
 
1.1%
21 33116
 
8.1%
17 197845
48.2%
12 327
 
0.1%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct410459
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size3.1 MiB
Minimum2020-11-16 07:36:51.616000
Maximum2020-12-11 21:59:59.744000
2024-11-23T02:51:55.114240image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T02:51:55.232539image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)