Overview

Dataset statistics

Number of variables3
Number of observations126671
Missing cells0
Missing cells (%)0.0%
Total size in memory4.1 MiB
Average record size in memory34.0 B

Variable types

Text1
Numeric1
DateTime1

Dataset

Description[Step] The step detector sensor collects an event each time a step is taken by the user. The value reported by the sensor is always one, the fractional part being always zero, and the event timestamp is the time when the user’s foot hit the ground. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Alerts

experimentid has constant value "wenetIndia"Constant
userid has 7116 (5.6%) zerosZeros

Reproduction

Analysis started2024-11-22 12:32:58.168627
Analysis finished2024-11-22 12:32:58.568889
Duration0.4 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.2 MiB
2024-11-22T13:32:58.633454image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters1266710
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetIndia
2nd rowwenetIndia
3rd rowwenetIndia
4th rowwenetIndia
5th rowwenetIndia
ValueCountFrequency (%)
wenetindia 126671
100.0%
2024-11-22T13:32:58.888083image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 253342
20.0%
n 253342
20.0%
w 126671
10.0%
t 126671
10.0%
I 126671
10.0%
d 126671
10.0%
i 126671
10.0%
a 126671
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 1266710
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 253342
20.0%
n 253342
20.0%
w 126671
10.0%
t 126671
10.0%
I 126671
10.0%
d 126671
10.0%
i 126671
10.0%
a 126671
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 1266710
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 253342
20.0%
n 253342
20.0%
w 126671
10.0%
t 126671
10.0%
I 126671
10.0%
d 126671
10.0%
i 126671
10.0%
a 126671
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 1266710
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 253342
20.0%
n 253342
20.0%
w 126671
10.0%
t 126671
10.0%
I 126671
10.0%
d 126671
10.0%
i 126671
10.0%
a 126671
10.0%

userid
Real number (ℝ)

ZEROS 

User id

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.49725667
Minimum0
Maximum62
Zeros7116
Zeros (%)5.6%
Negative0
Negative (%)0.0%
Memory size989.7 KiB
2024-11-22T13:32:58.991854image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q19
median12
Q324
95-th percentile44
Maximum62
Range62
Interquartile range (IQR)15

Descriptive statistics

Standard deviation14.7360645
Coefficient of variation (CV)0.7558019444
Kurtosis0.5666364183
Mean19.49725667
Median Absolute Deviation (MAD)12
Skewness1.02518449
Sum2469737
Variance217.151597
MonotonicityIncreasing
2024-11-22T13:32:59.202225image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
24 29748
23.5%
12 23574
18.6%
9 14447
11.4%
8 12970
10.2%
35 11081
 
8.7%
4 9314
 
7.4%
0 7116
 
5.6%
43 6214
 
4.9%
44 4257
 
3.4%
62 4246
 
3.4%
Other values (8) 3704
 
2.9%
ValueCountFrequency (%)
0 7116
 
5.6%
4 9314
 
7.4%
8 12970
10.2%
9 14447
11.4%
12 23574
18.6%
ValueCountFrequency (%)
62 4246
3.4%
57 640
 
0.5%
46 92
 
0.1%
44 4257
3.4%
43 6214
4.9%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct126667
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size989.7 KiB
Minimum2021-07-12 08:27:22.454000
Maximum2021-08-12 14:39:34.739000
2024-11-22T13:32:59.325959image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-22T13:32:59.467607image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)