Overview

Dataset statistics

Number of variables: 3
Number of observations: 4949365
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 207.7 MiB
Average record size in memory: 44.0 B
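
As a quick sanity check, the average record size can be recomputed from the total memory size and the number of observations reported above. This is a minimal sketch; the variable names are illustrative and do not come from the profiling tool.

```python
# Figures copied from the "Dataset statistics" table above.
total_size_mib = 207.7
n_observations = 4_949_365

# 1 MiB = 2**20 bytes.
total_size_bytes = total_size_mib * 2**20
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```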

Variable types

Text: 1
Numeric: 1
DateTime: 1

Dataset

Description: [unitless] Returns the number of screen touches that occurred. To allow comparison across sensor observations, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
Creator: Andrea Bontempelli, Matteo Busso, Roy Alia Asiku
Author: Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL:
Copyright: (c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentid: Experiment Id
userid: User id
timestamp: show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Alerts

experimentid has constant value "wenetDenmark" (Constant)
userid has 308744 (6.2%) zeros (Zeros)

Reproduction

Analysis started: 2024-11-23 01:51:12.462510
Analysis finished: 2024-11-23 01:51:28.250681
Duration: 15.79 seconds
Software version: ydata-profiling v4.8.3
Download configuration: config.json
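
The reported duration follows directly from the two analysis timestamps. The snippet below is a small sketch using only the Python standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" section above.
started = datetime.fromisoformat("2024-11-23 01:51:12.462510")
finished = datetime.fromisoformat("2024-11-23 01:51:28.250681")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()

print(f"{duration_s:.2f} seconds")
```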

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 132.2 MiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 59392380
Distinct characters: 9
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: wenetDenmark
2nd row: wenetDenmark
3rd row: wenetDenmark
4th row: wenetDenmark
5th row: wenetDenmark

Value         Count    Frequency (%)
wenetdenmark  4949365  100.0%

Most occurring characters

Value  Count     Frequency (%)
e      14848095  25.0%
n      9898730   16.7%
w      4949365   8.3%
t      4949365   8.3%
D      4949365   8.3%
m      4949365   8.3%
a      4949365   8.3%
r      4949365   8.3%
k      4949365   8.3%
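
Because experimentid is constant, the character counts above can be derived from the single value "wenetDenmark" and the number of rows. The following is a minimal sketch with illustrative variable names, not the profiling tool's own implementation.

```python
from collections import Counter

# The only value of experimentid, and the number of observations.
value = "wenetDenmark"
n_rows = 4_949_365

# Occurrences of each character in one value, scaled by the row count.
per_value = Counter(value)
char_counts = {ch: k * n_rows for ch, k in per_value.items()}
total_chars = sum(char_counts.values())

# Print characters from most to least frequent with their share.
for ch, count in sorted(char_counts.items(), key=lambda kv: -kv[1]):
    print(ch, count, f"{100 * count / total_chars:.1f}%")
```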

Most occurring categories

Value      Count     Frequency (%)
(unknown)  59392380  100.0%

Most frequent character per category

(unknown)
Value  Count     Frequency (%)
e      14848095  25.0%
n      9898730   16.7%
w      4949365   8.3%
t      4949365   8.3%
D      4949365   8.3%
m      4949365   8.3%
a      4949365   8.3%
r      4949365   8.3%
k      4949365   8.3%

Most occurring scripts

Value      Count     Frequency (%)
(unknown)  59392380  100.0%

Most frequent character per script

(unknown)
Value  Count     Frequency (%)
e      14848095  25.0%
n      9898730   16.7%
w      4949365   8.3%
t      4949365   8.3%
D      4949365   8.3%
m      4949365   8.3%
a      4949365   8.3%
r      4949365   8.3%
k      4949365   8.3%

Most occurring blocks

Value      Count     Frequency (%)
(unknown)  59392380  100.0%

Most frequent character per block

(unknown)
Value  Count     Frequency (%)
e      14848095  25.0%
n      9898730   16.7%
w      4949365   8.3%
t      4949365   8.3%
D      4949365   8.3%
m      4949365   8.3%
a      4949365   8.3%
r      4949365   8.3%
k      4949365   8.3%

userid
Real number (ℝ)

ZEROS 

User id

Distinct: 13
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.74016869
Minimum: 0
Maximum: 27
Zeros: 308744
Zeros (%): 6.2%
Negative: 0
Negative (%): 0.0%
Memory size: 75.5 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 3
Median: 17
Q3: 17
95-th percentile: 26
Maximum: 27
Range: 27
Interquartile range (IQR): 14

Descriptive statistics

Standard deviation: 8.333247111
Coefficient of variation (CV): 0.7098064203
Kurtosis: -1.426766479
Mean: 11.74016869
Median Absolute Deviation (MAD): 9
Skewness: 0.1166561648
Sum: 58106380
Variance: 69.44300741
Monotonicity: Increasing
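
Several of the descriptive statistics above are related by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below checks these against the reported figures; the variable names are illustrative, and small differences are expected because the inputs are rounded.

```python
# Figures copied from the userid statistics above.
n_observations = 4_949_365
mean = 11.74016869
std = 8.333247111

# Derived quantities.
cv = std / mean              # coefficient of variation
variance = std ** 2          # variance
total = mean * n_observations  # sum of all userid values

print(f"CV       = {cv:.10f}")
print(f"Variance = {variance:.8f}")
print(f"Sum      = {total:.0f}")
```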
Histogram with fixed size bins (bins=13)

Value             Count    Frequency (%)
17                1596043  32.2%
6                 861699   17.4%
3                 858043   17.3%
21                382192   7.7%
2                 335547   6.8%
0                 308744   6.2%
26                221353   4.5%
27                102630   2.1%
22                86980    1.8%
25                85560    1.7%
Other values (3)  110574   2.2%
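
The frequency column is each count divided by the total number of observations, and the "Other values" row is whatever remains after the ten listed values. A minimal sketch of that arithmetic, with illustrative names:

```python
# Counts of the ten most common userid values, copied from the table above.
n_observations = 4_949_365
top_counts = {
    17: 1_596_043, 6: 861_699, 3: 858_043, 21: 382_192, 2: 335_547,
    0: 308_744, 26: 221_353, 27: 102_630, 22: 86_980, 25: 85_560,
}

def pct(count: int) -> float:
    """Share of all observations, in percent, rounded to one decimal."""
    return round(100 * count / n_observations, 1)

# Observations not covered by the ten listed values.
other = n_observations - sum(top_counts.values())

for value, count in top_counts.items():
    print(value, count, f"{pct(count)}%")
print("Other values", other, f"{pct(other)}%")
```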

Minimum 5 values

Value  Count   Frequency (%)
0      308744  6.2%
2      335547  6.8%
3      858043  17.3%
6      861699  17.4%
12     32059   0.6%

Maximum 5 values

Value  Count   Frequency (%)
27     102630  2.1%
26     221353  4.5%
25     85560   1.7%
22     86980   1.8%
21     382192  7.7%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct: 4944224
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 75.5 MiB
Minimum: 2020-11-16 07:00:07.839000
Maximum: 2020-12-11 21:59:49.265000
Histogram with fixed size bins (bins=50)
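
The timestamp figures above also give the time span covered by the data and the number of rows whose timestamp repeats an earlier one. The snippet below is a rough sketch using the standard library only; the variable names are illustrative.

```python
from datetime import datetime

# Figures copied from the timestamp statistics above.
n_observations = 4_949_365
n_distinct = 4_944_224
first = datetime.fromisoformat("2020-11-16 07:00:07.839000")
last = datetime.fromisoformat("2020-12-11 21:59:49.265000")

# Time covered between the earliest and latest observation.
span = last - first

# Rows beyond the first occurrence of each distinct timestamp.
n_repeated = n_observations - n_distinct
distinct_pct = round(100 * n_distinct / n_observations, 1)

print(f"Span: {span}")
print(f"Repeated timestamps: {n_repeated} ({distinct_pct}% distinct)")
```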