Overview

Dataset statistics

Number of variables: 3
Number of observations: 5097489
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 204.2 MiB
Average record size in memory: 42.0 B
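
The average record size can be cross-checked from the total size and the number of observations. The sketch below is a minimal check, assuming 1 MiB = 2**20 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
total_size_mib = 204.2
n_observations = 5_097_489

BYTES_PER_MIB = 1024 ** 2  # assumption: MiB means 2**20 bytes

avg_record_bytes = total_size_mib * BYTES_PER_MIB / n_observations
print(f"{avg_record_bytes:.1f} B")  # 42.0 B
```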

Variable types

Text: 1
Numeric: 1
DateTime: 1

Dataset

Description: [unitless] Returns the number of screen touches that occurred. To allow comparison across sensor observations, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
Creator: Andrea Bontempelli, Matteo Busso, Roy Alia Asiku
Author: Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright: (c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentid: Experiment Id
userid: User id
timestamp: show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Alerts

experimentid has constant value "wenetIndia" (Constant)
userid has 2931971 (57.5%) zeros (Zeros)
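
Both alerts can be reproduced with simple checks. The sketch below is only an illustration of what the alerts mean, not the profiler's own implementation; `is_constant` and `zero_share` are hypothetical helper names, and the sample list stands in for the full column.

```python
def is_constant(values):
    """Return True when the column holds exactly one distinct value."""
    return len(set(values)) == 1


def zero_share(n_zeros, n_total):
    """Return the fraction of observations that are equal to zero."""
    return n_zeros / n_total


# Five sample rows of experimentid, as listed in the report.
sample = ["wenetIndia"] * 5
print(is_constant(sample))  # True

# Counts copied from the userid alert.
share = zero_share(2_931_971, 5_097_489)
print(f"{share:.1%}")  # 57.5%
```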

Reproduction

Analysis started: 2024-11-22 12:32:22.465145
Analysis finished: 2024-11-22 12:32:36.851021
Duration: 14.39 seconds
Software version: ydata-profiling v4.8.3
Download configuration: config.json
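
The reported duration is the difference between the two analysis timestamps. A minimal check using only the Python standard library (variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the reproduction details above.
started = datetime.fromisoformat("2024-11-22 12:32:22.465145")
finished = datetime.fromisoformat("2024-11-22 12:32:36.851021")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")  # 14.39 seconds
```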

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 126.4 MiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 50974890
Distinct characters: 8
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
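
As an illustration of such character properties, the general category of each code point can be looked up with Python's standard `unicodedata` module. This is a sketch under the assumption that the standard library is an acceptable stand-in; it is not necessarily how the profiler derives its categories, and the report itself lists them as "(unknown)".

```python
import unicodedata
from collections import Counter

value = "wenetIndia"  # the constant experimentid value

# General category per character: 'Ll' = lowercase letter, 'Lu' = uppercase letter.
categories = Counter(unicodedata.category(ch) for ch in value)
print(dict(categories))
```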

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: wenetIndia
2nd row: wenetIndia
3rd row: wenetIndia
4th row: wenetIndia
5th row: wenetIndia

Value       Count     Frequency (%)
wenetindia  5097489   100.0%

Most occurring characters

Value  Count     Frequency (%)
e      10194978  20.0%
n      10194978  20.0%
w      5097489   10.0%
t      5097489   10.0%
I      5097489   10.0%
d      5097489   10.0%
i      5097489   10.0%
a      5097489   10.0%
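
Because the column is constant, the character counts follow directly from the single value and the number of observations. A minimal sketch (variable names are illustrative):

```python
from collections import Counter

value = "wenetIndia"        # the only value in the column
n_observations = 5_097_489  # from the dataset statistics

per_value = Counter(value)                  # character counts within one value
total_chars = len(value) * n_observations   # 10 characters per row
totals = {ch: n * n_observations for ch, n in per_value.items()}

# Print characters by descending count with their share of all characters.
for ch, count in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(ch, count, f"{count / total_chars:.1%}")
```

The letters "e" and "n" each appear twice in the value, which is why they account for 20.0% each while the other six characters account for 10.0% each.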

Most occurring categories

Value      Count     Frequency (%)
(unknown)  50974890  100.0%

Most frequent character per category

(unknown)
Value  Count     Frequency (%)
e      10194978  20.0%
n      10194978  20.0%
w      5097489   10.0%
t      5097489   10.0%
I      5097489   10.0%
d      5097489   10.0%
i      5097489   10.0%
a      5097489   10.0%

Most occurring scripts

Value      Count     Frequency (%)
(unknown)  50974890  100.0%

Most frequent character per script

(unknown)
Value  Count     Frequency (%)
e      10194978  20.0%
n      10194978  20.0%
w      5097489   10.0%
t      5097489   10.0%
I      5097489   10.0%
d      5097489   10.0%
i      5097489   10.0%
a      5097489   10.0%

Most occurring blocks

Value      Count     Frequency (%)
(unknown)  50974890  100.0%

Most frequent character per block

(unknown)
Value  Count     Frequency (%)
e      10194978  20.0%
n      10194978  20.0%
w      5097489   10.0%
t      5097489   10.0%
I      5097489   10.0%
d      5097489   10.0%
i      5097489   10.0%
a      5097489   10.0%

userid
Real number (ℝ)

ZEROS 

User id

Distinct: 12
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.750704121
Minimum: 0
Maximum: 62
Zeros: 2931971
Zeros (%): 57.5%
Negative: 0
Negative (%): 0.0%
Memory size: 77.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 12
95-th percentile: 43
Maximum: 62
Range: 62
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 14.39924737
Coefficient of variation (CV): 1.645495856
Kurtosis: 5.380385753
Mean: 8.750704121
Median Absolute Deviation (MAD): 0
Skewness: 2.328552181
Sum: 44606618
Variance: 207.3383247
Monotonicity: Increasing
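
Several of these statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the squared standard deviation, and the coefficient of variation is the standard deviation divided by the mean. The sketch below checks the reported figures against each other; variable names are illustrative.

```python
import math

# Figures copied from the report.
n_observations = 5_097_489
total = 44_606_618
std = 14.39924737

mean = total / n_observations  # reported: 8.750704121
variance = std ** 2            # reported: 207.3383247
cv = std / mean                # reported: 1.645495856

print(math.isclose(mean, 8.750704121, abs_tol=1e-6))
print(math.isclose(variance, 207.3383247, abs_tol=1e-5))
print(math.isclose(cv, 1.645495856, abs_tol=1e-6))
```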
Histogram with fixed size bins (bins=12)
Value             Count    Frequency (%)
0                 2931971  57.5%
12                1211469  23.8%
17                519385   10.2%
62                156086   3.1%
35                85469    1.7%
43                85023    1.7%
57                69208    1.4%
4                 12676    0.2%
49                10594    0.2%
25                10547    0.2%
Other values (2)  5061     0.1%

Value  Count    Frequency (%)
0      2931971  57.5%
4      12676    0.2%
12     1211469  23.8%
17     519385   10.2%
22     890      < 0.1%

Value  Count   Frequency (%)
62     156086  3.1%
57     69208   1.4%
49     10594   0.2%
43     85023   1.7%
35     85469   1.7%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct: 5093826
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 77.8 MiB
Minimum: 2021-07-12 08:00:01.384000
Maximum: 2021-08-12 13:39:46.970000
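
The covered time span and the number of repeated timestamps follow from the figures above. A minimal sketch using the Python standard library (variable names are illustrative):

```python
from datetime import datetime

# Minimum and maximum timestamps copied from the report.
first = datetime.fromisoformat("2021-07-12 08:00:01.384000")
last = datetime.fromisoformat("2021-08-12 13:39:46.970000")
span = last - first
print(span)  # 31 days, 5:39:45.586000

# Observations whose timestamp also occurs on an earlier row.
n_observations = 5_097_489
n_distinct = 5_093_826
repeated = n_observations - n_distinct
print(repeated, f"{n_distinct / n_observations:.1%}")  # 3663 99.9%
```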
Histogram with fixed size bins (bins=50)