Overview

Dataset statistics

Number of variables4
Number of observations66832332
Missing cells0
Missing cells (%)0.0%
Total size in memory3.1 GiB
Average record size in memory50.0 B

Variable types

Text1
Numeric2
DateTime1

Dataset

Description[cm] Measures the distance between the user's head and the phone, depending on the phone it may be measured in centimeters (i.e., the absolute distance) or as labels (e.g., 'near', 'far'). To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
valueThe distance value (cm, centimeter)

Alerts

experimentid has constant value "wenetItaly"Constant
userid has 1061489 (1.6%) zerosZeros
value has 15417075 (23.1%) zerosZeros

Reproduction

Analysis started2024-11-23 13:20:58.536871
Analysis finished2024-11-23 13:25:06.685386
Duration4 minutes and 8.15 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.6 GiB
2024-11-23T14:25:06.739176image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters668323320
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetItaly
2nd rowwenetItaly
3rd rowwenetItaly
4th rowwenetItaly
5th rowwenetItaly
ValueCountFrequency (%)
wenetitaly 66832332
100.0%
2024-11-23T14:25:06.917836image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 133664664
20.0%
t 133664664
20.0%
w 66832332
10.0%
n 66832332
10.0%
I 66832332
10.0%
a 66832332
10.0%
l 66832332
10.0%
y 66832332
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 668323320
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 133664664
20.0%
t 133664664
20.0%
w 66832332
10.0%
n 66832332
10.0%
I 66832332
10.0%
a 66832332
10.0%
l 66832332
10.0%
y 66832332
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 668323320
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 133664664
20.0%
t 133664664
20.0%
w 66832332
10.0%
n 66832332
10.0%
I 66832332
10.0%
a 66832332
10.0%
l 66832332
10.0%
y 66832332
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 668323320
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 133664664
20.0%
t 133664664
20.0%
w 66832332
10.0%
n 66832332
10.0%
I 66832332
10.0%
a 66832332
10.0%
l 66832332
10.0%
y 66832332
10.0%

userid
Real number (ℝ)

ZEROS 

User id

Distinct218
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean108.26394
Minimum0
Maximum265
Zeros1061489
Zeros (%)1.6%
Negative0
Negative (%)0.0%
Memory size1019.8 MiB
2024-11-23T14:25:07.037957image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile24
Q191
median91
Q3134
95-th percentile134
Maximum265
Range265
Interquartile range (IQR)43

Descriptive statistics

Standard deviation39.43345349
Coefficient of variation (CV)0.3642344209
Kurtosis2.385967798
Mean108.26394
Median Absolute Deviation (MAD)43
Skewness-0.1075521461
Sum7235531583
Variance1554.997254
MonotonicityIncreasing
2024-11-23T14:25:07.160948image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
134 27519528
41.2%
91 27340397
40.9%
3 1798246
 
2.7%
62 1168825
 
1.7%
82 1081143
 
1.6%
0 1061489
 
1.6%
77 827904
 
1.2%
99 478253
 
0.7%
195 452376
 
0.7%
203 452196
 
0.7%
Other values (208) 4651975
 
7.0%
ValueCountFrequency (%)
0 1061489
1.6%
1 11632
 
< 0.1%
2 7526
 
< 0.1%
3 1798246
2.7%
4 59074
 
0.1%
ValueCountFrequency (%)
265 2700
 
< 0.1%
264 324
 
< 0.1%
263 15684
< 0.1%
262 4467
 
< 0.1%
260 1209
 
< 0.1%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct66031836
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size1019.8 MiB
Minimum2020-11-16 07:00:00.074000
Maximum2020-12-11 21:59:59.919000
2024-11-23T14:25:07.288951image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T14:25:07.408032image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

value
Real number (ℝ)

ZEROS 

The distance value (cm, centimeter)

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.859825391
Minimum0
Maximum10
Zeros15417075
Zeros (%)23.1%
Negative0
Negative (%)0.0%
Memory size1019.8 MiB
2024-11-23T14:25:07.518115image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15
median5
Q35
95-th percentile5
Maximum10
Range10
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.125706635
Coefficient of variation (CV)0.5507261131
Kurtosis-0.3437705502
Mean3.859825391
Median Absolute Deviation (MAD)0
Skewness-1.224299873
Sum257961132
Variance4.518628698
MonotonicityNot monotonic
2024-11-23T14:25:07.613486image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
5 51089286
76.4%
0 15417075
 
23.1%
8 276411
 
0.4%
1 20118
 
< 0.1%
10 18318
 
< 0.1%
9 11124
 
< 0.1%
ValueCountFrequency (%)
0 15417075
 
23.1%
1 20118
 
< 0.1%
5 51089286
76.4%
8 276411
 
0.4%
9 11124
 
< 0.1%
ValueCountFrequency (%)
10 18318
 
< 0.1%
9 11124
 
< 0.1%
8 276411
 
0.4%
5 51089286
76.4%
1 20118
 
< 0.1%

Correlations

2024-11-23T14:25:07.673880image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
useridvalue
userid1.000-0.102
value-0.1021.000