Overview

Dataset statistics

Number of variables: 4
Number of observations: 33135
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 1.1 MiB
Average record size in memory: 35.0 B
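
The two memory figures above can be cross-checked against the per-variable memory sizes listed in the Variables section. The following is a minimal sketch of that arithmetic; the variable names are illustrative and not part of the profiler's output.

```python
# Per-variable memory sizes in KiB, as listed in the Variables section.
memory_kib = {
    "experimentid": 582.6,
    "userid": 259.0,
    "timestamp": 259.0,
    "status": 32.5,
}
n_observations = 33135

total_kib = sum(memory_kib.values())
total_mib = total_kib / 1024                          # KiB -> MiB
avg_record_bytes = total_kib * 1024 / n_observations  # bytes per row

print(f"Total size in memory: {total_mib:.1f} MiB")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Both results round to the values reported above, so the summary appears to be the sum of the four column footprints.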

Variable types

Text: 1
Numeric: 1
DateTime: 1
Boolean: 1

Dataset

Description: [0/1] Indicates whether headphones were connected to the phone. To make the sensor observations comparable, the sampling frequency was reduced to one minute. For each categorical variable, the first non-missing value is reported.
Creator: Andrea Bontempelli, Matteo Busso, Roy Alia Asiku
Author: Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright: (c) University of Trento - Knowledge Diversity 2023
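
The description mentions reducing the frequency to one minute and reporting the first non-missing value. The report does not say how this was implemented, so the sketch below is only one plausible reading: readings are grouped per user and per minute, and the first non-missing status in each group is kept. The function name and the sample readings are hypothetical.

```python
from datetime import datetime

def downsample_to_minute(rows):
    """Keep, per user and minute, the first non-missing status.

    `rows` is an iterable of (userid, timestamp, status) tuples,
    where status may be None for a missing reading.
    """
    first_seen = {}
    for userid, ts, status in sorted(rows, key=lambda r: (r[0], r[1])):
        if status is None:
            continue  # skip missing readings
        minute = ts.replace(second=0, microsecond=0)  # truncate to the minute
        first_seen.setdefault((userid, minute), status)
    return first_seen

# Hypothetical raw readings, invented for illustration only.
raw = [
    (18, datetime(2020, 11, 16, 7, 48, 30, 584000), None),
    (18, datetime(2020, 11, 16, 7, 48, 41, 120000), True),
    (18, datetime(2020, 11, 16, 7, 48, 55, 900000), False),
    (18, datetime(2020, 11, 16, 7, 49, 2, 10000), False),
]
result = downsample_to_minute(raw)
for (userid, minute), status in result.items():
    print(userid, minute, status)
```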

Variable descriptions

experimentid: Experiment ID
userid: User ID
timestamp: Timestamp showing month (2 digits), day (2), hour (2), minute (2), second (2), and decimals (3)
status: Returns whether the headset plug has been inserted

Alerts

experimentid has constant value "wenetItaly" (Constant)
timestamp has unique values (Unique)

Reproduction

Analysis started: 2024-11-23 05:56:04.005130
Analysis finished: 2024-11-23 05:56:04.279792
Duration: 0.27 seconds
Software version: ydata-profiling v4.8.3
Download configuration: config.json
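
A report of this kind can be regenerated along the following lines. This is a sketch under assumptions: the exact settings used are in config.json and are not reproduced here, the report title and output path are hypothetical, and `df` is assumed to be a pandas DataFrame with the four columns described above. The metadata dictionaries mirror the Dataset and Variable descriptions sections.

```python
# Metadata copied from the Dataset and Variable descriptions sections.
DATASET_INFO = {
    "description": "[0/1] Returns whether the headphones of the phone were connected.",
    "creator": "Andrea Bontempelli, Matteo Busso, Roy Alia Asiku",
    "author": "Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia",
    "copyright_holder": "University of Trento - Knowledge Diversity",
    "copyright_year": 2023,
}
VARIABLE_DESCRIPTIONS = {
    "experimentid": "Experiment Id",
    "userid": "User id",
    "timestamp": "show month(2), day(2), hour(2), minute(2), second(2), decimals(3)",
    "status": "Return if the headset plug has been inserted",
}

def build_report(df, output_path="report.html"):
    """Profile `df` and write an HTML report (title and path are hypothetical)."""
    # Imported lazily so the metadata above can be inspected without the library.
    from ydata_profiling import ProfileReport

    profile = ProfileReport(
        df,
        title="Headset plug dataset",  # assumed title, not taken from the report
        dataset=DATASET_INFO,
        variables={"descriptions": VARIABLE_DESCRIPTIONS},
    )
    profile.to_file(output_path)
    return profile
```

Calling `build_report(df)` requires ydata-profiling and pandas to be installed; the version reported above is v4.8.3.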

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 582.6 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 331350
Distinct characters: 8
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: wenetItaly
2nd row: wenetItaly
3rd row: wenetItaly
4th row: wenetItaly
5th row: wenetItaly

Value       Count  Frequency (%)
wenetitaly  33135  100.0%

Most occurring characters

Value  Count  Frequency (%)
e      66270  20.0%
t      66270  20.0%
w      33135  10.0%
n      33135  10.0%
I      33135  10.0%
a      33135  10.0%
l      33135  10.0%
y      33135  10.0%
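
Because experimentid is constant, the character counts above follow directly from the characters of the single value multiplied by the number of rows. A minimal sketch of that calculation (variable names are illustrative):

```python
from collections import Counter

value = "wenetItaly"   # the constant experimentid value
n_rows = 33135         # number of observations

per_value = Counter(value)  # occurrences of each character in one value
totals = {ch: n * n_rows for ch, n in per_value.items()}
total_chars = sum(totals.values())

# Print characters from most to least frequent, with their share of the total.
for ch, count in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(f"{ch} {count} {count / total_chars:.1%}")
```

The letters e and t each appear twice in the value, which is why they account for 20.0% each while the other six characters account for 10.0% each.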

Most occurring categories

Value      Count   Frequency (%)
(unknown)  331350  100.0%

Most frequent character per category

(unknown): identical to the "Most occurring characters" table above.

Most occurring scripts

Value      Count   Frequency (%)
(unknown)  331350  100.0%

Most frequent character per script

(unknown): identical to the "Most occurring characters" table above.

Most occurring blocks

Value      Count   Frequency (%)
(unknown)  331350  100.0%

Most frequent character per block

(unknown): identical to the "Most occurring characters" table above.

userid
Real number (ℝ)

User id

Distinct: 158
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 142.9447714
Minimum: 0
Maximum: 265
Zeros: 17
Zeros (%): 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 259.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 18
Q1: 57
Median: 163
Q3: 217
95th percentile: 252
Maximum: 265
Range: 265
Interquartile range (IQR): 160

Descriptive statistics

Standard deviation: 84.36356757
Coefficient of variation (CV): 0.5901829549
Kurtosis: -1.457181978
Mean: 142.9447714
Median Absolute Deviation (MAD): 79
Skewness: -0.186387776
Sum: 4736475
Variance: 7117.211533
Monotonicity: Increasing
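
Several of the userid statistics are derived from others, which gives a simple way to check the report for internal consistency. The sketch below recomputes them from the reported mean, standard deviation, quartiles, and extremes; it does not touch the underlying data, and the variable names are illustrative.

```python
n = 33135                 # number of observations
mean = 142.9447714
std = 84.36356757
q1, q3 = 57, 217
minimum, maximum = 0, 265

derived = {
    "Range": maximum - minimum,
    "Interquartile range (IQR)": q3 - q1,
    "Coefficient of variation (CV)": std / mean,
    "Variance": std ** 2,
    "Sum": mean * n,
}
for name, value in derived.items():
    print(f"{name}: {value:.4f}")
```

The recomputed values agree with the reported ones up to rounding of the inputs.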
Histogram with fixed size bins (bins=50)

Most common values

Value               Count  Frequency (%)
242                  4057  12.2%
18                   2228  6.7%
190                  1660  5.0%
57                   1618  4.9%
200                  1086  3.3%
191                   865  2.6%
167                   756  2.3%
257                   734  2.2%
112                   721  2.2%
249                   720  2.2%
Other values (148)  18690  56.4%

Smallest values

Value  Count  Frequency (%)
0         17  0.1%
1         58  0.2%
2         11  < 0.1%
3         47  0.1%
5         85  0.3%

Largest values

Value  Count  Frequency (%)
265      335  1.0%
263      148  0.4%
259       89  0.3%
257      734  2.2%
255       83  0.3%

timestamp
Date

UNIQUE 

Timestamp showing month (2 digits), day (2), hour (2), minute (2), second (2), and decimals (3)

Distinct: 33135
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 259.0 KiB
Minimum: 2020-11-16 07:48:30.584000
Maximum: 2020-12-11 21:40:50.475000
Histogram with fixed size bins (bins=50)
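
The minimum and maximum timestamps determine the observation window, and with 50 fixed-size bins they also give a rough bin width for the histogram. The sketch below assumes the bins span exactly from the minimum to the maximum, which the report does not state explicitly.

```python
from datetime import datetime, timedelta

first = datetime.fromisoformat("2020-11-16 07:48:30.584000")
last = datetime.fromisoformat("2020-12-11 21:40:50.475000")

span = last - first      # length of the observation window
bin_width = span / 50    # assumed: 50 equal bins from minimum to maximum

print(f"Observation window: {span}")
print(f"Approximate bin width: {bin_width.total_seconds() / 3600:.1f} hours")
```

Under that assumption the window covers a little over 25 days, so each bin represents roughly half a day of observations.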

status
Boolean

Returns whether the headset plug has been inserted

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 32.5 KiB

Value  Count  Frequency (%)
True   17209  51.9%
False  15926  48.1%

Correlations

        status  userid
status  1.000   0.016
userid  0.016   1.000