Overview

Dataset statistics

Number of variables4
Number of observations4443
Missing cells0
Missing cells (%)0.0%
Total size in memory160.7 KiB
Average record size in memory37.0 B

Variable types

Text1
Numeric1
DateTime1
Boolean1

Dataset

Description[0/1] Returns whether the headphones of the phone were connected. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
statusReturn if the headset plug has been inserted

Alerts

experimentid has constant value "wenetDenmark"Constant
timestamp has unique valuesUnique

Reproduction

Analysis started2024-11-23 01:50:37.556290
Analysis finished2024-11-23 01:50:37.749986
Duration0.19 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size86.9 KiB
2024-11-23T02:50:37.860475image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters53316
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetDenmark
2nd rowwenetDenmark
3rd rowwenetDenmark
4th rowwenetDenmark
5th rowwenetDenmark
ValueCountFrequency (%)
wenetdenmark 4443
100.0%
2024-11-23T02:50:38.066986image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 13329
25.0%
n 8886
16.7%
w 4443
 
8.3%
t 4443
 
8.3%
D 4443
 
8.3%
m 4443
 
8.3%
a 4443
 
8.3%
r 4443
 
8.3%
k 4443
 
8.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 53316
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 13329
25.0%
n 8886
16.7%
w 4443
 
8.3%
t 4443
 
8.3%
D 4443
 
8.3%
m 4443
 
8.3%
a 4443
 
8.3%
r 4443
 
8.3%
k 4443
 
8.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 53316
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 13329
25.0%
n 8886
16.7%
w 4443
 
8.3%
t 4443
 
8.3%
D 4443
 
8.3%
m 4443
 
8.3%
a 4443
 
8.3%
r 4443
 
8.3%
k 4443
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 53316
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 13329
25.0%
n 8886
16.7%
w 4443
 
8.3%
t 4443
 
8.3%
D 4443
 
8.3%
m 4443
 
8.3%
a 4443
 
8.3%
r 4443
 
8.3%
k 4443
 
8.3%

userid
Real number (ℝ)

User id

Distinct11
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.38937655
Minimum2
Maximum26
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size34.8 KiB
2024-11-23T02:50:38.166417image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile3
Q117
median17
Q317
95-th percentile26
Maximum26
Range24
Interquartile range (IQR)0

Descriptive statistics

Standard deviation5.580534336
Coefficient of variation (CV)0.3404970482
Kurtosis1.844015508
Mean16.38937655
Median Absolute Deviation (MAD)0
Skewness-1.303642747
Sum72818
Variance31.14236348
MonotonicityIncreasing
2024-11-23T02:50:38.253840image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
17 3103
69.8%
3 424
 
9.5%
23 239
 
5.4%
26 239
 
5.4%
20 210
 
4.7%
2 104
 
2.3%
22 37
 
0.8%
25 33
 
0.7%
19 26
 
0.6%
21 23
 
0.5%
ValueCountFrequency (%)
2 104
 
2.3%
3 424
 
9.5%
12 5
 
0.1%
17 3103
69.8%
19 26
 
0.6%
ValueCountFrequency (%)
26 239
5.4%
25 33
 
0.7%
23 239
5.4%
22 37
 
0.8%
21 23
 
0.5%

timestamp
Date

UNIQUE 

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct4443
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size34.8 KiB
Minimum2020-11-16 08:20:11.192000
Maximum2020-12-11 21:49:47.462000
2024-11-23T02:50:38.362441image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T02:50:38.482986image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

status
Boolean

Return if the headset plug has been inserted

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
True
2298 
False
2145 
ValueCountFrequency (%)
True 2298
51.7%
False 2145
48.3%
2024-11-23T02:50:38.577910image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Correlations

2024-11-23T02:50:38.633992image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
statususerid
status1.000-0.009
userid-0.0091.000