Overview

Dataset statistics

Number of variables4
Number of observations316915
Missing cells0
Missing cells (%)0.0%
Total size in memory10.6 MiB
Average record size in memory35.0 B

Variable types

Text1
Numeric1
DateTime1
Boolean1

Dataset

Description[0/1] Returns whether music is being played on the phone (yes or no) using the default music player from the operating system. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
statusReturn if a music event is being played

Alerts

experimentid has constant value "wenetItaly"Constant
status is highly imbalanced (59.6%)Imbalance

Reproduction

Analysis started2024-11-23 05:56:30.220719
Analysis finished2024-11-23 05:56:31.343400
Duration1.12 second
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size5.4 MiB
2024-11-23T06:56:31.438866image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters3169150
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetItaly
2nd rowwenetItaly
3rd rowwenetItaly
4th rowwenetItaly
5th rowwenetItaly
ValueCountFrequency (%)
wenetitaly 316915
100.0%
2024-11-23T06:56:31.662390image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 633830
20.0%
t 633830
20.0%
w 316915
10.0%
n 316915
10.0%
I 316915
10.0%
a 316915
10.0%
l 316915
10.0%
y 316915
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 3169150
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 633830
20.0%
t 633830
20.0%
w 316915
10.0%
n 316915
10.0%
I 316915
10.0%
a 316915
10.0%
l 316915
10.0%
y 316915
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 3169150
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 633830
20.0%
t 633830
20.0%
w 316915
10.0%
n 316915
10.0%
I 316915
10.0%
a 316915
10.0%
l 316915
10.0%
y 316915
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 3169150
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 633830
20.0%
t 633830
20.0%
w 316915
10.0%
n 316915
10.0%
I 316915
10.0%
a 316915
10.0%
l 316915
10.0%
y 316915
10.0%

userid
Real number (ℝ)

User id

Distinct129
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean129.9969834
Minimum1
Maximum263
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 MiB
2024-11-23T06:56:31.783562image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile19
Q157
median130
Q3202
95-th percentile249
Maximum263
Range262
Interquartile range (IQR)145

Descriptive statistics

Standard deviation79.95563898
Coefficient of variation (CV)0.6150576489
Kurtosis-1.333124605
Mean129.9969834
Median Absolute Deviation (MAD)73
Skewness0.04310607755
Sum41197994
Variance6392.904205
MonotonicityIncreasing
2024-11-23T06:56:31.903657image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19 40054
 
12.6%
245 21142
 
6.7%
161 19842
 
6.3%
249 16509
 
5.2%
42 14112
 
4.5%
79 13982
 
4.4%
57 13222
 
4.2%
118 10674
 
3.4%
209 7665
 
2.4%
217 7441
 
2.3%
Other values (119) 152272
48.0%
ValueCountFrequency (%)
1 71
 
< 0.1%
2 185
 
0.1%
4 35
 
< 0.1%
6 782
0.2%
7 120
 
< 0.1%
ValueCountFrequency (%)
263 112
 
< 0.1%
262 1667
 
0.5%
258 2248
0.7%
256 450
 
0.1%
255 5193
1.6%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct316889
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size2.4 MiB
Minimum2020-11-16 07:25:18.125000
Maximum2020-12-11 21:59:16.497000
2024-11-23T06:56:32.043743image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T06:56:32.161575image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

status
Boolean

IMBALANCE 

Return if a music event is being played

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size309.6 KiB
True
291407 
False
 
25508
ValueCountFrequency (%)
True 291407
92.0%
False 25508
 
8.0%
2024-11-23T06:56:32.253375image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Correlations

2024-11-23T06:56:32.304520image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
statususerid
status1.000-0.027
userid-0.0271.000