Overview

Dataset statistics

Number of variables4
Number of observations13965
Missing cells0
Missing cells (%)0.0%
Total size in memory504.7 KiB
Average record size in memory37.0 B

Variable types

Text1
Numeric1
DateTime1
Boolean1

Dataset

Description[0/1] Returns whether music is being played on the phone (yes or no) using the default music player from the operating system. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
statusReturn if a music event is being played

Alerts

experimentid has constant value "wenetDenmark"Constant
status is highly imbalanced (92.4%)Imbalance
timestamp has unique valuesUnique
userid has 1581 (11.3%) zerosZeros

Reproduction

Analysis started2024-11-23 01:50:48.227927
Analysis finished2024-11-23 01:50:48.437510
Duration0.21 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size272.9 KiB
2024-11-23T02:50:48.547659image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters167580
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetDenmark
2nd rowwenetDenmark
3rd rowwenetDenmark
4th rowwenetDenmark
5th rowwenetDenmark
ValueCountFrequency (%)
wenetdenmark 13965
100.0%
2024-11-23T02:50:48.768119image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 41895
25.0%
n 27930
16.7%
w 13965
 
8.3%
t 13965
 
8.3%
D 13965
 
8.3%
m 13965
 
8.3%
a 13965
 
8.3%
r 13965
 
8.3%
k 13965
 
8.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 167580
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 41895
25.0%
n 27930
16.7%
w 13965
 
8.3%
t 13965
 
8.3%
D 13965
 
8.3%
m 13965
 
8.3%
a 13965
 
8.3%
r 13965
 
8.3%
k 13965
 
8.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 167580
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 41895
25.0%
n 27930
16.7%
w 13965
 
8.3%
t 13965
 
8.3%
D 13965
 
8.3%
m 13965
 
8.3%
a 13965
 
8.3%
r 13965
 
8.3%
k 13965
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 167580
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 41895
25.0%
n 27930
16.7%
w 13965
 
8.3%
t 13965
 
8.3%
D 13965
 
8.3%
m 13965
 
8.3%
a 13965
 
8.3%
r 13965
 
8.3%
k 13965
 
8.3%

userid
Real number (ℝ)

ZEROS 

User id

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18.34514859
Minimum0
Maximum27
Zeros1581
Zeros (%)11.3%
Negative0
Negative (%)0.0%
Memory size109.2 KiB
2024-11-23T02:50:48.868444image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q122
median23
Q323
95-th percentile26
Maximum27
Range27
Interquartile range (IQR)1

Descriptive statistics

Standard deviation9.074409862
Coefficient of variation (CV)0.494649025
Kurtosis-0.06125048268
Mean18.34514859
Median Absolute Deviation (MAD)0
Skewness-1.337235489
Sum256190
Variance82.34491434
MonotonicityIncreasing
2024-11-23T02:50:48.955476image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
23 9437
67.6%
0 1581
 
11.3%
2 1263
 
9.0%
27 605
 
4.3%
22 560
 
4.0%
12 395
 
2.8%
26 118
 
0.8%
25 6
 
< 0.1%
ValueCountFrequency (%)
0 1581
 
11.3%
2 1263
 
9.0%
12 395
 
2.8%
22 560
 
4.0%
23 9437
67.6%
ValueCountFrequency (%)
27 605
 
4.3%
26 118
 
0.8%
25 6
 
< 0.1%
23 9437
67.6%
22 560
 
4.0%

timestamp
Date

UNIQUE 

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct13965
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size109.2 KiB
Minimum2020-11-16 08:35:29.943000
Maximum2020-12-11 15:43:31.933000
2024-11-23T02:50:49.064743image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T02:50:49.182651image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

status
Boolean

IMBALANCE 

Return if a music event is being played

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size13.8 KiB
True
13836 
False
 
129
ValueCountFrequency (%)
True 13836
99.1%
False 129
 
0.9%
2024-11-23T02:50:49.273369image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Correlations

2024-11-23T02:50:49.322842image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
statususerid
status1.0000.072
userid0.0721.000