Overview

Dataset statistics

Number of variables: 4
Number of observations: 932
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 32.0 KiB
Average record size in memory: 35.1 B

Variable types

Text: 1
Numeric: 1
DateTime: 1
Boolean: 1

Dataset

Description: [0/1] Returns whether music is being played on the phone (yes or no) using the default music player from the operating system. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
Creator: Andrea Bontempelli, Matteo Busso, Roy Alia Asiku
Author: Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright: (c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentid: Experiment Id
userid: User id
timestamp: show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
status: Returns whether a music event is being played

Alerts

Constant: experimentid has constant value "wenetIndia"
Imbalance: status is highly imbalanced (93.6%)
Unique: timestamp has unique values
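
The 93.6% figure in the imbalance alert can be reproduced from the status counts given later in the report (925 True, 7 False). The sketch below assumes the score is one minus the normalized Shannon entropy of the class distribution. That formula is an assumption that happens to match the reported number, not something the report itself states, and `imbalance_score` is a hypothetical helper name.

```python
import math

# Counts taken from the status variable in this report.
counts = {"True": 925, "False": 7}


def imbalance_score(class_counts):
    """Return 1 - H/log2(k): 0 means perfectly balanced, 1 means one class only.

    Assumed formula; it reproduces the 93.6% shown in the alert.
    """
    total = sum(class_counts.values())
    n_classes = len(class_counts)
    if n_classes < 2:
        # The normalization is undefined for a single class; returning 0.0
        # here is a convention chosen for this sketch.
        return 0.0
    entropy = -sum(
        (c / total) * math.log2(c / total)
        for c in class_counts.values()
        if c > 0
    )
    return 1 - entropy / math.log2(n_classes)


score = imbalance_score(counts)
print(f"{score:.1%}")  # 93.6%
```

A score this close to 1 means almost all of the 932 observations fall in a single class, so accuracy-style metrics on status would be dominated by the True class.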

Reproduction

Analysis started: 2024-11-22 12:32:00.016397
Analysis finished: 2024-11-22 12:32:00.181476
Duration: 0.17 seconds
Software version: ydata-profiling v4.8.3
Download configuration: config.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 16.5 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 9320
Distinct characters: 8
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: wenetIndia
2nd row: wenetIndia
3rd row: wenetIndia
4th row: wenetIndia
5th row: wenetIndia

Value        Count   Frequency (%)
wenetindia   932     100.0%

Most occurring characters

Value   Count   Frequency (%)
e       1864    20.0%
n       1864    20.0%
w       932     10.0%
t       932     10.0%
I       932     10.0%
d       932     10.0%
i       932     10.0%
a       932     10.0%
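
Because experimentid holds the same string in every row, the character table follows directly from counting the characters of "wenetIndia" once and multiplying by the number of observations. A minimal sketch of that arithmetic, using only the value and row count stated in this report:

```python
from collections import Counter

value = "wenetIndia"  # constant value reported for experimentid
n_rows = 932          # number of observations in the report

# Count each character once, then scale by the number of rows.
per_value = Counter(value)
totals = {ch: k * n_rows for ch, k in per_value.items()}
total_chars = sum(totals.values())

# Print characters from most to least frequent.
for ch, count in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(ch, count, f"{count / total_chars:.1%}")
```

Upper-case "I" and lower-case "i" are counted separately, which is why the report lists 8 distinct characters for a 10-character string.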

Most occurring categories

Value       Count   Frequency (%)
(unknown)   9320    100.0%

Most frequent character per category

(unknown)
Value   Count   Frequency (%)
e       1864    20.0%
n       1864    20.0%
w       932     10.0%
t       932     10.0%
I       932     10.0%
d       932     10.0%
i       932     10.0%
a       932     10.0%

Most occurring scripts

Value       Count   Frequency (%)
(unknown)   9320    100.0%

Most frequent character per script

(unknown)
Value   Count   Frequency (%)
e       1864    20.0%
n       1864    20.0%
w       932     10.0%
t       932     10.0%
I       932     10.0%
d       932     10.0%
i       932     10.0%
a       932     10.0%

Most occurring blocks

Value       Count   Frequency (%)
(unknown)   9320    100.0%

Most frequent character per block

(unknown)
Value   Count   Frequency (%)
e       1864    20.0%
n       1864    20.0%
w       932     10.0%
t       932     10.0%
I       932     10.0%
d       932     10.0%
i       932     10.0%
a       932     10.0%

userid
Real number (ℝ)

User id

Distinct: 7
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22.98819742
Minimum: 8
Maximum: 46
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 7.4 KiB

Quantile statistics

Minimum: 8
5-th percentile: 9
Q1: 9
Median: 26
Q3: 35
95-th percentile: 35
Maximum: 46
Range: 38
Interquartile range (IQR): 26

Descriptive statistics

Standard deviation: 10.39916909
Coefficient of variation (CV): 0.4523699225
Kurtosis: -1.368569244
Mean: 22.98819742
Median Absolute Deviation (MAD): 9
Skewness: -0.2817553903
Sum: 21425
Variance: 108.1427177
Monotonicity: Increasing
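
Several of the userid statistics above can be cross-checked from the value counts the report lists for this variable. The sketch below rebuilds the sum, mean, variance, standard deviation, and coefficient of variation from those counts. It assumes the variance uses the sample (n - 1) denominator, which is an assumption that reproduces the reported 108.1427177.

```python
import math

# userid value -> count, copied from this report's frequency table.
value_counts = {26: 333, 9: 286, 35: 260, 17: 31, 25: 14, 8: 4, 46: 4}

n = sum(value_counts.values())                       # number of observations
total = sum(v * c for v, c in value_counts.items())  # "Sum" in the report
mean = total / n

# Sample variance with an (n - 1) denominator (assumed, see lead-in).
squared_dev = sum(c * (v - mean) ** 2 for v, c in value_counts.items())
variance = squared_dev / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(n, total)
print(round(mean, 8), round(variance, 7), round(std, 8), round(cv, 10))
```

Although userid is profiled as a real number, it is an identifier, so these moments describe how observations are spread across the seven users rather than any measured quantity.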
Histogram with fixed size bins (bins=7)

Value   Count   Frequency (%)
26      333     35.7%
9       286     30.7%
35      260     27.9%
17      31      3.3%
25      14      1.5%
8       4       0.4%
46      4       0.4%

Value   Count   Frequency (%)
8       4       0.4%
9       286     30.7%
17      31      3.3%
25      14      1.5%
26      333     35.7%

Value   Count   Frequency (%)
46      4       0.4%
35      260     27.9%
26      333     35.7%
25      14      1.5%
17      31      3.3%

timestamp
Date

UNIQUE 

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct: 932
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 7.4 KiB
Minimum: 2021-07-12 15:09:42.190000
Maximum: 2021-08-02 23:12:01.975000
Histogram with fixed size bins (bins=50)
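
The reported minimum and maximum timestamps give the collection window for these observations. A small sketch of that arithmetic, assuming both values are expressed in the same (unspecified) timezone:

```python
from datetime import datetime

# Minimum and maximum timestamps as reported for this variable.
first = datetime.fromisoformat("2021-07-12 15:09:42.190000")
last = datetime.fromisoformat("2021-08-02 23:12:01.975000")

span = last - first
print(span)  # 21 days, 8:02:19.785000

# Average number of observations per day over the whole window
# (932 is the number of observations in the report).
per_day = 932 / (span.total_seconds() / 86400)
print(round(per_day, 1))
```

The average is only a rough density figure: the 50-bin histogram may show that observations cluster on particular days rather than being spread evenly.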

status
Boolean

IMBALANCE 

Returns whether a music event is being played

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Value   Count   Frequency (%)
True    925     99.2%
False   7       0.8%

Correlations

         status   userid
status   1.000    0.047
userid   0.047    1.000