Overview

Dataset statistics

Number of variables8
Number of observations737416
Missing cells0
Missing cells (%)0.0%
Total size in memory70.4 MiB
Average record size in memory100.1 B

Variable types

Text3
Numeric2
DateTime1
Boolean2

Dataset

Description[unitless] Measures when the phone receives a notification and when it is dismissed by the user. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
idThe notification id
isclearableReturn FALSE if it can not be clear with "clear all notifications"
isongoingReturn if the app continue independent on the notification
packageThe application package that creates the notification
statusThe notification status (removed, posted, …)

Alerts

experimentid has constant value "wenetIndia"Constant
isclearable is highly overall correlated with isongoingHigh correlation
isongoing is highly overall correlated with isclearableHigh correlation
userid has 54741 (7.4%) zerosZeros
id has 16229 (2.2%) zerosZeros

Reproduction

Analysis started2024-11-24 09:46:49.852467
Analysis finished2024-11-24 09:46:54.362994
Duration4.51 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size12.7 MiB
2024-11-24T10:46:54.483170image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters7374160
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetIndia
2nd rowwenetIndia
3rd rowwenetIndia
4th rowwenetIndia
5th rowwenetIndia
ValueCountFrequency (%)
wenetindia 737416
100.0%
2024-11-24T10:46:54.671498image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 1474832
20.0%
n 1474832
20.0%
w 737416
10.0%
t 737416
10.0%
I 737416
10.0%
d 737416
10.0%
i 737416
10.0%
a 737416
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 7374160
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 1474832
20.0%
n 1474832
20.0%
w 737416
10.0%
t 737416
10.0%
I 737416
10.0%
d 737416
10.0%
i 737416
10.0%
a 737416
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 7374160
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 1474832
20.0%
n 1474832
20.0%
w 737416
10.0%
t 737416
10.0%
I 737416
10.0%
d 737416
10.0%
i 737416
10.0%
a 737416
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 7374160
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 1474832
20.0%
n 1474832
20.0%
w 737416
10.0%
t 737416
10.0%
I 737416
10.0%
d 737416
10.0%
i 737416
10.0%
a 737416
10.0%

userid
Real number (ℝ)

ZEROS 

User id

Distinct17
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.21767089
Minimum0
Maximum62
Zeros54741
Zeros (%)7.4%
Negative0
Negative (%)0.0%
Memory size5.6 MiB
2024-11-24T10:46:54.773761image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q19
median9
Q324
95-th percentile44
Maximum62
Range62
Interquartile range (IQR)15

Descriptive statistics

Standard deviation13.59323055
Coefficient of variation (CV)0.8932530247
Kurtosis1.825959429
Mean15.21767089
Median Absolute Deviation (MAD)0
Skewness1.608810224
Sum11221754
Variance184.7759169
MonotonicityIncreasing
2024-11-24T10:46:54.866361image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
9 402124
54.5%
24 81731
 
11.1%
0 54741
 
7.4%
12 43247
 
5.9%
44 33180
 
4.5%
4 30969
 
4.2%
35 30121
 
4.1%
43 17792
 
2.4%
57 14139
 
1.9%
18 6810
 
0.9%
Other values (7) 22562
 
3.1%
ValueCountFrequency (%)
0 54741
 
7.4%
4 30969
 
4.2%
8 5034
 
0.7%
9 402124
54.5%
12 43247
 
5.9%
ValueCountFrequency (%)
62 6091
 
0.8%
57 14139
1.9%
49 5143
 
0.7%
44 33180
4.5%
43 17792
2.4%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct737313
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size5.6 MiB
Minimum2021-07-12 08:00:38.004000
Maximum2021-08-12 14:40:00.550000
2024-11-24T10:46:54.977959image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-24T10:46:55.101054image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

id
Real number (ℝ)

ZEROS 

The notification id

Distinct4366
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean93304987.68
Minimum-2147483648
Maximum2147483647
Zeros16229
Zeros (%)2.2%
Negative30877
Negative (%)4.2%
Memory size5.6 MiB
2024-11-24T10:46:55.223198image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum-2147483648
5-th percentile0
Q11
median1
Q38
95-th percentile1431325696
Maximum2147483647
Range4294967295
Interquartile range (IQR)7

Descriptive statistics

Standard deviation503248140.4
Coefficient of variation (CV)5.393582411
Kurtosis11.23082293
Mean93304987.68
Median Absolute Deviation (MAD)0
Skewness2.663194305
Sum6.880459079 × 1013
Variance2.532586908 × 1017
MonotonicityNot monotonic
2024-11-24T10:46:55.344081image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 481470
65.3%
26111991 72878
 
9.9%
8 22437
 
3.0%
0 16229
 
2.2%
2131362953 13572
 
1.8%
-56862258 13170
 
1.8%
1000 8400
 
1.1%
14 8220
 
1.1%
11 7409
 
1.0%
2 6345
 
0.9%
Other values (4356) 87286
 
11.8%
ValueCountFrequency (%)
-2147483648 163
< 0.1%
-2147483647 64
 
< 0.1%
-2147483646 6
 
< 0.1%
-2147483645 6
 
< 0.1%
-2147483644 8
 
< 0.1%
ValueCountFrequency (%)
2147483647 328
< 0.1%
2147483646 67
 
< 0.1%
2147483645 93
 
< 0.1%
2147483644 21
 
< 0.1%
2147483641 4
 
< 0.1%

isclearable
Boolean

HIGH CORRELATION 

Return FALSE if it can not be clear with "clear all notifications"

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size720.3 KiB
False
605720 
True
131696 
ValueCountFrequency (%)
False 605720
82.1%
True 131696
 
17.9%
2024-11-24T10:46:55.440970image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

isongoing
Boolean

HIGH CORRELATION 

Return if the app continue independent on the notification

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size720.3 KiB
True
605506 
False
131910 
ValueCountFrequency (%)
True 605506
82.1%
False 131910
 
17.9%
2024-11-24T10:46:55.519771image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

package
Text

The application package that creates the notification

Distinct260
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size20.4 MiB
2024-11-24T10:46:55.595359image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length87
Median length26
Mean length20.96939719
Min length7

Characters and Unicode

Total characters15463169
Distinct characters46
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)< 0.1%

Sample

1st rowandroid
2nd rowcom.oneplus.mms
3rd rowcom.oneplus.mms
4th rowcom.oneplus.mms
5th rowcom.oneplus.mms
ValueCountFrequency (%)
com.nisargjhaveri.netspeed 373244
50.6%
com.whatsapp 112666
 
15.3%
com.pepkit.ssg 72898
 
9.9%
android 37781
 
5.1%
com.android.systemui 26323
 
3.6%
com.android.vending 13191
 
1.8%
org.telegram.messenger 7353
 
1.0%
com.google.android.gm 7285
 
1.0%
com.fast.free.unblock.secure.vpn 6459
 
0.9%
com.snapchat.android 6185
 
0.8%
Other values (250) 74031
 
10.0%
2024-11-24T10:46:55.840232image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 1773301
 
11.5%
. 1350718
 
8.7%
a 1192249
 
7.7%
s 1181135
 
7.6%
i 1053667
 
6.8%
r 965557
 
6.2%
o 960823
 
6.2%
n 945952
 
6.1%
p 810580
 
5.2%
m 791266
 
5.1%
Other values (36) 4437921
28.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 15463169
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 1773301
 
11.5%
. 1350718
 
8.7%
a 1192249
 
7.7%
s 1181135
 
7.6%
i 1053667
 
6.8%
r 965557
 
6.2%
o 960823
 
6.2%
n 945952
 
6.1%
p 810580
 
5.2%
m 791266
 
5.1%
Other values (36) 4437921
28.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 15463169
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 1773301
 
11.5%
. 1350718
 
8.7%
a 1192249
 
7.7%
s 1181135
 
7.6%
i 1053667
 
6.8%
r 965557
 
6.2%
o 960823
 
6.2%
n 945952
 
6.1%
p 810580
 
5.2%
m 791266
 
5.1%
Other values (36) 4437921
28.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 15463169
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 1773301
 
11.5%
. 1350718
 
8.7%
a 1192249
 
7.7%
s 1181135
 
7.6%
i 1053667
 
6.8%
r 965557
 
6.2%
o 960823
 
6.2%
n 945952
 
6.1%
p 810580
 
5.2%
m 791266
 
5.1%
Other values (36) 4437921
28.7%

status
Text

The notification status (removed, posted, …)

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.0 MiB
2024-11-24T10:46:55.929509image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length20
Median length19
Mean length19.08742555
Min length19

Characters and Unicode

Total characters14075373
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownotification_posted
2nd rownotification_posted
3rd rownotification_removed
4th rownotification_posted
5th rownotification_removed
ValueCountFrequency (%)
notification_posted 672947
91.3%
notification_removed 64469
 
8.7%
2024-11-24T10:46:56.113270image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 2212248
15.7%
i 2212248
15.7%
t 2147779
15.3%
n 1474832
10.5%
e 801885
 
5.7%
f 737416
 
5.2%
c 737416
 
5.2%
a 737416
 
5.2%
_ 737416
 
5.2%
d 737416
 
5.2%
Other values (5) 1539301
10.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 14075373
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 2212248
15.7%
i 2212248
15.7%
t 2147779
15.3%
n 1474832
10.5%
e 801885
 
5.7%
f 737416
 
5.2%
c 737416
 
5.2%
a 737416
 
5.2%
_ 737416
 
5.2%
d 737416
 
5.2%
Other values (5) 1539301
10.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 14075373
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 2212248
15.7%
i 2212248
15.7%
t 2147779
15.3%
n 1474832
10.5%
e 801885
 
5.7%
f 737416
 
5.2%
c 737416
 
5.2%
a 737416
 
5.2%
_ 737416
 
5.2%
d 737416
 
5.2%
Other values (5) 1539301
10.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 14075373
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 2212248
15.7%
i 2212248
15.7%
t 2147779
15.3%
n 1474832
10.5%
e 801885
 
5.7%
f 737416
 
5.2%
c 737416
 
5.2%
a 737416
 
5.2%
_ 737416
 
5.2%
d 737416
 
5.2%
Other values (5) 1539301
10.9%

Correlations

2024-11-24T10:46:56.189916image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
idisclearableisongoinguserid
id1.0000.3300.3290.298
isclearable0.3301.0000.9990.165
isongoing0.3290.9991.000-0.165
userid0.2980.165-0.1651.000