Overview

Dataset statistics

Number of variables8
Number of observations5458512
Missing cells0
Missing cells (%)0.0%
Total size in memory544.5 MiB
Average record size in memory104.6 B

Variable types

Text3
Numeric2
DateTime1
Boolean2

Dataset

Description[unitless] Measures when the phone receives a notification and when it is dismissed by the user. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
idThe notification id
isclearableReturn FALSE if it can not be clear with "clear all notifications"
isongoingReturn if the app continue independent on the notification
packageThe application package that creates the notification
statusThe notification status (removed, posted, …)

Alerts

experimentid has constant value "wenetItaly"Constant
id has 413984 (7.6%) zerosZeros

Reproduction

Analysis started2024-11-24 09:50:53.444785
Analysis finished2024-11-24 09:51:22.693613
Duration29.25 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size135.3 MiB
2024-11-24T10:51:22.742229image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters54585120
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetItaly
2nd rowwenetItaly
3rd rowwenetItaly
4th rowwenetItaly
5th rowwenetItaly
ValueCountFrequency (%)
wenetitaly 5458512
100.0%
2024-11-24T10:51:22.993455image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 10917024
20.0%
t 10917024
20.0%
w 5458512
10.0%
n 5458512
10.0%
I 5458512
10.0%
a 5458512
10.0%
l 5458512
10.0%
y 5458512
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 54585120
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 10917024
20.0%
t 10917024
20.0%
w 5458512
10.0%
n 5458512
10.0%
I 5458512
10.0%
a 5458512
10.0%
l 5458512
10.0%
y 5458512
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 54585120
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 10917024
20.0%
t 10917024
20.0%
w 5458512
10.0%
n 5458512
10.0%
I 5458512
10.0%
a 5458512
10.0%
l 5458512
10.0%
y 5458512
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 54585120
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 10917024
20.0%
t 10917024
20.0%
w 5458512
10.0%
n 5458512
10.0%
I 5458512
10.0%
a 5458512
10.0%
l 5458512
10.0%
y 5458512
10.0%

userid
Real number (ℝ)

User id

Distinct167
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean138.6582391
Minimum1
Maximum265
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size83.3 MiB
2024-11-24T10:51:23.116932image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9
Q172
median148
Q3203
95-th percentile257
Maximum265
Range264
Interquartile range (IQR)131

Descriptive statistics

Standard deviation79.15913091
Coefficient of variation (CV)0.5708938136
Kurtosis-1.24701017
Mean138.6582391
Median Absolute Deviation (MAD)65
Skewness-0.1805830293
Sum756867662
Variance6266.168006
MonotonicityIncreasing
2024-11-24T10:51:23.240211image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
191 160030
 
2.9%
200 148798
 
2.7%
32 147885
 
2.7%
187 142105
 
2.6%
66 116056
 
2.1%
263 100021
 
1.8%
83 97016
 
1.8%
176 92980
 
1.7%
148 91150
 
1.7%
213 90215
 
1.7%
Other values (157) 4272256
78.3%
ValueCountFrequency (%)
1 69105
1.3%
3 44998
0.8%
5 24120
 
0.4%
6 32557
0.6%
7 23060
 
0.4%
ValueCountFrequency (%)
265 36766
 
0.7%
264 36118
 
0.7%
263 100021
1.8%
262 50036
0.9%
260 493
 
< 0.1%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct5449509
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size83.3 MiB
Minimum2020-11-16 07:00:00.138000
Maximum2020-12-11 21:59:59.259000
2024-11-24T10:51:23.360279image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-24T10:51:23.480013image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

id
Real number (ℝ)

ZEROS 

The notification id

Distinct41905
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean208139829.7
Minimum-2147483648
Maximum2147483647
Zeros413984
Zeros (%)7.6%
Negative705092
Negative (%)12.9%
Memory size83.3 MiB
2024-11-24T10:51:23.600563image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum-2147483648
5-th percentile-1125649384
Q11
median4
Q311000
95-th percentile2131428166
Maximum2147483647
Range4294967295
Interquartile range (IQR)10999

Descriptive statistics

Standard deviation842431727.8
Coefficient of variation (CV)4.047431619
Kurtosis1.772924867
Mean208139829.7
Median Absolute Deviation (MAD)10
Skewness1.000505654
Sum1.136133758 × 1015
Variance7.09691216 × 1017
MonotonicityNot monotonic
2024-11-24T10:51:23.721360image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1463216
26.8%
14 468527
 
8.6%
0 413984
 
7.6%
-56862258 217199
 
4.0%
117506050 146865
 
2.7%
8 134843
 
2.5%
11 134638
 
2.5%
-1 100102
 
1.8%
2131362944 88417
 
1.6%
-1873494995 78059
 
1.4%
Other values (41895) 2212662
40.5%
ValueCountFrequency (%)
-2147483648 1196
< 0.1%
-2147483647 286
 
< 0.1%
-2147483646 674
< 0.1%
-2147483645 89
 
< 0.1%
-2147483644 411
 
< 0.1%
ValueCountFrequency (%)
2147483647 5667
0.1%
2147483646 3114
 
0.1%
2147483645 11943
0.2%
2147483644 955
 
< 0.1%
2147483643 4328
 
0.1%

isclearable
Boolean

Return FALSE if it can not be clear with "clear all notifications"

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size46.9 MiB
False
3453573 
True
2004939 
ValueCountFrequency (%)
False 3453573
63.3%
True 2004939
36.7%
2024-11-24T10:51:23.814278image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

isongoing
Boolean

Return if the app continue independent on the notification

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size46.9 MiB
True
3449625 
False
2008887 
ValueCountFrequency (%)
True 3449625
63.2%
False 2008887
36.8%
2024-11-24T10:51:23.886837image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

package
Text

The application package that creates the notification

Distinct978
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size173.9 MiB
2024-11-24T10:51:23.992461image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length75
Median length67
Mean length17.39820981
Min length5

Characters and Unicode

Total characters94968337
Distinct characters61
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)< 0.1%

Sample

1st rowandroid
2nd rowcom.android.systemui
3rd rowcom.android.systemui
4th rowcom.android.systemui
5th rowcom.android.systemui
ValueCountFrequency (%)
com.whatsapp 1840833
33.7%
android 555815
 
10.2%
com.android.systemui 398745
 
7.3%
org.telegram.messenger 348494
 
6.4%
com.spotify.music 242610
 
4.4%
com.android.vending 219942
 
4.0%
com.google.android.apps.maps 172197
 
3.2%
com.sec.android.app.voicenote 146901
 
2.7%
com.google.android.googlequicksearchbox 121439
 
2.2%
cc.forestapp 98749
 
1.8%
Other values (968) 1312787
24.1%
2024-11-24T10:51:24.279543image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 9984555
 
10.5%
. 9309615
 
9.8%
a 9004048
 
9.5%
m 6654834
 
7.0%
c 6280621
 
6.6%
p 5809998
 
6.1%
s 5644517
 
5.9%
e 5387731
 
5.7%
d 5198511
 
5.5%
i 4802725
 
5.1%
Other values (51) 26891182
28.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 94968337
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 9984555
 
10.5%
. 9309615
 
9.8%
a 9004048
 
9.5%
m 6654834
 
7.0%
c 6280621
 
6.6%
p 5809998
 
6.1%
s 5644517
 
5.9%
e 5387731
 
5.7%
d 5198511
 
5.5%
i 4802725
 
5.1%
Other values (51) 26891182
28.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 94968337
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 9984555
 
10.5%
. 9309615
 
9.8%
a 9004048
 
9.5%
m 6654834
 
7.0%
c 6280621
 
6.6%
p 5809998
 
6.1%
s 5644517
 
5.9%
e 5387731
 
5.7%
d 5198511
 
5.5%
i 4802725
 
5.1%
Other values (51) 26891182
28.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 94968337
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 9984555
 
10.5%
. 9309615
 
9.8%
a 9004048
 
9.5%
m 6654834
 
7.0%
c 6280621
 
6.6%
p 5809998
 
6.1%
s 5644517
 
5.9%
e 5387731
 
5.7%
d 5198511
 
5.5%
i 4802725
 
5.1%
Other values (51) 26891182
28.3%

status
Text

The notification status (removed, posted, …)

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size183.2 MiB
2024-11-24T10:51:24.396471image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length20
Median length19
Mean length19.19548588
Min length19

Characters and Unicode

Total characters104778790
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownotification_removed
2nd rownotification_posted
3rd rownotification_posted
4th rownotification_posted
5th rownotification_posted
ValueCountFrequency (%)
notification_posted 4391450
80.5%
notification_removed 1067062
 
19.5%
2024-11-24T10:51:24.594001image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 16375536
15.6%
i 16375536
15.6%
t 15308474
14.6%
n 10917024
10.4%
e 6525574
 
6.2%
f 5458512
 
5.2%
c 5458512
 
5.2%
a 5458512
 
5.2%
_ 5458512
 
5.2%
d 5458512
 
5.2%
Other values (5) 11984086
11.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 104778790
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 16375536
15.6%
i 16375536
15.6%
t 15308474
14.6%
n 10917024
10.4%
e 6525574
 
6.2%
f 5458512
 
5.2%
c 5458512
 
5.2%
a 5458512
 
5.2%
_ 5458512
 
5.2%
d 5458512
 
5.2%
Other values (5) 11984086
11.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 104778790
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 16375536
15.6%
i 16375536
15.6%
t 15308474
14.6%
n 10917024
10.4%
e 6525574
 
6.2%
f 5458512
 
5.2%
c 5458512
 
5.2%
a 5458512
 
5.2%
_ 5458512
 
5.2%
d 5458512
 
5.2%
Other values (5) 11984086
11.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 104778790
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 16375536
15.6%
i 16375536
15.6%
t 15308474
14.6%
n 10917024
10.4%
e 6525574
 
6.2%
f 5458512
 
5.2%
c 5458512
 
5.2%
a 5458512
 
5.2%
_ 5458512
 
5.2%
d 5458512
 
5.2%
Other values (5) 11984086
11.4%