Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 5458512 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 544.5 MiB |
Average record size in memory | 104.6 B |
Variable types
Text | 3 |
---|---|
Numeric | 2 |
DateTime | 1 |
Boolean | 2 |
Dataset
Variable descriptions
experimentid | Experiment Id |
---|---|
userid | User id |
timestamp | show month(2), day(2), hour(2), minute(2), second(2), decimals(3) |
id | The notification id |
isclearable | Return FALSE if it can not be clear with "clear all notifications" |
isongoing | Return if the app continue independent on the notification |
package | The application package that creates the notification |
status | The notification status (removed, posted, …) |
experimentid has constant value "wenetItaly" | Constant |
id has 413984 (7.6%) zeros | Zeros |
Reproduction
Analysis started | 2024-11-24 09:50:53.444785 |
---|---|
Analysis finished | 2024-11-24 09:51:22.693613 |
Duration | 29.25 seconds |
Software version | ydata-profiling v4.8.3 |
Download configuration | config.json |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 135.3 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 54585120 |
---|---|
Distinct characters | 8 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | wenetItaly |
---|---|
2nd row | wenetItaly |
3rd row | wenetItaly |
4th row | wenetItaly |
5th row | wenetItaly |
Value | Count | Frequency (%) |
wenetitaly | 5458512 |
Most occurring characters
Value | Count | Frequency (%) |
e | 10917024 | |
t | 10917024 | |
w | 5458512 | |
n | 5458512 | |
I | 5458512 | |
a | 5458512 | |
l | 5458512 | |
y | 5458512 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 54585120 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 10917024 | |
t | 10917024 | |
w | 5458512 | |
n | 5458512 | |
I | 5458512 | |
a | 5458512 | |
l | 5458512 | |
y | 5458512 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 54585120 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 10917024 | |
t | 10917024 | |
w | 5458512 | |
n | 5458512 | |
I | 5458512 | |
a | 5458512 | |
l | 5458512 | |
y | 5458512 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 54585120 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 10917024 | |
t | 10917024 | |
w | 5458512 | |
n | 5458512 | |
I | 5458512 | |
a | 5458512 | |
l | 5458512 | |
y | 5458512 |
userid
Real number (ℝ)
User id
Distinct | 167 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 138.6582391 |
Minimum | 1 |
---|---|
Maximum | 265 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 83.3 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 9 |
Q1 | 72 |
median | 148 |
Q3 | 203 |
95-th percentile | 257 |
Maximum | 265 |
Range | 264 |
Interquartile range (IQR) | 131 |
Descriptive statistics
Standard deviation | 79.15913091 |
---|---|
Coefficient of variation (CV) | 0.5708938136 |
Kurtosis | -1.24701017 |
Mean | 138.6582391 |
Median Absolute Deviation (MAD) | 65 |
Skewness | -0.1805830293 |
Sum | 756867662 |
Variance | 6266.168006 |
Monotonicity | Increasing |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
191 | 160030 | 2.9% |
200 | 148798 | 2.7% |
32 | 147885 | 2.7% |
187 | 142105 | 2.6% |
66 | 116056 | 2.1% |
263 | 100021 | 1.8% |
83 | 97016 | 1.8% |
176 | 92980 | 1.7% |
148 | 91150 | 1.7% |
213 | 90215 | 1.7% |
Other values (157) | 4272256 |
Value | Count | Frequency (%) |
1 | 69105 | |
3 | 44998 | |
5 | 24120 | 0.4% |
6 | 32557 | |
7 | 23060 | 0.4% |
Value | Count | Frequency (%) |
265 | 36766 | 0.7% |
264 | 36118 | 0.7% |
263 | 100021 | |
262 | 50036 | |
260 | 493 | < 0.1% |
timestamp
Date
show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
Distinct | 5449509 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 83.3 MiB |
Minimum | 2020-11-16 07:00:00.138000 |
---|---|
Maximum | 2020-12-11 21:59:59.259000 |
Histogram with fixed size bins (bins=50)
Distinct | 41905 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 208139829.7 |
Minimum | -2147483648 |
---|---|
Maximum | 2147483647 |
Zeros | 413984 |
Zeros (%) | 7.6% |
Negative | 705092 |
Negative (%) | 12.9% |
Memory size | 83.3 MiB |
Quantile statistics
Minimum | -2147483648 |
---|---|
5-th percentile | -1125649384 |
Q1 | 1 |
median | 4 |
Q3 | 11000 |
95-th percentile | 2131428166 |
Maximum | 2147483647 |
Range | 4294967295 |
Interquartile range (IQR) | 10999 |
Descriptive statistics
Standard deviation | 842431727.8 |
---|---|
Coefficient of variation (CV) | 4.047431619 |
Kurtosis | 1.772924867 |
Mean | 208139829.7 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 1.000505654 |
Sum | 1.136133758 × 1015 |
Variance | 7.09691216 × 1017 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
1 | 1463216 | |
14 | 468527 | 8.6% |
0 | 413984 | 7.6% |
-56862258 | 217199 | 4.0% |
117506050 | 146865 | 2.7% |
8 | 134843 | 2.5% |
11 | 134638 | 2.5% |
-1 | 100102 | 1.8% |
2131362944 | 88417 | 1.6% |
-1873494995 | 78059 | 1.4% |
Other values (41895) | 2212662 |
Value | Count | Frequency (%) |
-2147483648 | 1196 | |
-2147483647 | 286 | < 0.1% |
-2147483646 | 674 | |
-2147483645 | 89 | < 0.1% |
-2147483644 | 411 | < 0.1% |
Value | Count | Frequency (%) |
2147483647 | 5667 | |
2147483646 | 3114 | 0.1% |
2147483645 | 11943 | |
2147483644 | 955 | < 0.1% |
2147483643 | 4328 | 0.1% |
isclearable
Boolean
Return FALSE if it can not be clear with "clear all notifications"
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 46.9 MiB |
False | |
---|---|
True |
Value | Count | Frequency (%) |
False | 3453573 | |
True | 2004939 |
isongoing
Boolean
Return if the app continue independent on the notification
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 46.9 MiB |
True | |
---|---|
False |
Value | Count | Frequency (%) |
True | 3449625 | |
False | 2008887 |
package
Text
The application package that creates the notification
Distinct | 978 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 173.9 MiB |
Length
Max length | 75 |
---|---|
Median length | 67 |
Mean length | 17.39820981 |
Min length | 5 |
Characters and Unicode
Total characters | 94968337 |
---|---|
Distinct characters | 61 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 16 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | android |
---|---|
2nd row | com.android.systemui |
3rd row | com.android.systemui |
4th row | com.android.systemui |
5th row | com.android.systemui |
Value | Count | Frequency (%) |
com.whatsapp | 1840833 | |
android | 555815 | 10.2% |
com.android.systemui | 398745 | 7.3% |
org.telegram.messenger | 348494 | 6.4% |
com.spotify.music | 242610 | 4.4% |
com.android.vending | 219942 | 4.0% |
com.google.android.apps.maps | 172197 | 3.2% |
com.sec.android.app.voicenote | 146901 | 2.7% |
com.google.android.googlequicksearchbox | 121439 | 2.2% |
cc.forestapp | 98749 | 1.8% |
Other values (968) | 1312787 |
Most occurring characters
Value | Count | Frequency (%) |
o | 9984555 | 10.5% |
. | 9309615 | 9.8% |
a | 9004048 | 9.5% |
m | 6654834 | 7.0% |
c | 6280621 | 6.6% |
p | 5809998 | 6.1% |
s | 5644517 | 5.9% |
e | 5387731 | 5.7% |
d | 5198511 | 5.5% |
i | 4802725 | 5.1% |
Other values (51) | 26891182 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 94968337 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
o | 9984555 | 10.5% |
. | 9309615 | 9.8% |
a | 9004048 | 9.5% |
m | 6654834 | 7.0% |
c | 6280621 | 6.6% |
p | 5809998 | 6.1% |
s | 5644517 | 5.9% |
e | 5387731 | 5.7% |
d | 5198511 | 5.5% |
i | 4802725 | 5.1% |
Other values (51) | 26891182 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 94968337 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
o | 9984555 | 10.5% |
. | 9309615 | 9.8% |
a | 9004048 | 9.5% |
m | 6654834 | 7.0% |
c | 6280621 | 6.6% |
p | 5809998 | 6.1% |
s | 5644517 | 5.9% |
e | 5387731 | 5.7% |
d | 5198511 | 5.5% |
i | 4802725 | 5.1% |
Other values (51) | 26891182 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 94968337 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
o | 9984555 | 10.5% |
. | 9309615 | 9.8% |
a | 9004048 | 9.5% |
m | 6654834 | 7.0% |
c | 6280621 | 6.6% |
p | 5809998 | 6.1% |
s | 5644517 | 5.9% |
e | 5387731 | 5.7% |
d | 5198511 | 5.5% |
i | 4802725 | 5.1% |
Other values (51) | 26891182 |
status
Text
The notification status (removed, posted, …)
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 183.2 MiB |
Length
Max length | 20 |
---|---|
Median length | 19 |
Mean length | 19.19548588 |
Min length | 19 |
Characters and Unicode
Total characters | 104778790 |
---|---|
Distinct characters | 15 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | notification_removed |
---|---|
2nd row | notification_posted |
3rd row | notification_posted |
4th row | notification_posted |
5th row | notification_posted |
Value | Count | Frequency (%) |
notification_posted | 4391450 | |
notification_removed | 1067062 | 19.5% |
Most occurring characters
Value | Count | Frequency (%) |
o | 16375536 | |
i | 16375536 | |
t | 15308474 | |
n | 10917024 | |
e | 6525574 | 6.2% |
f | 5458512 | 5.2% |
c | 5458512 | 5.2% |
a | 5458512 | 5.2% |
_ | 5458512 | 5.2% |
d | 5458512 | 5.2% |
Other values (5) | 11984086 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 104778790 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
o | 16375536 | |
i | 16375536 | |
t | 15308474 | |
n | 10917024 | |
e | 6525574 | 6.2% |
f | 5458512 | 5.2% |
c | 5458512 | 5.2% |
a | 5458512 | 5.2% |
_ | 5458512 | 5.2% |
d | 5458512 | 5.2% |
Other values (5) | 11984086 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 104778790 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
o | 16375536 | |
i | 16375536 | |
t | 15308474 | |
n | 10917024 | |
e | 6525574 | 6.2% |
f | 5458512 | 5.2% |
c | 5458512 | 5.2% |
a | 5458512 | 5.2% |
_ | 5458512 | 5.2% |
d | 5458512 | 5.2% |
Other values (5) | 11984086 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 104778790 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
o | 16375536 | |
i | 16375536 | |
t | 15308474 | |
n | 10917024 | |
e | 6525574 | 6.2% |
f | 5458512 | 5.2% |
c | 5458512 | 5.2% |
a | 5458512 | 5.2% |
_ | 5458512 | 5.2% |
d | 5458512 | 5.2% |
Other values (5) | 11984086 |