Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 737416 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 70.4 MiB |
Average record size in memory | 100.1 B |
Variable types
Text | 3 |
---|---|
Numeric | 2 |
DateTime | 1 |
Boolean | 2 |
Dataset
Variable descriptions
experimentid | Experiment Id |
---|---|
userid | User id |
timestamp | show month(2), day(2), hour(2), minute(2), second(2), decimals(3) |
id | The notification id |
isclearable | Return FALSE if it can not be clear with "clear all notifications" |
isongoing | Return if the app continue independent on the notification |
package | The application package that creates the notification |
status | The notification status (removed, posted, …) |
experimentid has constant value "wenetIndia" | Constant |
isclearable is highly overall correlated with isongoing | High correlation |
isongoing is highly overall correlated with isclearable | High correlation |
userid has 54741 (7.4%) zeros | Zeros |
id has 16229 (2.2%) zeros | Zeros |
Reproduction
Analysis started | 2024-11-24 09:46:49.852467 |
---|---|
Analysis finished | 2024-11-24 09:46:54.362994 |
Duration | 4.51 seconds |
Software version | ydata-profiling v4.8.3 |
Download configuration | config.json |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.7 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 7374160 |
---|---|
Distinct characters | 8 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | wenetIndia |
---|---|
2nd row | wenetIndia |
3rd row | wenetIndia |
4th row | wenetIndia |
5th row | wenetIndia |
Value | Count | Frequency (%) |
wenetindia | 737416 |
Most occurring characters
Value | Count | Frequency (%) |
e | 1474832 | |
n | 1474832 | |
w | 737416 | |
t | 737416 | |
I | 737416 | |
d | 737416 | |
i | 737416 | |
a | 737416 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 7374160 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 1474832 | |
n | 1474832 | |
w | 737416 | |
t | 737416 | |
I | 737416 | |
d | 737416 | |
i | 737416 | |
a | 737416 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 7374160 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 1474832 | |
n | 1474832 | |
w | 737416 | |
t | 737416 | |
I | 737416 | |
d | 737416 | |
i | 737416 | |
a | 737416 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 7374160 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 1474832 | |
n | 1474832 | |
w | 737416 | |
t | 737416 | |
I | 737416 | |
d | 737416 | |
i | 737416 | |
a | 737416 |
Distinct | 17 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.21767089 |
Minimum | 0 |
---|---|
Maximum | 62 |
Zeros | 54741 |
Zeros (%) | 7.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.6 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 9 |
median | 9 |
Q3 | 24 |
95-th percentile | 44 |
Maximum | 62 |
Range | 62 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 13.59323055 |
---|---|
Coefficient of variation (CV) | 0.8932530247 |
Kurtosis | 1.825959429 |
Mean | 15.21767089 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.608810224 |
Sum | 11221754 |
Variance | 184.7759169 |
Monotonicity | Increasing |
Histogram with fixed size bins (bins=17)
Value | Count | Frequency (%) |
9 | 402124 | |
24 | 81731 | 11.1% |
0 | 54741 | 7.4% |
12 | 43247 | 5.9% |
44 | 33180 | 4.5% |
4 | 30969 | 4.2% |
35 | 30121 | 4.1% |
43 | 17792 | 2.4% |
57 | 14139 | 1.9% |
18 | 6810 | 0.9% |
Other values (7) | 22562 | 3.1% |
Value | Count | Frequency (%) |
0 | 54741 | 7.4% |
4 | 30969 | 4.2% |
8 | 5034 | 0.7% |
9 | 402124 | |
12 | 43247 | 5.9% |
Value | Count | Frequency (%) |
62 | 6091 | 0.8% |
57 | 14139 | |
49 | 5143 | 0.7% |
44 | 33180 | |
43 | 17792 |
timestamp
Date
show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
Distinct | 737313 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 MiB |
Minimum | 2021-07-12 08:00:38.004000 |
---|---|
Maximum | 2021-08-12 14:40:00.550000 |
Histogram with fixed size bins (bins=50)
Distinct | 4366 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 93304987.68 |
Minimum | -2147483648 |
---|---|
Maximum | 2147483647 |
Zeros | 16229 |
Zeros (%) | 2.2% |
Negative | 30877 |
Negative (%) | 4.2% |
Memory size | 5.6 MiB |
Quantile statistics
Minimum | -2147483648 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 1 |
Q3 | 8 |
95-th percentile | 1431325696 |
Maximum | 2147483647 |
Range | 4294967295 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 503248140.4 |
---|---|
Coefficient of variation (CV) | 5.393582411 |
Kurtosis | 11.23082293 |
Mean | 93304987.68 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.663194305 |
Sum | 6.880459079 × 1013 |
Variance | 2.532586908 × 1017 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
1 | 481470 | |
26111991 | 72878 | 9.9% |
8 | 22437 | 3.0% |
0 | 16229 | 2.2% |
2131362953 | 13572 | 1.8% |
-56862258 | 13170 | 1.8% |
1000 | 8400 | 1.1% |
14 | 8220 | 1.1% |
11 | 7409 | 1.0% |
2 | 6345 | 0.9% |
Other values (4356) | 87286 | 11.8% |
Value | Count | Frequency (%) |
-2147483648 | 163 | |
-2147483647 | 64 | < 0.1% |
-2147483646 | 6 | < 0.1% |
-2147483645 | 6 | < 0.1% |
-2147483644 | 8 | < 0.1% |
Value | Count | Frequency (%) |
2147483647 | 328 | |
2147483646 | 67 | < 0.1% |
2147483645 | 93 | < 0.1% |
2147483644 | 21 | < 0.1% |
2147483641 | 4 | < 0.1% |
isclearable
Boolean
HIGH CORRELATION
 
Return FALSE if it can not be clear with "clear all notifications"
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 720.3 KiB |
False | |
---|---|
True |
Value | Count | Frequency (%) |
False | 605720 | |
True | 131696 | 17.9% |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 720.3 KiB |
True | |
---|---|
False |
Value | Count | Frequency (%) |
True | 605506 | |
False | 131910 | 17.9% |
package
Text
The application package that creates the notification
Distinct | 260 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.4 MiB |
Length
Max length | 87 |
---|---|
Median length | 26 |
Mean length | 20.96939719 |
Min length | 7 |
Characters and Unicode
Total characters | 15463169 |
---|---|
Distinct characters | 46 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 8 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | android |
---|---|
2nd row | com.oneplus.mms |
3rd row | com.oneplus.mms |
4th row | com.oneplus.mms |
5th row | com.oneplus.mms |
Value | Count | Frequency (%) |
com.nisargjhaveri.netspeed | 373244 | |
com.whatsapp | 112666 | 15.3% |
com.pepkit.ssg | 72898 | 9.9% |
android | 37781 | 5.1% |
com.android.systemui | 26323 | 3.6% |
com.android.vending | 13191 | 1.8% |
org.telegram.messenger | 7353 | 1.0% |
com.google.android.gm | 7285 | 1.0% |
com.fast.free.unblock.secure.vpn | 6459 | 0.9% |
com.snapchat.android | 6185 | 0.8% |
Other values (250) | 74031 | 10.0% |
Most occurring characters
Value | Count | Frequency (%) |
e | 1773301 | 11.5% |
. | 1350718 | 8.7% |
a | 1192249 | 7.7% |
s | 1181135 | 7.6% |
i | 1053667 | 6.8% |
r | 965557 | 6.2% |
o | 960823 | 6.2% |
n | 945952 | 6.1% |
p | 810580 | 5.2% |
m | 791266 | 5.1% |
Other values (36) | 4437921 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 15463169 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 1773301 | 11.5% |
. | 1350718 | 8.7% |
a | 1192249 | 7.7% |
s | 1181135 | 7.6% |
i | 1053667 | 6.8% |
r | 965557 | 6.2% |
o | 960823 | 6.2% |
n | 945952 | 6.1% |
p | 810580 | 5.2% |
m | 791266 | 5.1% |
Other values (36) | 4437921 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 15463169 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 1773301 | 11.5% |
. | 1350718 | 8.7% |
a | 1192249 | 7.7% |
s | 1181135 | 7.6% |
i | 1053667 | 6.8% |
r | 965557 | 6.2% |
o | 960823 | 6.2% |
n | 945952 | 6.1% |
p | 810580 | 5.2% |
m | 791266 | 5.1% |
Other values (36) | 4437921 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 15463169 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 1773301 | 11.5% |
. | 1350718 | 8.7% |
a | 1192249 | 7.7% |
s | 1181135 | 7.6% |
i | 1053667 | 6.8% |
r | 965557 | 6.2% |
o | 960823 | 6.2% |
n | 945952 | 6.1% |
p | 810580 | 5.2% |
m | 791266 | 5.1% |
Other values (36) | 4437921 |
status
Text
The notification status (removed, posted, …)
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 19.0 MiB |
Length
Max length | 20 |
---|---|
Median length | 19 |
Mean length | 19.08742555 |
Min length | 19 |
Characters and Unicode
Total characters | 14075373 |
---|---|
Distinct characters | 15 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | notification_posted |
---|---|
2nd row | notification_posted |
3rd row | notification_removed |
4th row | notification_posted |
5th row | notification_removed |
Value | Count | Frequency (%) |
notification_posted | 672947 | |
notification_removed | 64469 | 8.7% |
Most occurring characters
Value | Count | Frequency (%) |
o | 2212248 | |
i | 2212248 | |
t | 2147779 | |
n | 1474832 | |
e | 801885 | 5.7% |
f | 737416 | 5.2% |
c | 737416 | 5.2% |
a | 737416 | 5.2% |
_ | 737416 | 5.2% |
d | 737416 | 5.2% |
Other values (5) | 1539301 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 14075373 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
o | 2212248 | |
i | 2212248 | |
t | 2147779 | |
n | 1474832 | |
e | 801885 | 5.7% |
f | 737416 | 5.2% |
c | 737416 | 5.2% |
a | 737416 | 5.2% |
_ | 737416 | 5.2% |
d | 737416 | 5.2% |
Other values (5) | 1539301 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 14075373 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
o | 2212248 | |
i | 2212248 | |
t | 2147779 | |
n | 1474832 | |
e | 801885 | 5.7% |
f | 737416 | 5.2% |
c | 737416 | 5.2% |
a | 737416 | 5.2% |
_ | 737416 | 5.2% |
d | 737416 | 5.2% |
Other values (5) | 1539301 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 14075373 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
o | 2212248 | |
i | 2212248 | |
t | 2147779 | |
n | 1474832 | |
e | 801885 | 5.7% |
f | 737416 | 5.2% |
c | 737416 | 5.2% |
a | 737416 | 5.2% |
_ | 737416 | 5.2% |
d | 737416 | 5.2% |
Other values (5) | 1539301 |
id | isclearable | isongoing | userid | |
---|---|---|---|---|
id | 1.000 | 0.330 | 0.329 | 0.298 |
isclearable | 0.330 | 1.000 | 0.999 | 0.165 |
isongoing | 0.329 | 0.999 | 1.000 | -0.165 |
userid | 0.298 | 0.165 | -0.165 | 1.000 |