Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 126671 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 4.1 MiB |
Average record size in memory | 34.0 B |
Variable types
Text | 1 |
---|---|
Numeric | 1 |
DateTime | 1 |
Dataset
Variable descriptions
experimentid | Experiment Id |
---|---|
userid | User id |
timestamp | show month(2), day(2), hour(2), minute(2), second(2), decimals(3) |
experimentid has constant value "wenetIndia" | Constant |
userid has 7116 (5.6%) zeros | Zeros |
Reproduction
Analysis started | 2024-11-22 12:32:58.168627 |
---|---|
Analysis finished | 2024-11-22 12:32:58.568889 |
Duration | 0.4 seconds |
Software version | ydata-profiling v4.8.3 |
Download configuration | config.json |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 1266710 |
---|---|
Distinct characters | 8 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | wenetIndia |
---|---|
2nd row | wenetIndia |
3rd row | wenetIndia |
4th row | wenetIndia |
5th row | wenetIndia |
Value | Count | Frequency (%) |
wenetindia | 126671 |
Most occurring characters
Value | Count | Frequency (%) |
e | 253342 | |
n | 253342 | |
w | 126671 | |
t | 126671 | |
I | 126671 | |
d | 126671 | |
i | 126671 | |
a | 126671 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1266710 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 253342 | |
n | 253342 | |
w | 126671 | |
t | 126671 | |
I | 126671 | |
d | 126671 | |
i | 126671 | |
a | 126671 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1266710 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 253342 | |
n | 253342 | |
w | 126671 | |
t | 126671 | |
I | 126671 | |
d | 126671 | |
i | 126671 | |
a | 126671 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1266710 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 253342 | |
n | 253342 | |
w | 126671 | |
t | 126671 | |
I | 126671 | |
d | 126671 | |
i | 126671 | |
a | 126671 |
Distinct | 18 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19.49725667 |
Minimum | 0 |
---|---|
Maximum | 62 |
Zeros | 7116 |
Zeros (%) | 5.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 989.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 9 |
median | 12 |
Q3 | 24 |
95-th percentile | 44 |
Maximum | 62 |
Range | 62 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 14.7360645 |
---|---|
Coefficient of variation (CV) | 0.7558019444 |
Kurtosis | 0.5666364183 |
Mean | 19.49725667 |
Median Absolute Deviation (MAD) | 12 |
Skewness | 1.02518449 |
Sum | 2469737 |
Variance | 217.151597 |
Monotonicity | Increasing |
Histogram with fixed size bins (bins=18)
Value | Count | Frequency (%) |
24 | 29748 | |
12 | 23574 | |
9 | 14447 | |
8 | 12970 | |
35 | 11081 | 8.7% |
4 | 9314 | 7.4% |
0 | 7116 | 5.6% |
43 | 6214 | 4.9% |
44 | 4257 | 3.4% |
62 | 4246 | 3.4% |
Other values (8) | 3704 | 2.9% |
Value | Count | Frequency (%) |
0 | 7116 | 5.6% |
4 | 9314 | 7.4% |
8 | 12970 | |
9 | 14447 | |
12 | 23574 |
Value | Count | Frequency (%) |
62 | 4246 | |
57 | 640 | 0.5% |
46 | 92 | 0.1% |
44 | 4257 | |
43 | 6214 |
timestamp
Date
show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
Distinct | 126667 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 989.7 KiB |
Minimum | 2021-07-12 08:27:22.454000 |
---|---|
Maximum | 2021-08-12 14:39:34.739000 |
Histogram with fixed size bins (bins=50)