Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 100771 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 4.0 MiB |
Average record size in memory | 42.0 B |
Variable types
Text | 1 |
---|---|
Numeric | 2 |
DateTime | 1 |
Dataset
Variable descriptions
experimentid | Experiment Id |
---|---|
userid | User id |
timestamp | show month(2), day(2), hour(2), minute(2), second(2), decimals(3) |
value | The number of steps |
experimentid has constant value "wenetIndia" | Constant |
userid has 7063 (7.0%) zeros | Zeros |
Reproduction
Analysis started | 2024-11-22 12:41:59.258151 |
---|---|
Analysis finished | 2024-11-22 12:41:59.740087 |
Duration | 0.48 seconds |
Software version | ydata-profiling v4.8.3 |
Download configuration | config.json |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 1007710 |
---|---|
Distinct characters | 8 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | wenetIndia |
---|---|
2nd row | wenetIndia |
3rd row | wenetIndia |
4th row | wenetIndia |
5th row | wenetIndia |
Value | Count | Frequency (%) |
wenetindia | 100771 |
Most occurring characters
Value | Count | Frequency (%) |
e | 201542 | |
n | 201542 | |
w | 100771 | |
t | 100771 | |
I | 100771 | |
d | 100771 | |
i | 100771 | |
a | 100771 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1007710 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 201542 | |
n | 201542 | |
w | 100771 | |
t | 100771 | |
I | 100771 | |
d | 100771 | |
i | 100771 | |
a | 100771 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1007710 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 201542 | |
n | 201542 | |
w | 100771 | |
t | 100771 | |
I | 100771 | |
d | 100771 | |
i | 100771 | |
a | 100771 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1007710 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 201542 | |
n | 201542 | |
w | 100771 | |
t | 100771 | |
I | 100771 | |
d | 100771 | |
i | 100771 | |
a | 100771 |
Distinct | 18 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20.88025325 |
Minimum | 0 |
---|---|
Maximum | 62 |
Zeros | 7063 |
Zeros (%) | 7.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 787.4 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 9 |
median | 24 |
Q3 | 24 |
95-th percentile | 57 |
Maximum | 62 |
Range | 62 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 15.12194683 |
---|---|
Coefficient of variation (CV) | 0.7242223861 |
Kurtosis | 0.535535902 |
Mean | 20.88025325 |
Median Absolute Deviation (MAD) | 12 |
Skewness | 0.9231199679 |
Sum | 2104124 |
Variance | 228.6732759 |
Monotonicity | Increasing |
Histogram with fixed size bins (bins=18)
Value | Count | Frequency (%) |
24 | 27765 | |
12 | 22433 | |
35 | 11085 | 11.0% |
9 | 10353 | 10.3% |
4 | 7671 | 7.6% |
0 | 7063 | 7.0% |
43 | 6224 | 6.2% |
62 | 4555 | 4.5% |
18 | 1109 | 1.1% |
17 | 741 | 0.7% |
Other values (8) | 1772 | 1.8% |
Value | Count | Frequency (%) |
0 | 7063 | 7.0% |
4 | 7671 | 7.6% |
8 | 285 | 0.3% |
9 | 10353 | |
12 | 22433 |
Value | Count | Frequency (%) |
62 | 4555 | |
57 | 680 | 0.7% |
46 | 120 | 0.1% |
44 | 493 | 0.5% |
43 | 6224 |
timestamp
Date
show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
Distinct | 100768 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 787.4 KiB |
Minimum | 2021-07-12 08:25:21.586000 |
---|---|
Maximum | 2021-08-12 14:33:39.816000 |
Histogram with fixed size bins (bins=50)
value
Real number (ℝ)
The number of steps
Distinct | 62123 |
---|---|
Distinct (%) | 61.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 49248.76863 |
Minimum | 0 |
---|---|
Maximum | 163163 |
Zeros | 62 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 787.4 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 520 |
Q1 | 4026.5 |
median | 23097 |
Q3 | 124017 |
95-th percentile | 154789 |
Maximum | 163163 |
Range | 163163 |
Interquartile range (IQR) | 119990.5 |
Descriptive statistics
Standard deviation | 58111.73917 |
---|---|
Coefficient of variation (CV) | 1.179963292 |
Kurtosis | -0.9273972012 |
Mean | 49248.76863 |
Median Absolute Deviation (MAD) | 20205 |
Skewness | 0.9260876193 |
Sum | 4962847664 |
Variance | 3376974230 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
0 | 62 | 0.1% |
99 | 20 | < 0.1% |
150 | 19 | < 0.1% |
134 | 18 | < 0.1% |
44 | 17 | < 0.1% |
45 | 16 | < 0.1% |
97 | 16 | < 0.1% |
94 | 16 | < 0.1% |
625 | 16 | < 0.1% |
78 | 16 | < 0.1% |
Other values (62113) | 100555 |
Value | Count | Frequency (%) |
0 | 62 | |
1 | 6 | < 0.1% |
2 | 1 | < 0.1% |
5 | 2 | < 0.1% |
6 | 6 | < 0.1% |
Value | Count | Frequency (%) |
163163 | 1 | |
163162 | 1 | |
163160 | 1 | |
163158 | 1 | |
163151 | 1 |
userid | value | |
---|---|---|
userid | 1.000 | 0.293 |
value | 0.293 | 1.000 |