Dataset statistics
| Number of variables | 4 |
|---|---|
| Number of observations | 100771 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Total size in memory | 4.0 MiB |
| Average record size in memory | 42.0 B |
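The two memory figures in this table can be checked against each other: the average record size multiplied by the number of observations should give roughly the total size. A minimal sketch of that check, using only the figures reported above (the variable names are illustrative):

```python
# Figures copied from the dataset statistics table.
n_observations = 100_771
avg_record_bytes = 42.0  # reported average record size, in bytes

# Total size implied by the average record size, converted to MiB (1 MiB = 2**20 bytes).
total_bytes = n_observations * avg_record_bytes
total_mib = total_bytes / 2**20

# After rounding to one decimal, this agrees with the reported total size.
print(f"{total_mib:.1f} MiB")
```

Both reported values are rounded, so the check is only expected to hold to one decimal place.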
Variable types
| Text | 1 |
|---|---|
| Numeric | 2 |
| DateTime | 1 |
Dataset
Variable descriptions
| experimentid | Experiment Id |
|---|---|
| userid | User id |
| timestamp | show month(2), day(2), hour(2), minute(2), second(2), decimals(3) |
| value | The number of steps |
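The description of timestamp lists field widths but does not show a raw value. As an illustration only, the sketch below assumes the raw value is a 17-digit string with a 4-digit year in front of the listed fields. The year is not mentioned in the description, so that part is an assumption, and `parse_timestamp` is a hypothetical helper name.

```python
from datetime import datetime


def parse_timestamp(raw: str) -> datetime:
    """Parse an assumed 'YYYYMMDDhhmmssSSS' string (year prefix is an assumption)."""
    if len(raw) != 17 or not raw.isdigit():
        raise ValueError(f"expected 17 digits, got {raw!r}")
    year = int(raw[0:4])       # assumed 4-digit year
    month = int(raw[4:6])      # month(2)
    day = int(raw[6:8])        # day(2)
    hour = int(raw[8:10])      # hour(2)
    minute = int(raw[10:12])   # minute(2)
    second = int(raw[12:14])   # second(2)
    millis = int(raw[14:17])   # decimals(3), read as milliseconds
    return datetime(year, month, day, hour, minute, second, millis * 1000)


print(parse_timestamp("20210712082521586"))
```

Slicing fixed-width fields keeps the parsing explicit, which makes it easy to adjust if the real layout differs from the one assumed here.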
Alerts
| Alert | Type |
|---|---|
| experimentid has constant value "wenetIndia" | Constant |
| userid has 7063 (7.0%) zeros | Zeros |
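The two alerts correspond to simple checks on a column: a single distinct value, and the share of entries equal to zero. The sketch below shows one way such checks could be written. `column_alerts` is a hypothetical helper, not ydata-profiling's own logic, and it applies no thresholds.

```python
def column_alerts(values):
    """Return simple alert labels for a list of values (illustrative only)."""
    alerts = []
    # A column with exactly one distinct value is constant.
    if len(set(values)) == 1:
        alerts.append("Constant")
    # Count entries equal to zero and report their share of all entries.
    zeros = sum(1 for v in values if v == 0)
    if zeros:
        share = 100 * zeros / len(values)
        alerts.append(f"Zeros: {zeros} ({share:.1f}%)")
    return alerts


print(column_alerts(["wenetIndia"] * 4))
print(column_alerts([0, 4, 9, 0, 12]))

# The share reported for userid follows from the counts in this report.
print(f"{100 * 7063 / 100771:.1f}%")
```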
Reproduction
| Analysis started | 2024-11-22 12:41:59.258151 |
|---|---|
| Analysis finished | 2024-11-22 12:41:59.740087 |
| Duration | 0.48 seconds |
| Software version | ydata-profiling v4.8.3 |
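A report of this kind can be regenerated with ydata-profiling, the software named above. The sketch below is one plausible setup, not the configuration actually used for this run: the file names are hypothetical, `build_report` is an illustrative wrapper, and pandas plus ydata-profiling are assumed to be installed.

```python
# Descriptions taken from the "Variable descriptions" table of this report.
VARIABLE_DESCRIPTIONS = {
    "experimentid": "Experiment Id",
    "userid": "User id",
    "timestamp": "show month(2), day(2), hour(2), minute(2), second(2), decimals(3)",
    "value": "The number of steps",
}


def build_report(csv_path, html_path):
    """Profile a CSV file and write an HTML report (paths are placeholders)."""
    # Imported lazily so this module loads even without the optional dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(
        df,
        title="Dataset profile",  # illustrative title
        variables={"descriptions": VARIABLE_DESCRIPTIONS},
    )
    profile.to_file(html_path)
```

Calling `build_report("steps.csv", "steps_report.html")` with a suitable input file would write the HTML report. The input would need the same four columns for the descriptions to attach.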
experimentid
Text
Experiment Id
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 1007710 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
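The character totals in this section follow directly from the constant value "wenetIndia" repeated once per row. The sketch below recomputes them and looks up each character's Unicode general category with Python's standard `unicodedata` module. This is an independent illustration: the report itself lists category, script, and block as (unknown).

```python
import unicodedata
from collections import Counter

sample = "wenetIndia"  # the constant value of experimentid
n_rows = 100_771       # number of observations

# Count each character once, then scale by the number of rows.
char_counts = Counter(sample)
totals = {ch: count * n_rows for ch, count in char_counts.items()}

# General category of each distinct character, e.g. "Ll" for a lowercase letter.
categories = {ch: unicodedata.category(ch) for ch in char_counts}

print(len(sample) * n_rows)      # total characters
print(len(char_counts))          # distinct characters
print(totals["e"], totals["w"])  # per-character totals
print(categories)
```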
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | wenetIndia |
|---|---|
| 2nd row | wenetIndia |
| 3rd row | wenetIndia |
| 4th row | wenetIndia |
| 5th row | wenetIndia |
| Value | Count | Frequency (%) |
|---|---|---|
| wenetindia | 100771 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 201542 | 20.0% |
| n | 201542 | 20.0% |
| w | 100771 | 10.0% |
| t | 100771 | 10.0% |
| I | 100771 | 10.0% |
| d | 100771 | 10.0% |
| i | 100771 | 10.0% |
| a | 100771 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1007710 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| e | 201542 | 20.0% |
| n | 201542 | 20.0% |
| w | 100771 | 10.0% |
| t | 100771 | 10.0% |
| I | 100771 | 10.0% |
| d | 100771 | 10.0% |
| i | 100771 | 10.0% |
| a | 100771 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1007710 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| e | 201542 | 20.0% |
| n | 201542 | 20.0% |
| w | 100771 | 10.0% |
| t | 100771 | 10.0% |
| I | 100771 | 10.0% |
| d | 100771 | 10.0% |
| i | 100771 | 10.0% |
| a | 100771 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1007710 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| e | 201542 | 20.0% |
| n | 201542 | 20.0% |
| w | 100771 | 10.0% |
| t | 100771 | 10.0% |
| I | 100771 | 10.0% |
| d | 100771 | 10.0% |
| i | 100771 | 10.0% |
| a | 100771 | 10.0% |
userid
Real number (ℝ)
User id
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.88025325 |
| Minimum | 0 |
| Maximum | 62 |
| Zeros | 7063 |
| Zeros (%) | 7.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 787.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 9 |
| median | 24 |
| Q3 | 24 |
| 95-th percentile | 57 |
| Maximum | 62 |
| Range | 62 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 15.12194683 |
|---|---|
| Coefficient of variation (CV) | 0.7242223861 |
| Kurtosis | 0.535535902 |
| Mean | 20.88025325 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.9231199679 |
| Sum | 2104124 |
| Variance | 228.6732759 |
| Monotonicity | Increasing |
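The Monotonicity row describes the order in which values appear in the column. The sketch below shows one way to classify a sequence. It assumes that "Increasing" means each value is at least as large as the one before it, so repeated values are allowed. This is an interpretation, not the profiler's documented definition, and `monotonicity` is a hypothetical helper.

```python
def monotonicity(values):
    """Classify a list as Increasing, Decreasing, Constant, or Not monotonic."""
    pairs = list(zip(values, values[1:]))  # consecutive (previous, next) pairs
    non_decreasing = all(a <= b for a, b in pairs)
    non_increasing = all(a >= b for a, b in pairs)
    if non_decreasing and non_increasing:
        return "Constant"
    if non_decreasing:
        return "Increasing"
    if non_increasing:
        return "Decreasing"
    return "Not monotonic"


print(monotonicity([0, 0, 4, 9, 9, 12]))  # repeated ids in ascending order
print(monotonicity([520, 23097, 4026]))   # step counts that go up and down
```

Under this reading, an "Increasing" userid column suggests the rows are sorted by user.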
Histogram with fixed size bins (bins=18)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 24 | 27765 | 27.6% |
| 12 | 22433 | 22.3% |
| 35 | 11085 | 11.0% |
| 9 | 10353 | 10.3% |
| 4 | 7671 | 7.6% |
| 0 | 7063 | 7.0% |
| 43 | 6224 | 6.2% |
| 62 | 4555 | 4.5% |
| 18 | 1109 | 1.1% |
| 17 | 741 | 0.7% |
| Other values (8) | 1772 | 1.8% |
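Each frequency in this table is the row's count divided by the number of observations, and the "Other values" row is whatever remains after the ten listed values. A short sketch that recomputes both from the counts above (the helper name `frequency` is illustrative):

```python
n = 100_771  # number of observations

# Counts of the ten most common userid values, copied from the table.
top_counts = {24: 27765, 12: 22433, 35: 11085, 9: 10353, 4: 7671,
              0: 7063, 43: 6224, 62: 4555, 18: 1109, 17: 741}


def frequency(count, total=n):
    """Format a count as a percentage of the total, to one decimal place."""
    return f"{100 * count / total:.1f}%"


# Rows not covered by the ten listed values.
other = n - sum(top_counts.values())

print(frequency(top_counts[35]))
print(other, frequency(other))
```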
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 7063 | 7.0% |
| 4 | 7671 | 7.6% |
| 8 | 285 | 0.3% |
| 9 | 10353 | 10.3% |
| 12 | 22433 | 22.3% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 62 | 4555 | 4.5% |
| 57 | 680 | 0.7% |
| 46 | 120 | 0.1% |
| 44 | 493 | 0.5% |
| 43 | 6224 | 6.2% |
timestamp
Date
show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
| Distinct | 100768 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 787.4 KiB |
| Minimum | 2021-07-12 08:25:21.586000 |
|---|---|
| Maximum | 2021-08-12 14:33:39.816000 |
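The minimum and maximum give the time span covered by the observations. A small sketch that computes it with the standard library, using the two timestamps exactly as reported:

```python
from datetime import datetime

first = datetime.fromisoformat("2021-07-12 08:25:21.586000")
last = datetime.fromisoformat("2021-08-12 14:33:39.816000")

# Difference between the last and first observation.
span = last - first

print(span)                           # days plus hours:minutes:seconds
print(span.total_seconds() / 86_400)  # the same span in fractional days
```

The span is a little over 31 days, which matches a collection period of about one month.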
Histogram with fixed size bins (bins=50)
value
Real number (ℝ)
The number of steps
| Distinct | 62123 |
|---|---|
| Distinct (%) | 61.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49248.76863 |
| Minimum | 0 |
| Maximum | 163163 |
| Zeros | 62 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 787.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 520 |
| Q1 | 4026.5 |
| median | 23097 |
| Q3 | 124017 |
| 95-th percentile | 154789 |
| Maximum | 163163 |
| Range | 163163 |
| Interquartile range (IQR) | 119990.5 |
Descriptive statistics
| Standard deviation | 58111.73917 |
|---|---|
| Coefficient of variation (CV) | 1.179963292 |
| Kurtosis | -0.9273972012 |
| Mean | 49248.76863 |
| Median Absolute Deviation (MAD) | 20205 |
| Skewness | 0.9260876193 |
| Sum | 4962847664 |
| Variance | 3376974230 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 62 | 0.1% |
| 99 | 20 | < 0.1% |
| 150 | 19 | < 0.1% |
| 134 | 18 | < 0.1% |
| 44 | 17 | < 0.1% |
| 45 | 16 | < 0.1% |
| 97 | 16 | < 0.1% |
| 94 | 16 | < 0.1% |
| 625 | 16 | < 0.1% |
| 78 | 16 | < 0.1% |
| Other values (62113) | 100555 | 99.8% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 62 | 0.1% |
| 1 | 6 | < 0.1% |
| 2 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 6 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 163163 | 1 | < 0.1% |
| 163162 | 1 | < 0.1% |
| 163160 | 1 | < 0.1% |
| 163158 | 1 | < 0.1% |
| 163151 | 1 | < 0.1% |
Correlations
| | userid | value |
|---|---|---|
| userid | 1.000 | 0.293 |
| value | 0.293 | 1.000 |
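The matrix is symmetric with ones on the diagonal, as any pairwise correlation matrix is. The report does not state which coefficient was used, so the sketch below uses Pearson's correlation purely as an example of how such a matrix is built. The input numbers are made up and the helper names are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def correlation_matrix(columns):
    """Pairwise correlations for a dict mapping column name to list of values."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}


# Made-up toy data, not taken from the dataset.
toy = {"userid": [0, 4, 9, 12, 24], "value": [10, 40, 35, 80, 60]}
matrix = correlation_matrix(toy)

for name, row in matrix.items():
    print(name, {other: round(r, 3) for other, r in row.items()})
```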