Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 66832332 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 3.1 GiB |
Average record size in memory | 50.0 B |
Variable types
Text | 1 |
---|---|
Numeric | 2 |
DateTime | 1 |
Dataset
Variable descriptions
experimentid | Experiment Id |
---|---|
userid | User id |
timestamp | show month(2), day(2), hour(2), minute(2), second(2), decimals(3) |
value | The distance value (cm, centimeter) |
experimentid has constant value "wenetItaly" | Constant |
userid has 1061489 (1.6%) zeros | Zeros |
value has 15417075 (23.1%) zeros | Zeros |
Reproduction
Analysis started | 2024-11-23 13:20:58.536871 |
---|---|
Analysis finished | 2024-11-23 13:25:06.685386 |
Duration | 4 minutes and 8.15 seconds |
Software version | ydata-profiling v4.8.3 |
Download configuration | config.json |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 GiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 668323320 |
---|---|
Distinct characters | 8 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | wenetItaly |
---|---|
2nd row | wenetItaly |
3rd row | wenetItaly |
4th row | wenetItaly |
5th row | wenetItaly |
Value | Count | Frequency (%) |
wenetitaly | 66832332 |
Most occurring characters
Value | Count | Frequency (%) |
e | 133664664 | |
t | 133664664 | |
w | 66832332 | |
n | 66832332 | |
I | 66832332 | |
a | 66832332 | |
l | 66832332 | |
y | 66832332 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 668323320 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 133664664 | |
t | 133664664 | |
w | 66832332 | |
n | 66832332 | |
I | 66832332 | |
a | 66832332 | |
l | 66832332 | |
y | 66832332 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 668323320 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 133664664 | |
t | 133664664 | |
w | 66832332 | |
n | 66832332 | |
I | 66832332 | |
a | 66832332 | |
l | 66832332 | |
y | 66832332 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 668323320 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 133664664 | |
t | 133664664 | |
w | 66832332 | |
n | 66832332 | |
I | 66832332 | |
a | 66832332 | |
l | 66832332 | |
y | 66832332 |
Distinct | 218 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 108.26394 |
Minimum | 0 |
---|---|
Maximum | 265 |
Zeros | 1061489 |
Zeros (%) | 1.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1019.8 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 24 |
Q1 | 91 |
median | 91 |
Q3 | 134 |
95-th percentile | 134 |
Maximum | 265 |
Range | 265 |
Interquartile range (IQR) | 43 |
Descriptive statistics
Standard deviation | 39.43345349 |
---|---|
Coefficient of variation (CV) | 0.3642344209 |
Kurtosis | 2.385967798 |
Mean | 108.26394 |
Median Absolute Deviation (MAD) | 43 |
Skewness | -0.1075521461 |
Sum | 7235531583 |
Variance | 1554.997254 |
Monotonicity | Increasing |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
134 | 27519528 | |
91 | 27340397 | |
3 | 1798246 | 2.7% |
62 | 1168825 | 1.7% |
82 | 1081143 | 1.6% |
0 | 1061489 | 1.6% |
77 | 827904 | 1.2% |
99 | 478253 | 0.7% |
195 | 452376 | 0.7% |
203 | 452196 | 0.7% |
Other values (208) | 4651975 | 7.0% |
Value | Count | Frequency (%) |
0 | 1061489 | |
1 | 11632 | < 0.1% |
2 | 7526 | < 0.1% |
3 | 1798246 | |
4 | 59074 | 0.1% |
Value | Count | Frequency (%) |
265 | 2700 | < 0.1% |
264 | 324 | < 0.1% |
263 | 15684 | |
262 | 4467 | < 0.1% |
260 | 1209 | < 0.1% |
timestamp
Date
show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
Distinct | 66031836 |
---|---|
Distinct (%) | 98.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1019.8 MiB |
Minimum | 2020-11-16 07:00:00.074000 |
---|---|
Maximum | 2020-12-11 21:59:59.919000 |
Histogram with fixed size bins (bins=50)
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.859825391 |
Minimum | 0 |
---|---|
Maximum | 10 |
Zeros | 15417075 |
Zeros (%) | 23.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1019.8 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 5 |
median | 5 |
Q3 | 5 |
95-th percentile | 5 |
Maximum | 10 |
Range | 10 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2.125706635 |
---|---|
Coefficient of variation (CV) | 0.5507261131 |
Kurtosis | -0.3437705502 |
Mean | 3.859825391 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -1.224299873 |
Sum | 257961132 |
Variance | 4.518628698 |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=6)
Value | Count | Frequency (%) |
5 | 51089286 | |
0 | 15417075 | 23.1% |
8 | 276411 | 0.4% |
1 | 20118 | < 0.1% |
10 | 18318 | < 0.1% |
9 | 11124 | < 0.1% |
Value | Count | Frequency (%) |
0 | 15417075 | 23.1% |
1 | 20118 | < 0.1% |
5 | 51089286 | |
8 | 276411 | 0.4% |
9 | 11124 | < 0.1% |
Value | Count | Frequency (%) |
10 | 18318 | < 0.1% |
9 | 11124 | < 0.1% |
8 | 276411 | 0.4% |
5 | 51089286 | |
1 | 20118 | < 0.1% |
userid | value | |
---|---|---|
userid | 1.000 | -0.102 |
value | -0.102 | 1.000 |