Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 21999540 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 1.0 GiB |
Average record size in memory | 50.0 B |
Variable types
Text | 1 |
---|---|
Numeric | 2 |
DateTime | 1 |
Dataset
Variable descriptions
experimentid | Experiment Id |
---|---|
userid | User id |
timestamp | show month(2), day(2), hour(2), minute(2), second(2), decimals(3) |
value | The pressure value (hPa, millibar) |
Alerts
experimentid has constant value "wenetIndia" | Constant |
userid is highly overall correlated with value | High correlation |
value is highly overall correlated with userid | High correlation |
Reproduction
Analysis started | 2024-11-22 13:03:19.972466 |
---|---|
Analysis finished | 2024-11-22 13:04:37.032851 |
Duration | 1 minute and 17.06 seconds |
Software version | ydata-profiling v4.8.3 |
Download configuration | config.json |
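For orientation, a report with this layout could be regenerated roughly as follows. This is a minimal sketch, not the configuration actually used for this run (that is in config.json). The CSV path, the output path, the report title, and the `build_report` helper are all hypothetical. Passing descriptions through `variables={"descriptions": ...}` is assumed to be how the variable descriptions were attached. The third-party imports are deferred into the function so the snippet loads even where pandas and ydata-profiling are not installed.

```python
# Descriptions taken from the "Variable descriptions" table of this report.
DESCRIPTIONS = {
    "experimentid": "Experiment Id",
    "userid": "User id",
    "timestamp": "show month(2), day(2), hour(2), minute(2), second(2), decimals(3)",
    "value": "The pressure value (hPa, millibar)",
}


def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV holding the four columns above and write an HTML report."""
    # Imported lazily: both packages are third-party dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)  # assumption: the data is stored as CSV
    profile = ProfileReport(
        df,
        title="Dataset statistics",  # hypothetical title
        variables={"descriptions": DESCRIPTIONS},
    )
    profile.to_file(out_path)
```

With roughly 22 million rows the original run took a little over a minute, so a call such as `build_report("pressure.csv")` (file name hypothetical) should be expected to take a comparable time.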
experimentid
Text
Experiment Id
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 545.5 MiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 219995400 |
---|---|
Distinct characters | 8 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | wenetIndia |
---|---|
2nd row | wenetIndia |
3rd row | wenetIndia |
4th row | wenetIndia |
5th row | wenetIndia |
Value | Count | Frequency (%) |
wenetindia | 21999540 | 100.0% |
Most occurring characters
Value | Count | Frequency (%) |
e | 43999080 | 20.0% |
n | 43999080 | 20.0% |
w | 21999540 | 10.0% |
t | 21999540 | 10.0% |
I | 21999540 | 10.0% |
d | 21999540 | 10.0% |
i | 21999540 | 10.0% |
a | 21999540 | 10.0% |
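Because experimentid holds the same 10-character string in every row, the character counts above follow directly from the row count. A small sketch of that arithmetic, using only figures from this report (the variable names are illustrative):

```python
from collections import Counter

N_ROWS = 21_999_540          # number of observations in the report
VALUE = "wenetIndia"         # the single distinct experimentid

per_string = Counter(VALUE)  # character counts within one string
totals = {ch: n * N_ROWS for ch, n in per_string.items()}
total_chars = len(VALUE) * N_ROWS

# Print rows in the same "Value | Count | Frequency (%)" layout as the table.
for ch, count in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(f"{ch} | {count} | {count / total_chars:.1%} |")
```

The letters e and n each occur twice per string, which is why their counts are double those of the other six characters.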
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 219995400 | 100.0% |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 43999080 | 20.0% |
n | 43999080 | 20.0% |
w | 21999540 | 10.0% |
t | 21999540 | 10.0% |
I | 21999540 | 10.0% |
d | 21999540 | 10.0% |
i | 21999540 | 10.0% |
a | 21999540 | 10.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 219995400 | 100.0% |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 43999080 | 20.0% |
n | 43999080 | 20.0% |
w | 21999540 | 10.0% |
t | 21999540 | 10.0% |
I | 21999540 | 10.0% |
d | 21999540 | 10.0% |
i | 21999540 | 10.0% |
a | 21999540 | 10.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 219995400 | 100.0% |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 43999080 | 20.0% |
n | 43999080 | 20.0% |
w | 21999540 | 10.0% |
t | 21999540 | 10.0% |
I | 21999540 | 10.0% |
d | 21999540 | 10.0% |
i | 21999540 | 10.0% |
a | 21999540 | 10.0% |
userid
Numeric
User id
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19.71664067 |
Minimum | 12 |
---|---|
Maximum | 62 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 335.7 MiB |
Quantile statistics
Minimum | 12 |
---|---|
5-th percentile | 12 |
Q1 | 12 |
median | 12 |
Q3 | 12 |
95-th percentile | 62 |
Maximum | 62 |
Range | 50 |
Interquartile range (IQR) | 0 |
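With only three distinct userid values, these quantiles can be read off the cumulative counts. The sketch below assumes the linear-interpolation convention that NumPy and pandas use by default. The report does not state which convention was applied, and the helper names are illustrative.

```python
from bisect import bisect_right
from itertools import accumulate

# userid frequency table from this report: value -> count
COUNTS = {12: 18_354_046, 57: 2_502_431, 62: 1_143_063}

values = sorted(COUNTS)
cumulative = list(accumulate(COUNTS[v] for v in values))  # running row counts
n = cumulative[-1]


def value_at(index: int) -> int:
    """Value found at a 0-based position in the sorted column."""
    return values[bisect_right(cumulative, index)]


def quantile(q: float) -> float:
    """Quantile with linear interpolation between neighbouring positions."""
    pos = q * (n - 1)
    lo = int(pos)
    hi = min(lo + 1, n - 1)
    frac = pos - lo
    return value_at(lo) + frac * (value_at(hi) - value_at(lo))


iqr = quantile(0.75) - quantile(0.25)
```

Value 12 covers more than three quarters of the rows, so Q1, the median, and Q3 all equal 12 and the IQR is 0. The 95th percentile lands in the final block of rows, which carry the value 62.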
Descriptive statistics
Standard deviation | 17.34047071 |
---|---|
Coefficient of variation (CV) | 0.8794840359 |
Kurtosis | 1.300531133 |
Mean | 19.71664067 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.810407899 |
Sum | 433757025 |
Variance | 300.6919243 |
Monotonicity | Increasing |
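The descriptive statistics for userid can be recomputed from its three-row frequency table. A minimal sketch, assuming the variance uses the sample convention (n - 1 in the denominator, as pandas does by default); with about 22 million rows the choice barely changes the result:

```python
import math

# userid frequency table from this report: value -> count
COUNTS = {12: 18_354_046, 57: 2_502_431, 62: 1_143_063}

n = sum(COUNTS.values())
total = sum(v * c for v, c in COUNTS.items())
mean = total / n
# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(c * (v - mean) ** 2 for v, c in COUNTS.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(f"Sum {total}, Mean {mean:.8f}, Variance {variance:.4f}, CV {cv:.6f}")
```

Since userid is an identifier, these figures mostly reflect how many readings each of the three users contributed rather than any measured quantity.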
Histogram with fixed size bins (bins=3)
Common values
Value | Count | Frequency (%) |
12 | 18354046 | 83.4% |
57 | 2502431 | 11.4% |
62 | 1143063 | 5.2% |
Minimum values
Value | Count | Frequency (%) |
12 | 18354046 | 83.4% |
57 | 2502431 | 11.4% |
62 | 1143063 | 5.2% |
Maximum values
Value | Count | Frequency (%) |
62 | 1143063 | 5.2% |
57 | 2502431 | 11.4% |
12 | 18354046 | 83.4% |
timestamp
Date
show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
Distinct | 21990934 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 335.7 MiB |
Minimum | 2021-07-12 08:00:00.006000 |
---|---|
Maximum | 2021-08-12 07:38:57.889000 |
Histogram with fixed size bins (bins=50)
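The minimum and maximum timestamps give the length of the recording window, and dividing the row count by that window gives an average sampling rate. A short sketch using only figures from this report:

```python
from datetime import datetime

N_ROWS = 21_999_540  # number of observations in the report
first = datetime.fromisoformat("2021-07-12 08:00:00.006000")
last = datetime.fromisoformat("2021-08-12 07:38:57.889000")

span = last - first                   # length of the recording window
rate = N_ROWS / span.total_seconds()  # average readings per second, all users

print(span, f"{rate:.2f} readings/s")
```

The window is just under 31 days, which works out to roughly 8.2 readings per second across all users combined. This is an average only; the histogram would show whether the readings are spread evenly over the period.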
value
Numeric
The pressure value (hPa, millibar)
Distinct | 1661 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 968.3728521 |
Minimum | 957.6699829 |
---|---|
Maximum | 1011.669983 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 335.7 MiB |
Quantile statistics
Minimum | 957.6699829 |
---|---|
5-th percentile | 960.2199707 |
Q1 | 962.0100098 |
median | 963.75 |
Q3 | 965.2800293 |
95-th percentile | 1008.450012 |
Maximum | 1011.669983 |
Range | 54 |
Interquartile range (IQR) | 3.270019531 |
Descriptive statistics
Standard deviation | 14.39819881 |
---|---|
Coefficient of variation (CV) | 0.01486844533 |
Kurtosis | 3.750476717 |
Mean | 968.3728521 |
Median Absolute Deviation (MAD) | 1.619995117 |
Skewness | 2.366307245 |
Sum | 2.130375729 × 10^10 |
Variance | 207.308129 |
Monotonicity | Not monotonic |
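Several of the derived figures for value follow from the others by simple arithmetic. The sketch below recomputes them from the numbers printed in this report (all in hPa); the variable names are illustrative.

```python
# Figures copied from the value statistics in this report (hPa).
minimum, maximum = 957.6699829, 1011.669983
q1, q3 = 962.0100098, 965.2800293
mean, std = 968.3728521, 14.39819881

value_range = maximum - minimum  # spread of the whole column
iqr = q3 - q1                    # spread of the middle 50% of readings
cv = std / mean                  # standard deviation relative to the mean
variance = std ** 2

print(f"Range {value_range:.2f}, IQR {iqr:.4f}, CV {cv:.6f}, Variance {variance:.4f}")
```

The contrast between a 54 hPa range and an IQR of about 3.27 hPa shows that most readings sit in a narrow band near 964 hPa, with a smaller group near 1008 to 1012 hPa accounting for the positive skew.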
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%) |
963.8499756 | 58885 | 0.3% |
963.8400269 | 58114 | 0.3% |
963.8599854 | 54738 | 0.2% |
964.210022 | 54149 | 0.2% |
963.9799805 | 51923 | 0.2% |
963.8300171 | 51724 | 0.2% |
963.0499878 | 50785 | 0.2% |
963.9699707 | 50573 | 0.2% |
964.2000122 | 50367 | 0.2% |
963.9899902 | 50163 | 0.2% |
Other values (1651) | 21468119 | 97.6% |
Minimum values
Value | Count | Frequency (%) |
957.6699829 | 1 | < 0.1% |
957.7199707 | 1 | < 0.1% |
957.7999878 | 2 | < 0.1% |
957.8099976 | 43 | < 0.1% |
957.8200073 | 80 | < 0.1% |
Maximum values
Value | Count | Frequency (%) |
1011.669983 | 8 | < 0.1% |
1011.659973 | 70 | < 0.1% |
1011.650024 | 261 | < 0.1% |
1011.640015 | 470 | < 0.1% |
1011.630005 | 642 | < 0.1% |
Correlations
Variable | userid | value |
---|---|---|
userid | 1.000 | 0.560 |
value | 0.560 | 1.000 |
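The report does not say which coefficient produced the 0.560 above. As an illustration only, the sketch below assumes a rank-based Spearman coefficient, a common choice for two numeric columns. It is computed as the Pearson correlation of average ranks. The helper names and the toy readings are made up and are not taken from the dataset.

```python
from statistics import mean


def average_ranks(xs):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    ranks = [0.0] * len(xs)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of equal values.
        while j + 1 < len(order) and xs[order[j + 1]] == xs[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def spearman(xs, ys):
    """Spearman's rho: Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(xs), average_ranks(ys))


# Made-up toy readings, not taken from the dataset.
userid = [12, 12, 12, 57, 57, 62]
value = [963.8, 964.2, 962.0, 1008.4, 963.9, 1011.6]
rho = spearman(userid, value)
```

The coefficient is symmetric in its two arguments, which matches the identical off-diagonal entries in the table. A correlation between an identifier and a measurement usually means that the users' readings sit at different pressure levels, for example because the devices were at different altitudes.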