Overview

Dataset statistics

Number of variables4
Number of observations21999540
Missing cells0
Missing cells (%)0.0%
Total size in memory1.0 GiB
Average record size in memory50.0 B

Variable types

Text1
Numeric2
DateTime1

Dataset

Description[hPa or mbar] Ambient air pressure. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
valueThe pressure value (hPa, milibar)

Alerts

experimentid has constant value "wenetIndia"Constant
userid is highly overall correlated with valueHigh correlation
value is highly overall correlated with useridHigh correlation

Reproduction

Analysis started2024-11-22 13:03:19.972466
Analysis finished2024-11-22 13:04:37.032851
Duration1 minute and 17.06 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size545.5 MiB
2024-11-22T14:04:37.085654image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters219995400
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetIndia
2nd rowwenetIndia
3rd rowwenetIndia
4th rowwenetIndia
5th rowwenetIndia
ValueCountFrequency (%)
wenetindia 21999540
100.0%
2024-11-22T14:04:37.261765image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 43999080
20.0%
n 43999080
20.0%
w 21999540
10.0%
t 21999540
10.0%
I 21999540
10.0%
d 21999540
10.0%
i 21999540
10.0%
a 21999540
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 219995400
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 43999080
20.0%
n 43999080
20.0%
w 21999540
10.0%
t 21999540
10.0%
I 21999540
10.0%
d 21999540
10.0%
i 21999540
10.0%
a 21999540
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 219995400
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 43999080
20.0%
n 43999080
20.0%
w 21999540
10.0%
t 21999540
10.0%
I 21999540
10.0%
d 21999540
10.0%
i 21999540
10.0%
a 21999540
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 219995400
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 43999080
20.0%
n 43999080
20.0%
w 21999540
10.0%
t 21999540
10.0%
I 21999540
10.0%
d 21999540
10.0%
i 21999540
10.0%
a 21999540
10.0%

userid
Real number (ℝ)

HIGH CORRELATION 

User id

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.71664067
Minimum12
Maximum62
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size335.7 MiB
2024-11-22T14:04:37.357545image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum12
5-th percentile12
Q112
median12
Q312
95-th percentile62
Maximum62
Range50
Interquartile range (IQR)0

Descriptive statistics

Standard deviation17.34047071
Coefficient of variation (CV)0.8794840359
Kurtosis1.300531133
Mean19.71664067
Median Absolute Deviation (MAD)0
Skewness1.810407899
Sum433757025
Variance300.6919243
MonotonicityIncreasing
2024-11-22T14:04:37.443426image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=3)
ValueCountFrequency (%)
12 18354046
83.4%
57 2502431
 
11.4%
62 1143063
 
5.2%
ValueCountFrequency (%)
12 18354046
83.4%
57 2502431
 
11.4%
62 1143063
 
5.2%
ValueCountFrequency (%)
62 1143063
 
5.2%
57 2502431
 
11.4%
12 18354046
83.4%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct21990934
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size335.7 MiB
Minimum2021-07-12 08:00:00.006000
Maximum2021-08-12 07:38:57.889000
2024-11-22T14:04:37.554878image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-22T14:04:37.679013image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

value
Real number (ℝ)

HIGH CORRELATION 

The pressure value (hPa, milibar)

Distinct1661
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean968.3728521
Minimum957.6699829
Maximum1011.669983
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size335.7 MiB
2024-11-22T14:04:37.797214image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum957.6699829
5-th percentile960.2199707
Q1962.0100098
median963.75
Q3965.2800293
95-th percentile1008.450012
Maximum1011.669983
Range54
Interquartile range (IQR)3.270019531

Descriptive statistics

Standard deviation14.39819881
Coefficient of variation (CV)0.01486844533
Kurtosis3.750476717
Mean968.3728521
Median Absolute Deviation (MAD)1.619995117
Skewness2.366307245
Sum2.130375729 × 1010
Variance207.308129
MonotonicityNot monotonic
2024-11-22T14:04:37.919239image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
963.8499756 58885
 
0.3%
963.8400269 58114
 
0.3%
963.8599854 54738
 
0.2%
964.210022 54149
 
0.2%
963.9799805 51923
 
0.2%
963.8300171 51724
 
0.2%
963.0499878 50785
 
0.2%
963.9699707 50573
 
0.2%
964.2000122 50367
 
0.2%
963.9899902 50163
 
0.2%
Other values (1651) 21468119
97.6%
ValueCountFrequency (%)
957.6699829 1
 
< 0.1%
957.7199707 1
 
< 0.1%
957.7999878 2
 
< 0.1%
957.8099976 43
< 0.1%
957.8200073 80
< 0.1%
ValueCountFrequency (%)
1011.669983 8
 
< 0.1%
1011.659973 70
 
< 0.1%
1011.650024 261
< 0.1%
1011.640015 470
< 0.1%
1011.630005 642
< 0.1%

Correlations

2024-11-22T14:04:37.991189image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
useridvalue
userid1.0000.560
value0.5601.000