Overview

Dataset statistics

Number of variables6
Number of observations94837686
Missing cells0
Missing cells (%)0.0%
Total size in memory5.8 GiB
Average record size in memory66.0 B

Variable types

Text1
Numeric4
DateTime1

Dataset

Description[μT] Geomagnetic field strength along the x,y,z axis. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
x x axis at the time of capture
y y axis at the time of capture
z z axis at the time of capture

Alerts

experimentid has constant value "wenetIndia"Constant
userid has 11587955 (12.2%) zerosZeros

Reproduction

Analysis started2024-11-22 13:06:35.768978
Analysis finished2024-11-22 13:15:19.303469
Duration8 minutes and 43.53 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 GiB
2024-11-22T14:15:19.358810image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters948376860
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetIndia
2nd rowwenetIndia
3rd rowwenetIndia
4th rowwenetIndia
5th rowwenetIndia
ValueCountFrequency (%)
wenetindia 94837686
100.0%
2024-11-22T14:15:19.532387image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 189675372
20.0%
n 189675372
20.0%
w 94837686
10.0%
t 94837686
10.0%
I 94837686
10.0%
d 94837686
10.0%
i 94837686
10.0%
a 94837686
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 948376860
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 189675372
20.0%
n 189675372
20.0%
w 94837686
10.0%
t 94837686
10.0%
I 94837686
10.0%
d 94837686
10.0%
i 94837686
10.0%
a 94837686
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 948376860
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 189675372
20.0%
n 189675372
20.0%
w 94837686
10.0%
t 94837686
10.0%
I 94837686
10.0%
d 94837686
10.0%
i 94837686
10.0%
a 94837686
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 948376860
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 189675372
20.0%
n 189675372
20.0%
w 94837686
10.0%
t 94837686
10.0%
I 94837686
10.0%
d 94837686
10.0%
i 94837686
10.0%
a 94837686
10.0%

userid
Real number (ℝ)

ZEROS 

User id

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.7266063
Minimum0
Maximum62
Zeros11587955
Zeros (%)12.2%
Negative0
Negative (%)0.0%
Memory size1.4 GiB
2024-11-22T14:15:19.632943image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q19
median26
Q344
95-th percentile44
Maximum62
Range62
Interquartile range (IQR)35

Descriptive statistics

Standard deviation18.29618971
Coefficient of variation (CV)0.6845683851
Kurtosis-1.605318019
Mean26.7266063
Median Absolute Deviation (MAD)18
Skewness-0.1177914904
Sum2534689496
Variance334.750558
MonotonicityIncreasing
2024-11-22T14:15:19.724342image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
44 34018559
35.9%
0 11587955
 
12.2%
12 9444784
 
10.0%
9 8549947
 
9.0%
43 7261579
 
7.7%
17 6520827
 
6.9%
8 5724453
 
6.0%
35 2613525
 
2.8%
57 2239587
 
2.4%
4 1920599
 
2.0%
Other values (8) 4955871
 
5.2%
ValueCountFrequency (%)
0 11587955
12.2%
4 1920599
 
2.0%
8 5724453
6.0%
9 8549947
9.0%
12 9444784
10.0%
ValueCountFrequency (%)
62 580384
 
0.6%
57 2239587
 
2.4%
46 42471
 
< 0.1%
44 34018559
35.9%
43 7261579
 
7.7%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct93356943
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size1.4 GiB
Minimum2021-07-12 08:00:00.032000
Maximum2021-08-12 14:41:48.780000
2024-11-22T14:15:19.833605image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-22T14:15:19.960248image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

x
Real number (ℝ)

x axis at the time of capture

Distinct102386
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-6.11137384
Minimum-3394.120117
Maximum1951.920044
Zeros48654
Zeros (%)0.1%
Negative60895776
Negative (%)64.2%
Memory size1.4 GiB
2024-11-22T14:15:20.076998image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum-3394.120117
5-th percentile-37.91999817
Q1-26.70000076
median-11.22000027
Q39.109999657
95-th percentile37.84999847
Maximum1951.920044
Range5346.040161
Interquartile range (IQR)35.81000042

Descriptive statistics

Standard deviation49.13554337
Coefficient of variation (CV)-8.040015986
Kurtosis235.5099224
Mean-6.11137384
Median Absolute Deviation (MAD)16.67999935
Skewness0.3334525536
Sum-579588553.3
Variance2414.301622
MonotonicityNot monotonic
2024-11-22T14:15:20.193498image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-28.20000076 196793
 
0.2%
-28.01000023 182611
 
0.2%
-31.97999954 177428
 
0.2%
-27.95000076 177426
 
0.2%
-28.79999924 176076
 
0.2%
-28.5 175031
 
0.2%
-28.25 174949
 
0.2%
-29.14999962 172605
 
0.2%
-31.68000031 165211
 
0.2%
-28.54999924 164968
 
0.2%
Other values (102376) 93074588
98.1%
ValueCountFrequency (%)
-3394.120117 33
< 0.1%
-3050.360107 1
 
< 0.1%
-2415.550049 1
 
< 0.1%
-2405.080078 1
 
< 0.1%
-2369.189941 1
 
< 0.1%
ValueCountFrequency (%)
1951.920044 1
 
< 0.1%
1951.800049 17
< 0.1%
1951.319946 1
 
< 0.1%
1950.300049 3
 
< 0.1%
1950.140015 1
 
< 0.1%

y
Real number (ℝ)

y axis at the time of capture

Distinct98862
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.813020117
Minimum-2240
Maximum2244.97998
Zeros38164
Zeros (%)< 0.1%
Negative37559920
Negative (%)39.6%
Memory size1.4 GiB
2024-11-22T14:15:20.309366image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum-2240
5-th percentile-42.27000046
Q1-16.44000053
median8.279999733
Q324.35000038
95-th percentile37.86000061
Maximum2244.97998
Range4484.97998
Interquartile range (IQR)40.79000092

Descriptive statistics

Standard deviation48.69713256
Coefficient of variation (CV)17.31133463
Kurtosis275.009964
Mean2.813020117
Median Absolute Deviation (MAD)19.36999989
Skewness-2.327533306
Sum266780318.6
Variance2371.410719
MonotonicityNot monotonic
2024-11-22T14:15:20.431014image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8.449999809 179170
 
0.2%
8.579999924 169381
 
0.2%
8.75 159927
 
0.2%
8.880000114 157270
 
0.2%
8.149999619 147445
 
0.2%
15.64999962 138682
 
0.1%
15.94999981 138608
 
0.1%
15.35000038 131537
 
0.1%
8.279999733 129316
 
0.1%
15.77999973 129037
 
0.1%
Other values (98852) 93357313
98.4%
ValueCountFrequency (%)
-2240 1
< 0.1%
-2239.26001 1
< 0.1%
-2239.199951 1
< 0.1%
-2239.179932 1
< 0.1%
-2239.030029 1
< 0.1%
ValueCountFrequency (%)
2244.97998 1
< 0.1%
2244.949951 1
< 0.1%
2244.899902 1
< 0.1%
2243.939941 1
< 0.1%
2242.129883 1
< 0.1%

z
Real number (ℝ)

z axis at the time of capture

Distinct127722
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-10.47684862
Minimum-3405.530029
Maximum2915.98999
Zeros64470
Zeros (%)0.1%
Negative60182100
Negative (%)63.5%
Memory size1.4 GiB
2024-11-22T14:15:20.539207image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum-3405.530029
5-th percentile-36.95000076
Q1-21.12999916
median-6.840000153
Q35.039999962
95-th percentile26.51000023
Maximum2915.98999
Range6321.52002
Interquartile range (IQR)26.16999912

Descriptive statistics

Standard deviation66.93694086
Coefficient of variation (CV)-6.389033889
Kurtosis609.6784961
Mean-10.47684862
Median Absolute Deviation (MAD)12.78000021
Skewness-13.03485161
Sum-993600079.4
Variance4480.554052
MonotonicityNot monotonic
2024-11-22T14:15:20.651091image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.909999847 313482
 
0.3%
4.739999771 269406
 
0.3%
4.800000191 261249
 
0.3%
5.210000038 253130
 
0.3%
5.099999905 242923
 
0.3%
4.440000057 231325
 
0.2%
5.400000095 218000
 
0.2%
5.039999962 216811
 
0.2%
5.579999924 211830
 
0.2%
5.699999809 210617
 
0.2%
Other values (127712) 92408913
97.4%
ValueCountFrequency (%)
-3405.530029 1
< 0.1%
-3405.48999 1
< 0.1%
-3405.469971 1
< 0.1%
-3405.429932 1
< 0.1%
-3405.419922 1
< 0.1%
ValueCountFrequency (%)
2915.98999 1
< 0.1%
2832.030029 1
< 0.1%
2807.590088 1
< 0.1%
2806.870117 2
< 0.1%
2806.219971 1
< 0.1%

Correlations

2024-11-22T14:15:20.724286image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
useridxyz
userid1.000-0.2250.1220.133
x-0.2251.0000.051-0.121
y0.1220.0511.000-0.142
z0.133-0.121-0.1421.000