Overview

Dataset statistics

Number of variables: 4
Number of observations: 642940848
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 29.9 GiB
Average record size in memory: 50.0 B
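The record size and total size above are two views of the same quantity. The short check below relates them, assuming GiB means 2**30 bytes; because both reported figures are rounded, the results agree only approximately.

```python
# Figures copied from the "Dataset statistics" table.
n_observations = 642_940_848
total_gib = 29.9        # reported total size in memory (rounded)
record_bytes = 50.0     # reported average record size (rounded)
GIB = 2**30             # bytes per GiB (assumed binary prefix)

# Average record size implied by the rounded total.
implied_record_bytes = total_gib * GIB / n_observations

# Total size implied by the rounded record size.
implied_total_gib = record_bytes * n_observations / GIB

print(f"{implied_record_bytes:.2f} B per record")
print(f"{implied_total_gib:.2f} GiB in total")
```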

Variable types

Text: 1
Numeric: 2
DateTime: 1

Dataset

Description: [lx] Illuminance. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
Creator: Andrea Bontempelli, Matteo Busso, Roy Alia Asiku
Author: Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL:
Copyright: (c) University of Trento - Knowledge Diversity 2023
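The description mentions reducing the sampling frequency to one minute. A minimal sketch of one way to do that is shown below: timestamps are truncated to the minute and readings are averaged per user and minute. The function name, the toy rows, and the choice of the mean as the aggregate are assumptions; the report does not say how the reduction was performed.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean


def downsample_to_minute(rows):
    """Average readings per (userid, minute).

    rows: iterable of (userid, timestamp, value) tuples.
    The mean is an assumed aggregate, not one named in the report.
    """
    buckets = defaultdict(list)
    for userid, ts, value in rows:
        minute = ts.replace(second=0, microsecond=0)  # truncate to the minute
        buckets[(userid, minute)].append(value)
    return {key: mean(values) for key, values in sorted(buckets.items())}


# Hypothetical readings, not taken from the dataset.
rows = [
    (5, datetime(2020, 11, 16, 7, 0, 0, 8000), 10.0),
    (5, datetime(2020, 11, 16, 7, 0, 30), 20.0),
    (5, datetime(2020, 11, 16, 7, 1, 2), 40.0),
]
per_minute = downsample_to_minute(rows)
for (userid, minute), lux in per_minute.items():
    print(userid, minute.isoformat(), lux)
```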

Variable descriptions

experimentid: Experiment Id
userid: User id
timestamp: show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
value: The ambient light level in SI lux units

Alerts

experimentid has constant value "wenetItaly" (Constant)
value is highly skewed (γ1 = 4506.746548) (Skewed)
value has 204613392 (31.8%) zeros (Zeros)
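The skewness and zero-share alerts can be illustrated with a small sketch. It computes the moment-based skewness γ1 = m3 / m2^1.5 and the fraction of zeros on a toy list, then recomputes the reported zero percentage from the two counts in the report. The helper names and toy values are illustrative, and the profiler's exact skewness estimator (for example, whether it applies a bias correction) is not stated in the report.

```python
def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 using central moments."""
    n = len(values)
    mu = sum(values) / n
    m2 = sum((v - mu) ** 2 for v in values) / n
    m3 = sum((v - mu) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


def zero_share(values):
    """Fraction of entries that are exactly zero."""
    return sum(1 for v in values if v == 0) / len(values)


# Toy data: mostly dark readings plus one very bright outlier.
toy = [0, 0, 0, 1, 2, 5, 9, 1000]
print(f"toy skewness:   {skewness(toy):.3f}")
print(f"toy zero share: {zero_share(toy):.1%}")

# Zero percentage recomputed from the counts in the report.
reported_zero_pct = 100 * 204_613_392 / 642_940_848
print(f"reported zeros: {reported_zero_pct:.1f}%")
```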

Reproduction

Analysis started: 2024-11-23 11:40:18.781881
Analysis finished: 2024-11-23 12:30:33.701083
Duration: 50 minutes and 14.92 seconds
Software version: ydata-profiling v4.8.3
Download configuration: config.json
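A report with the metadata shown in the Dataset and Variable descriptions sections could be set up roughly as follows. This is a sketch, not the configuration actually used: the real settings live in config.json, which is not reproduced here, and the report title and the `build_report` helper are hypothetical. The import is deferred so the metadata can be inspected without the library installed.

```python
# Metadata copied from the "Dataset" and "Variable descriptions" sections.
DATASET_META = {
    "description": (
        "[lx] Illuminance. To compare each sensor observation, the frequency "
        "was reduced to one minute. The first non-missing name is reported "
        "for each of the categorical variables."
    ),
    "creator": "Andrea Bontempelli, Matteo Busso, Roy Alia Asiku",
    "author": "Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia",
    "copyright_holder": "University of Trento - Knowledge Diversity",
    "copyright_year": 2023,
}

VARIABLE_DESCRIPTIONS = {
    "experimentid": "Experiment Id",
    "userid": "User id",
    "timestamp": "show month(2), day(2), hour(2), minute(2), second(2), decimals(3)",
    "value": "The ambient light level in SI lux units",
}


def build_report(df, title="Illuminance sensor profile"):
    """Build a ProfileReport for a pandas DataFrame (title is a placeholder)."""
    from ydata_profiling import ProfileReport  # imported lazily

    return ProfileReport(
        df,
        title=title,
        dataset=DATASET_META,
        variables={"descriptions": VARIABLE_DESCRIPTIONS},
    )
```

With a DataFrame `df` holding the four columns, `build_report(df).to_file("report.html")` would write the HTML report.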

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.6 GiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 6429408480
Distinct characters: 8
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: wenetItaly
2nd row: wenetItaly
3rd row: wenetItaly
4th row: wenetItaly
5th row: wenetItaly
Value        Count        Frequency (%)
wenetitaly   642940848    100.0%

Most occurring characters

Value   Count        Frequency (%)
e       1285881696   20.0%
t       1285881696   20.0%
w       642940848    10.0%
n       642940848    10.0%
I       642940848    10.0%
a       642940848    10.0%
l       642940848    10.0%
y       642940848    10.0%
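Because experimentid is constant, the character counts above follow directly from the single string "wenetItaly" multiplied by the number of rows. A short check, using only the figures in the report:

```python
from collections import Counter

n_rows = 642_940_848               # number of observations
per_string = Counter("wenetItaly")  # character counts within one value

# Scale the per-string counts up to the whole column.
totals = {ch: k * n_rows for ch, k in per_string.items()}
total_chars = sum(totals.values())

for ch, count in sorted(totals.items(), key=lambda item: -item[1]):
    print(ch, count, f"{100 * count / total_chars:.1f}%")
```

The letters e and t each occur twice in the string, which is why they account for 20.0% each while the other six characters account for 10.0% each.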

Most occurring categories

Value       Count        Frequency (%)
(unknown)   6429408480   100.0%

Most frequent character per category

(unknown)
Value   Count        Frequency (%)
e       1285881696   20.0%
t       1285881696   20.0%
w       642940848    10.0%
n       642940848    10.0%
I       642940848    10.0%
a       642940848    10.0%
l       642940848    10.0%
y       642940848    10.0%

Most occurring scripts

Value       Count        Frequency (%)
(unknown)   6429408480   100.0%

Most frequent character per script

(unknown)
Value   Count        Frequency (%)
e       1285881696   20.0%
t       1285881696   20.0%
w       642940848    10.0%
n       642940848    10.0%
I       642940848    10.0%
a       642940848    10.0%
l       642940848    10.0%
y       642940848    10.0%

Most occurring blocks

Value       Count        Frequency (%)
(unknown)   6429408480   100.0%

Most frequent character per block

(unknown)
Value   Count        Frequency (%)
e       1285881696   20.0%
t       1285881696   20.0%
w       642940848    10.0%
n       642940848    10.0%
I       642940848    10.0%
a       642940848    10.0%
l       642940848    10.0%
y       642940848    10.0%

userid
Real number (ℝ)

User id

Distinct: 214
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 122.760921
Minimum: 0
Maximum: 265
Zeros: 553319
Zeros (%): 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 9.6 GiB

Quantile statistics

Minimum: 0
5-th percentile: 6
Q1: 55
median: 119
Q3: 200
95-th percentile: 254
Maximum: 265
Range: 265
Interquartile range (IQR): 145

Descriptive statistics

Standard deviation: 80.38110694
Coefficient of variation (CV): 0.6547776465
Kurtosis: -1.253834288
Mean: 122.760921
Median Absolute Deviation (MAD): 72
Skewness: 0.1267212358
Sum: 7.892801065 × 10^10
Variance: 6461.122353
Monotonicity: Increasing
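Several of the userid statistics above are derived from one another. The sketch below recomputes the range, IQR, coefficient of variation, variance, and sum from the reported quantiles, mean, and standard deviation; small differences in the last digits are expected because the inputs are rounded.

```python
# Reported userid statistics.
minimum, maximum = 0, 265
q1, q3 = 55, 200
mean, std = 122.760921, 80.38110694
n = 642_940_848  # number of observations

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n                  # Sum

print(value_range, iqr)
print(f"CV       = {cv:.10f}")
print(f"Variance = {variance:.6f}")
print(f"Sum      = {total:.9e}")
```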
Histogram with fixed size bins (bins=50)
Value                Count       Frequency (%)
5                    17251045    2.7%
58                   16510055    2.6%
134                  15264417    2.4%
225                  15101079    2.3%
40                   14664449    2.3%
91                   14436934    2.2%
15                   13610247    2.1%
99                   13601091    2.1%
258                  13467124    2.1%
158                  13337248    2.1%
Other values (204)   495697159   77.1%

Value   Count     Frequency (%)
0       553319    0.1%
1       7671609   1.2%
2       1142716   0.2%
3       989565    0.2%
4       2057941   0.3%

Value   Count     Frequency (%)
265     3393983   0.5%
264     45140     < 0.1%
263     1862719   0.3%
262     1174411   0.2%
260     31436     < 0.1%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct: 553986674
Distinct (%): 86.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.6 GiB
Minimum: 2020-11-16 07:00:00.008000
Maximum: 2020-12-11 21:59:59.998000
Histogram with fixed size bins (bins=50)
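The observation window and the distinct-timestamp percentage can be recomputed from the reported minimum, maximum, and counts. A minimal check using only the standard library:

```python
from datetime import datetime

# Reported minimum and maximum timestamps.
first = datetime.fromisoformat("2020-11-16 07:00:00.008000")
last = datetime.fromisoformat("2020-12-11 21:59:59.998000")
span = last - first

# Share of distinct timestamps among all observations.
distinct_pct = 100 * 553_986_674 / 642_940_848

print(f"Span: {span}")
print(f"Distinct: {distinct_pct:.1f}%")
```

The span comes to just under 25 days and 15 hours.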

value
Real number (ℝ)

SKEWED  ZEROS 

The ambient light level in SI lux units

Distinct: 404636
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 155.9931373
Minimum: -71238
Maximum: 21474836
Zeros: 204613392
Zeros (%): 31.8%
Negative: 116
Negative (%): < 0.1%
Memory size: 9.6 GiB

Quantile statistics

Minimum: -71238
5-th percentile: 0
Q1: 0
median: 9
Q3: 59
95-th percentile: 476
Maximum: 21474836
Range: 21546074
Interquartile range (IQR): 59

Descriptive statistics

Standard deviation: 2742.855611
Coefficient of variation (CV): 17.58318128
Kurtosis: 35068686.29
Mean: 155.9931373
Median Absolute Deviation (MAD): 9
Skewness: 4506.746548
Sum: 1.0029436 × 10^11
Variance: 7523256.904
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                   Count       Frequency (%)
0                       204613392   31.8%
1                       25694079    4.0%
2                       17382850    2.7%
3                       13540948    2.1%
5                       9098039     1.4%
4                       8934894     1.4%
6                       8839110     1.4%
9                       7686629     1.2%
7                       7435170     1.2%
8                       6164538     1.0%
Other values (404626)   333551199   51.9%

Value    Count   Frequency (%)
-71238   1       < 0.1%
-71070   1       < 0.1%
-70591   1       < 0.1%
-70189   1       < 0.1%
-70158   1       < 0.1%

Value         Count   Frequency (%)
21474836      6       < 0.1%
1190695.25    5       < 0.1%
1185558.125   1       < 0.1%
1135853.875   1       < 0.1%
894393.1875   1       < 0.1%
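The extreme values above include negative readings, which are not physically meaningful for illuminance, and a maximum of 21474836 that occurs six times. That maximum equals the largest signed 32-bit integer divided by 100 and truncated. Whether it is a saturation or sentinel value is an assumption; the report does not say. The sketch below separates such readings from the rest; the function name and the threshold choice are hypothetical.

```python
INT32_MAX = 2**31 - 1
REPORTED_MAX = 21_474_836  # maximum value in the report


def flag_implausible(values, upper=REPORTED_MAX):
    """Split readings into (kept, flagged).

    Flags negative values and values at or above `upper`.
    Treating `upper` as a saturation marker is an assumption.
    """
    kept, flagged = [], []
    for v in values:
        if v < 0 or v >= upper:
            flagged.append(v)
        else:
            kept.append(v)
    return kept, flagged


# A few values taken from the extreme-value and common-value tables.
sample = [0, 9, -71238, 21474836, 1190695.25]
kept, flagged = flag_implausible(sample)
print("kept:   ", kept)
print("flagged:", flagged)
print("INT32_MAX // 100 =", INT32_MAX // 100)
```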

Correlations

         userid   value
userid   1.000    0.010
value    0.010    1.000
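A correlation matrix like the one above holds one pairwise coefficient per variable pair, with 1.000 on the diagonal. The report does not state which coefficient was used, so the sketch below uses Pearson's r as an assumption and runs on toy data rather than the dataset.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Toy columns, not taken from the dataset.
userid = [1, 2, 3, 4, 5, 6]
value = [3, 0, 9, 0, 59, 1]

r = pearson(userid, value)
matrix = [[1.0, r], [r, 1.0]]  # symmetric, ones on the diagonal
for row in matrix:
    print(" ".join(f"{cell:6.3f}" for cell in row))
```

A coefficient near zero, such as the 0.010 reported here, indicates essentially no linear association between userid and value.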