Overview

Dataset statistics

Number of variables: 8
Number of observations: 20423751
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 4.2 GiB
Average record size in memory: 222.5 B
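
The average record size is the total memory footprint divided by the number of observations. The sketch below redoes that arithmetic, assuming GiB means 2^30 bytes. Because the report rounds the total to one decimal place, the result only approximates the reported 222.5 B.

```python
GIB = 2 ** 30  # bytes per GiB (binary prefix); an assumption about the report's units


def average_record_size(total_bytes: float, n_rows: int) -> float:
    """Return the mean number of bytes per row."""
    return total_bytes / n_rows


n_rows = 20_423_751
total_bytes = 4.2 * GIB  # rounded total printed in the report
avg = average_record_size(total_bytes, n_rows)
print(f"{avg:.1f} B per record")
```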

Variable types

Text: 5
Numeric: 2
DateTime: 1

Dataset

Description: [unitless] Returns whether the device is able to wirelessly exchange data with other Bluetooth devices. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
Creator: Andrea Bontempelli, Matteo Busso, Roy Alia Asiku
Author: Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL:
Copyright: (c) University of Trento - Knowledge Diversity 2023
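
The description mentions reducing the sampling frequency to one minute and keeping the first non-missing name for categorical variables. The following is a minimal sketch of such a reduction, not the authors' pipeline. The grouping key (userid plus minute), the use of the mean for `rssi`, and the helper names `to_minute` and `downsample` are all assumptions made for illustration.

```python
from datetime import datetime
from statistics import mean


def to_minute(ts: datetime) -> datetime:
    """Truncate a timestamp to the start of its minute."""
    return ts.replace(second=0, microsecond=0)


def downsample(rows):
    """Reduce rows to one record per (userid, minute).

    rows: iterable of dicts with keys userid, timestamp, name, rssi,
    assumed to be in chronological order.
    The categorical `name` keeps its first non-missing value; the numeric
    `rssi` is averaged (an assumption, the report does not say how
    numeric values are combined).
    """
    groups = {}
    for row in rows:
        key = (row["userid"], to_minute(row["timestamp"]))
        groups.setdefault(key, []).append(row)

    result = []
    for (userid, minute), members in sorted(groups.items()):
        names = [m["name"] for m in members if m["name"] is not None]
        result.append({
            "userid": userid,
            "timestamp": minute,
            "name": names[0] if names else None,
            "rssi": mean(m["rssi"] for m in members),
        })
    return result


# Made-up rows for illustration only.
rows = [
    {"userid": 6, "timestamp": datetime(2020, 11, 16, 7, 0, 0, 159000), "name": None, "rssi": -90},
    {"userid": 6, "timestamp": datetime(2020, 11, 16, 7, 0, 30), "name": "abc", "rssi": -80},
    {"userid": 6, "timestamp": datetime(2020, 11, 16, 7, 1, 5), "name": "def", "rssi": -70},
]
reduced = downsample(rows)
```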

Variable descriptions

experimentid: Experiment Id
userid: User id
timestamp: show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
address: MAC address.
bondstate: bond state of the remote device.
name: name of the remote device.
rssi: Received Signal Strength Indicator, an estimated measure of the power level received from an access point or router (dBm).
type: the type of Bluetooth device {normal, low-energy}

Alerts

experimentid has constant value "wenetDenmark" (Constant)
userid has 4575188 (22.4%) zeros (Zeros)

Reproduction

Analysis started: 2024-11-23 02:31:32.886841
Analysis finished: 2024-11-23 02:34:11.638867
Duration: 2 minutes and 38.75 seconds
Software version: ydata-profiling v4.8.3
Download configuration: config.json
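
The reported duration can be recomputed from the start and finish timestamps. A short check using only the standard library; the format string is an assumption that matches the timestamps as printed.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed format of the printed timestamps

started = datetime.strptime("2024-11-23 02:31:32.886841", FMT)
finished = datetime.strptime("2024-11-23 02:34:11.638867", FMT)

elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)
print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```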

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 545.4 MiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 245085012
Distinct characters: 9
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: wenetDenmark
2nd row: wenetDenmark
3rd row: wenetDenmark
4th row: wenetDenmark
5th row: wenetDenmark

Value         Count     Frequency (%)
wenetdenmark  20423751  100.0%

Most occurring characters

Value  Count     Frequency (%)
e      61271253  25.0%
n      40847502  16.7%
w      20423751  8.3%
t      20423751  8.3%
D      20423751  8.3%
m      20423751  8.3%
a      20423751  8.3%
r      20423751  8.3%
k      20423751  8.3%

Most occurring categories

Value      Count      Frequency (%)
(unknown)  245085012  100.0%

Most occurring scripts

Value      Count      Frequency (%)
(unknown)  245085012  100.0%

Most occurring blocks

Value      Count      Frequency (%)
(unknown)  245085012  100.0%

Most frequent character per category, script, and block

All characters fall in the single (unknown) group, so each per-group table repeats the counts listed under Most occurring characters.

userid
Real number (ℝ)

ZEROS 

User id

Distinct: 11
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.015324756
Minimum: 0
Maximum: 27
Zeros: 4575188
Zeros (%): 22.4%
Negative: 0
Negative (%): 0.0%
Memory size: 311.6 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
Median: 6
Q3: 6
95-th percentile: 23
Maximum: 27
Range: 27
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 7.184177964
Coefficient of variation (CV): 1.024069193
Kurtosis: 1.2547575
Mean: 7.015324756
Median Absolute Deviation (MAD): 0
Skewness: 1.51758816
Sum: 143279246
Variance: 51.61241303
Monotonicity: Increasing
Histogram with fixed size bins (bins=11)

Common values

Value  Count     Frequency (%)
6      12078780  59.1%
0      4575188   22.4%
23     2618871   12.8%
2      803605    3.9%
27     261823    1.3%
25     64793     0.3%
8      9413      < 0.1%
12     4491      < 0.1%
22     3616      < 0.1%
19     2131      < 0.1%

Minimum 5 values

Value  Count     Frequency (%)
0      4575188   22.4%
2      803605    3.9%
6      12078780  59.1%
8      9413      < 0.1%
12     4491      < 0.1%

Maximum 5 values

Value  Count     Frequency (%)
27     261823    1.3%
26     1040      < 0.1%
25     64793     0.3%
23     2618871   12.8%
22     3616      < 0.1%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct: 20213675
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 311.6 MiB
Minimum: 2020-11-16 07:00:00.159000
Maximum: 2020-12-11 21:59:24.810000
Histogram with fixed size bins (bins=50)
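
A histogram with fixed-size bins splits the interval between the minimum and maximum timestamps into equal-width slices. The sketch below computes the 51 edges for 50 bins from the two extremes reported above. The helper `bin_edges` is a hypothetical name, and this is not necessarily how the profiling tool builds its bins.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed format of the printed timestamps

t_min = datetime.strptime("2020-11-16 07:00:00.159000", FMT)
t_max = datetime.strptime("2020-12-11 21:59:24.810000", FMT)


def bin_edges(lo: datetime, hi: datetime, bins: int) -> list:
    """Return bins + 1 equally spaced edges between lo and hi."""
    width = (hi - lo) / bins  # timedelta divided by int gives a timedelta
    return [lo + i * width for i in range(bins + 1)]


edges = bin_edges(t_min, t_max, 50)
width = edges[1] - edges[0]
print(f"{len(edges) - 1} bins, each {width} wide")
```

With these extremes each bin covers a little over twelve hours of observations.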

address
Text

MAC address.

Distinct: 172641
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 GiB

Length

Max length: 64
Median length: 64
Mean length: 64
Min length: 64

Characters and Unicode

Total characters: 1307120064
Distinct characters: 16
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 16991
Unique (%): 0.1%

Sample

1st row: 7100f7d1156ab1082686e0681f3097151e3a367e61169cb3036b2806d23cae97
2nd row: 7100f7d1156ab1082686e0681f3097151e3a367e61169cb3036b2806d23cae97
3rd row: 7100f7d1156ab1082686e0681f3097151e3a367e61169cb3036b2806d23cae97
4th row: 7100f7d1156ab1082686e0681f3097151e3a367e61169cb3036b2806d23cae97
5th row: fbc54a9ac4f26c6af65366b661ad330a9e42b345c3613f7eaabd0a909e01baf7

Value                                                             Count     Frequency (%)
01a5e60382741d5bd63c7dcc336b8d3e69fa6f343ebdeb557b0418acacced2c4  541575    2.7%
4ea4e04a0055819e51f0705315801db1a525e854fec0c357c88c3ebd5e72c73d  484528    2.4%
0b6e8d6d191ea89a07b28b4c789613c812ccb5a4cc0973b7616185643eb7b718  480535    2.4%
37b8f920308c5da69cd4dbb8e8b70296e4649975f1ff5880464e14f5bea69fc1  363944    1.8%
dc3381be68731103094a4bdfc3c1a2ceed78b92b45038926afbe06fcf7ea0f24  319955    1.6%
ddc6a7e734aa5ef800618aa804a4b7127fb950ab60b2cb284fa56a4dd4d3ad0b  275275    1.3%
0eb5137677d55efadcced0d3c391b47d128cab6565f8a22c21b874e5e1afadc2  173699    0.9%
87e4668d4ae935e928251bf6907428341d530a5b8a5a5b450096873a6d35acff  173414    0.8%
da2b1e35d9e01fe8683d8457adfb1eb9ef10138ab828f7f94ae74d0bb1d327eb  136219    0.7%
aa9f839ce04c74a2851baf16cae2e6a3d91864f0deb9f6fbb284d4b10099b0f7  98275     0.5%
Other values (172631)                                             17376332  85.1%
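
Every address is exactly 64 characters drawn from 16 distinct symbols, which is the shape of a hexadecimal SHA-256 digest rather than a raw MAC address. The report does not say how the values were pseudonymized, so the sketch below only checks the shape and shows that hashing a made-up MAC address with SHA-256 yields a string of the same form. The `pseudonymize` helper and its normalization step are hypothetical.

```python
import hashlib
import re

HEX64 = re.compile(r"[0-9a-f]{64}")


def looks_like_sha256(value: str) -> bool:
    """True if value is exactly 64 lowercase hexadecimal characters."""
    return HEX64.fullmatch(value) is not None


def pseudonymize(mac: str) -> str:
    """Hypothetical pseudonymization: SHA-256 of the normalized MAC string."""
    return hashlib.sha256(mac.strip().lower().encode("utf-8")).hexdigest()


# First sample row from the report.
sample = "7100f7d1156ab1082686e0681f3097151e3a367e61169cb3036b2806d23cae97"
# Made-up MAC address, not taken from the dataset.
digest = pseudonymize("AA:BB:CC:DD:EE:FF")

print(looks_like_sha256(sample), looks_like_sha256(digest))
```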

Most occurring characters

Value             Count      Frequency (%)
b                 84706841   6.5%
e                 83174766   6.4%
1                 83170275   6.4%
8                 82847245   6.3%
5                 82798946   6.3%
c                 82755007   6.3%
d                 82596030   6.3%
0                 82445162   6.3%
3                 82348374   6.3%
a                 82126389   6.3%
Other values (6)  478151029  36.6%

Most occurring categories

Value      Count       Frequency (%)
(unknown)  1307120064  100.0%

Most occurring scripts

Value      Count       Frequency (%)
(unknown)  1307120064  100.0%

Most occurring blocks

Value      Count       Frequency (%)
(unknown)  1307120064  100.0%

Most frequent character per category, script, and block

All characters fall in the single (unknown) group, so each per-group table repeats the counts listed under Most occurring characters.

bondstate
Text

bond state of the remote device.

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 487.0 MiB

Length

Max length: 11
Median length: 9
Mean length: 9.004244372
Min length: 9

Characters and Unicode

Total characters: 183900445
Distinct characters: 6
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: BOND_NONE
2nd row: BOND_NONE
3rd row: BOND_NONE
4th row: BOND_NONE
5th row: BOND_NONE

Value        Count     Frequency (%)
bond_none    20380408  99.8%
bond_bonded  43343     0.2%
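
The length statistics follow directly from the two value counts. A short check that rebuilds the total character count and the mean length from the table, using the casing shown in the sample rows:

```python
# Value counts copied from the report (casing as in the sample rows).
counts = {"BOND_NONE": 20_380_408, "BOND_BONDED": 43_343}

total_rows = sum(counts.values())
total_chars = sum(len(value) * count for value, count in counts.items())
mean_length = total_chars / total_rows

print(total_rows, total_chars, round(mean_length, 9))
```

The small excess over 9 in the mean comes entirely from the 43343 rows holding the 11-character value.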

Most occurring characters

Value  Count     Frequency (%)
N      61227910  33.3%
O      40847502  22.2%
D      20510437  11.2%
B      20467094  11.1%
_      20423751  11.1%
E      20423751  11.1%

Most occurring categories

Value      Count      Frequency (%)
(unknown)  183900445  100.0%

Most occurring scripts

Value      Count      Frequency (%)
(unknown)  183900445  100.0%

Most occurring blocks

Value      Count      Frequency (%)
(unknown)  183900445  100.0%

Most frequent character per category, script, and block

All characters fall in the single (unknown) group, so each per-group table repeats the counts listed under Most occurring characters.

name
Text

name of the remote device.

Distinct: 4005
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 GiB

Length

Max length: 64
Median length: 64
Mean length: 64
Min length: 64

Characters and Unicode

Total characters: 1307120064
Distinct characters: 16
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 481
Unique (%): < 0.1%

Sample

1st row: 2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27
2nd row: 2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27
3rd row: 2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27
4th row: 2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27
5th row: 2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27

Value                                                             Count     Frequency (%)
2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27  17778057  87.0%
c21f0f58d1a5f5f23d85d660d8d36793255bd4f84ec459bfb82a2f4f671e2306  331615    1.6%
a3541388c4ab5be79d808e4414d1a0f46157dd2b1094bb7ff4263c66d3bc80d9  279700    1.4%
0c423d56a3f738747bf33756749690d0f534c52aab7c7dd3863351cddae281da  274503    1.3%
19b47abbfbf89594e389fcd5679dafa5e4b470b19248e4f8c5761f08aff1fea0  203758    1.0%
53fe7a77b9f6750a132fefb64d5f07896e1a2b596bcccc5454616829c8b62322  177714    0.9%
cb280a7baae36a9d23d1a37460305002f16f2463f3ab5b9a450f3e55466dec33  86992     0.4%
85760f5ef656468ff05434283ac0a899e8ee67a10de3273633718626a0ddc955  73773     0.4%
551c5eb540adda61c939d32230ccf0a80fe66256981b892669e544b893cc73a1  53347     0.3%
7a54780a9f2e9fbd1b03efc05c19ed19db8975849bef976e8f4dc5df299bb72e  49408     0.2%
Other values (3995)                                               1114884   5.5%

Most occurring characters

Value             Count      Frequency (%)
7                 135119601  10.3%
b                 117194934  9.0%
a                 117023086  9.0%
6                 100432787  7.7%
8                 99695423   7.6%
2                 98790771   7.6%
9                 98521873   7.5%
5                 83373945   6.4%
0                 80434735   6.2%
e                 79435195   6.1%
Other values (6)  297097714  22.7%

Most occurring categories

Value      Count       Frequency (%)
(unknown)  1307120064  100.0%

Most occurring scripts

Value      Count       Frequency (%)
(unknown)  1307120064  100.0%

Most occurring blocks

Value      Count       Frequency (%)
(unknown)  1307120064  100.0%

Most frequent character per category, script, and block

All characters fall in the single (unknown) group, so each per-group table repeats the counts listed under Most occurring characters.

rssi
Real number (ℝ)

Received Signal Strength Indicator, an estimated measure of the power level received from an access point or router (dBm).

Distinct: 109
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -84.02668922
Minimum: -116
Maximum: 127
Zeros: 0
Zeros (%): 0.0%
Negative: 20423473
Negative (%): > 99.9%
Memory size: 311.6 MiB

Quantile statistics

Minimum: -116
5-th percentile: -98
Q1: -92
Median: -86
Q3: -78
95-th percentile: -65
Maximum: 127
Range: 243
Interquartile range (IQR): 14

Descriptive statistics

Standard deviation: 10.77502606
Coefficient of variation (CV): -0.1282333763
Kurtosis: 1.658277057
Mean: -84.02668922
Median Absolute Deviation (MAD): 7
Skewness: 1.030214781
Sum: -1716140178
Variance: 116.1011866
Monotonicity: Not monotonic
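
Several of these figures are derived from one another: the variance is the squared standard deviation, the coefficient of variation is the standard deviation divided by the mean, the sum is the mean times the number of observations, and the range and IQR are differences of quantiles. The sketch below recomputes them from the reported values as a consistency check. It works from the rounded numbers printed in the report, so agreement is only up to rounding.

```python
# Values copied from the rssi section of the report.
n = 20_423_751
mean = -84.02668922
std = 10.77502606
q1, q3 = -92, -78
minimum, maximum = -116, 127

variance = std ** 2       # reported: 116.1011866
cv = std / mean           # reported: -0.1282333763 (negative because the mean is negative)
total = mean * n          # reported: -1716140178
iqr = q3 - q1             # reported: 14
value_range = maximum - minimum  # reported: 243

print(f"variance={variance:.7f} cv={cv:.10f} sum={total:.0f} iqr={iqr} range={value_range}")
```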
Histogram with fixed size bins (bins=50)

Common values

Value              Count     Frequency (%)
-93                1011706   5.0%
-87                912902    4.5%
-88                860908    4.2%
-94                845991    4.1%
-91                824400    4.0%
-90                819907    4.0%
-92                812931    4.0%
-86                812642    4.0%
-89                808564    4.0%
-95                779100    3.8%
Other values (99)  11934700  58.4%

Minimum 5 values

Value  Count  Frequency (%)
-116   3      < 0.1%
-115   2      < 0.1%
-114   9      < 0.1%
-113   4      < 0.1%
-112   2      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
127    2      < 0.1%
80     2      < 0.1%
79     2      < 0.1%
68     1      < 0.1%
63     4      < 0.1%
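
The report counts 20423473 negative readings and no zeros out of 20423751 observations, which leaves 278 positive values, including the maximum of 127. Positive dBm readings are implausible for a received Bluetooth signal, and 127 is commonly used as a "not available" marker, so an analysis may want to set such rows aside. A minimal sketch, assuming a cutoff of 0 dBm; the threshold and the helper name `split_plausible` are illustrative choices and not part of the dataset documentation.

```python
def split_plausible(rssi_values, max_valid_dbm=0):
    """Split readings into plausible (< max_valid_dbm) and suspect (>= max_valid_dbm)."""
    plausible = [v for v in rssi_values if v < max_valid_dbm]
    suspect = [v for v in rssi_values if v >= max_valid_dbm]
    return plausible, suspect


# Illustrative readings built from values that appear in the tables above.
readings = [-93, -87, 127, -116, 80, 63, -65]
plausible, suspect = split_plausible(readings)

print("plausible:", plausible)
print("suspect:", suspect)
```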

type
Text

the type of Bluetooth device {normal, low-energy}

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 341.5 MiB

Length

Max length: 2
Median length: 2
Mean length: 1.535419718
Min length: 1

Characters and Unicode

Total characters: 31359030
Distinct characters: 3
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: n
2nd row: le
3rd row: n
4th row: n
5th row: le

Value  Count     Frequency (%)
le     10935279  53.5%
n      9488472   46.5%

Most occurring characters

Value  Count     Frequency (%)
l      10935279  34.9%
e      10935279  34.9%
n      9488472   30.3%

Most occurring categories

Value      Count     Frequency (%)
(unknown)  31359030  100.0%

Most occurring scripts

Value      Count     Frequency (%)
(unknown)  31359030  100.0%

Most occurring blocks

Value      Count     Frequency (%)
(unknown)  31359030  100.0%

Most frequent character per category, script, and block

All characters fall in the single (unknown) group, so each per-group table repeats the counts listed under Most occurring characters.

Correlations

        rssi   userid
rssi    1.000  0.053
userid  0.053  1.000
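
The report does not state which correlation coefficient produced this matrix. Purely as an illustration of how a symmetric matrix with a unit diagonal arises, the sketch below computes Pearson correlations on a handful of made-up values. The choice of Pearson, the toy data, and the helper names are all assumptions, and the output is not expected to match the 0.053 above.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def correlation_matrix(columns):
    """Return a nested dict of pairwise correlations between named columns."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}


# Toy values for illustration only, not rows from the dataset.
toy = {"rssi": [-93, -87, -88, -94, -70], "userid": [0, 6, 6, 23, 2]}
matrix = correlation_matrix(toy)

for name, row in matrix.items():
    print(name, {other: round(value, 3) for other, value in row.items()})
```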