Overview

Dataset statistics

Number of variables8
Number of observations36889295
Missing cells338
Missing cells (%)< 0.1%
Total size in memory8.6 GiB
Average record size in memory251.5 B

Variable types

Text4
Numeric3
DateTime1

Dataset

Description[unitless] Returns all WIFI networks detected by the smartphone. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
addressis a unique identifier assigned to a network interface controller (NIC) for use as a network address in communications within a network segment.
capabilitiesallows local area networks (LANs) to operate without cables and wiring
frequencythe WIFI frequency band, that includes two frequency ranges within the wireless spectrum that are designated to carry WIFI
namethe name assigned to the WIFI network
rssi(Received Signal Strength Indicator) is an estimated measurement of how well a device can hear, detect, and receive signals from any wireless access point or Wi-Fi router. An RSSI closer to 0 is stronger, and closer to –100 is weaker.

Alerts

experimentid has constant value "wenetItaly"Constant

Reproduction

Analysis started2024-11-23 11:08:06.765981
Analysis finished2024-11-23 11:14:32.743322
Duration6 minutes and 25.98 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size914.7 MiB
2024-11-23T12:14:32.815205image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters368892950
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetItaly
2nd rowwenetItaly
3rd rowwenetItaly
4th rowwenetItaly
5th rowwenetItaly
ValueCountFrequency (%)
wenetitaly 36889295
100.0%
2024-11-23T12:14:33.020541image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 73778590
20.0%
t 73778590
20.0%
w 36889295
10.0%
n 36889295
10.0%
I 36889295
10.0%
a 36889295
10.0%
l 36889295
10.0%
y 36889295
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 368892950
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 73778590
20.0%
t 73778590
20.0%
w 36889295
10.0%
n 36889295
10.0%
I 36889295
10.0%
a 36889295
10.0%
l 36889295
10.0%
y 36889295
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 368892950
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 73778590
20.0%
t 73778590
20.0%
w 36889295
10.0%
n 36889295
10.0%
I 36889295
10.0%
a 36889295
10.0%
l 36889295
10.0%
y 36889295
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 368892950
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 73778590
20.0%
t 73778590
20.0%
w 36889295
10.0%
n 36889295
10.0%
I 36889295
10.0%
a 36889295
10.0%
l 36889295
10.0%
y 36889295
10.0%

userid
Real number (ℝ)

User id

Distinct209
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean125.5390154
Minimum0
Maximum264
Zeros7981
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size562.9 MiB
2024-11-23T12:14:33.151006image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile8
Q140
median120
Q3202
95-th percentile258
Maximum264
Range264
Interquartile range (IQR)162

Descriptive statistics

Standard deviation85.09628021
Coefficient of variation (CV)0.6778472809
Kurtosis-1.382273655
Mean125.5390154
Median Absolute Deviation (MAD)80
Skewness0.09638768766
Sum4631045772
Variance7241.376906
MonotonicityIncreasing
2024-11-23T12:14:33.276488image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
217 1598828
 
4.3%
111 1541150
 
4.2%
162 1495036
 
4.1%
258 1399218
 
3.8%
18 1363977
 
3.7%
19 1202119
 
3.3%
114 1145831
 
3.1%
31 1058985
 
2.9%
5 1009035
 
2.7%
177 993704
 
2.7%
Other values (199) 24081412
65.3%
ValueCountFrequency (%)
0 7981
 
< 0.1%
1 209178
0.6%
2 334633
0.9%
3 50218
 
0.1%
4 8099
 
< 0.1%
ValueCountFrequency (%)
264 6792
 
< 0.1%
263 649781
1.8%
262 302840
0.8%
260 4916
 
< 0.1%
259 41514
 
0.1%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct36568693
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size562.9 MiB
Minimum2020-11-16 07:00:00.086000
Maximum2020-12-11 21:59:59.454000
2024-11-23T12:14:33.405974image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T12:14:33.525228image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

address
Text

is a unique identifier assigned to a network interface controller (NIC) for use as a network address in communications within a network segment.

Distinct57271
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size2.7 GiB
2024-11-23T12:14:33.745890image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length64
Median length64
Mean length64
Min length64

Characters and Unicode

Total characters2360914880
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4909 ?
Unique (%)< 0.1%

Sample

1st rowbdfbd36bbd0ab5120b86d2e91b3d52c29b9b5c56235221bd455efbf1715d3bc5
2nd row25012a88dfda34aa0806dfee04d401b739ff4cbc82749f548d2abbec50f93bcf
3rd row488dc1a2e5f10b266194b7022297c9f60fe8dfd440b6334c004394922a6f5f98
4th row0040d91ad14b2e476dceaaa1e657357987d146e76dc11599e9a31f161dd71a3f
5th rowb590fa73ea1547cc31b6ff4e2ffe80a64f66a21e462d76fa5c558202a9b60acc
ValueCountFrequency (%)
c4e6232b74491fd44f0bb5b1895466b0030a87d3a9995562e4894008cfcf03b3 526717
 
1.4%
25e1062155c26297f22e19c5d78502cb70e3ef0171330da260c5a94afaf333b1 482079
 
1.3%
79a0ccf2add998d40b9a88b0fad222991fbdeedad6132aac1c2fbfabdea34ce8 477814
 
1.3%
d9b4442ba460e4ed75f7c9d0714cf793fd99d9c5eeb9b9416b86a9bf81940e60 419603
 
1.1%
0474bcf4d94559a842fe1bb2750df17ef6ace13c6a2654ade7ff47854f34743b 342520
 
0.9%
44b3cfa8f19ddc20204996bdb0cbddd0d9d7c2c60442677733a6003fcc626997 271491
 
0.7%
97903c6ecc3e70658799065fbf424c6de8684dd3d0378236048470fe80eb08d8 268996
 
0.7%
9c6b32280fc40b392667e3a6826a374daf2ec069898af9cf8af71cc0ad626a00 263631
 
0.7%
b4545bc4694be939669e4bb60efd20b98346c30473a47f57a75593e18a2c334d 249626
 
0.7%
30ea66d08d432103d600825c4002e3b9e94b0cbd0f8892ef278f3b7278af8267 242455
 
0.7%
Other values (57261) 33344363
90.4%
2024-11-23T12:14:34.060165image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 153927248
 
6.5%
9 153018183
 
6.5%
0 150002227
 
6.4%
f 149470441
 
6.3%
6 149345465
 
6.3%
d 149280020
 
6.3%
2 148579987
 
6.3%
e 147247547
 
6.2%
c 146879509
 
6.2%
8 146363918
 
6.2%
Other values (6) 866800335
36.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2360914880
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
4 153927248
 
6.5%
9 153018183
 
6.5%
0 150002227
 
6.4%
f 149470441
 
6.3%
6 149345465
 
6.3%
d 149280020
 
6.3%
2 148579987
 
6.3%
e 147247547
 
6.2%
c 146879509
 
6.2%
8 146363918
 
6.2%
Other values (6) 866800335
36.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2360914880
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
4 153927248
 
6.5%
9 153018183
 
6.5%
0 150002227
 
6.4%
f 149470441
 
6.3%
6 149345465
 
6.3%
d 149280020
 
6.3%
2 148579987
 
6.3%
e 147247547
 
6.2%
c 146879509
 
6.2%
8 146363918
 
6.2%
Other values (6) 866800335
36.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2360914880
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
4 153927248
 
6.5%
9 153018183
 
6.5%
0 150002227
 
6.4%
f 149470441
 
6.3%
6 149345465
 
6.3%
d 149280020
 
6.3%
2 148579987
 
6.3%
e 147247547
 
6.2%
c 146879509
 
6.2%
8 146363918
 
6.2%
Other values (6) 866800335
36.7%

capabilities
Text

allows local area networks (LANs) to operate without cables and wiring

Distinct535
Distinct (%)< 0.1%
Missing338
Missing (%)< 0.1%
Memory size2.0 GiB
2024-11-23T12:14:34.196221image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length597
Median length575
Mean length41.49492383
Min length7

Characters and Unicode

Total characters1530704461
Distinct characters41
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)< 0.1%

Sample

1st row['WPA2-EAP-CCMP', 'ESS']
2nd row['WPA2-EAP-CCMP', 'ESS']
3rd row['WPA2-EAP-CCMP', 'ESS']
4th row['WPA2-EAP-CCMP', 'ESS']
5th row['WPA2-EAP-CCMP', 'ESS']
ValueCountFrequency (%)
ess 36884548
28.8%
wpa2-psk-ccmp 25806782
20.2%
wps 22346222
17.5%
rsn-psk-ccmp 19193875
15.0%
wfa-ht 3979657
 
3.1%
wpa2-eap-ccmp 3302490
 
2.6%
rsn-eap-ccmp 2930129
 
2.3%
wpa2-psk-tkip+ccmp 1307828
 
1.0%
wpa2-psk-ccmp+tkip 1294937
 
1.0%
wpa-psk-tkip+ccmp 1293444
 
1.0%
Other values (67) 9694114
 
7.6%
2024-11-23T12:14:34.454752image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
' 256068052
16.7%
P 189511509
12.4%
S 174869247
11.4%
- 126474177
8.3%
C 119387797
7.8%
, 91145069
 
6.0%
91145069
 
6.0%
W 63900997
 
4.2%
K 63216663
 
4.1%
M 59637501
 
3.9%
Other values (31) 295348380
19.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 1530704461
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
' 256068052
16.7%
P 189511509
12.4%
S 174869247
11.4%
- 126474177
8.3%
C 119387797
7.8%
, 91145069
 
6.0%
91145069
 
6.0%
W 63900997
 
4.2%
K 63216663
 
4.1%
M 59637501
 
3.9%
Other values (31) 295348380
19.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 1530704461
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
' 256068052
16.7%
P 189511509
12.4%
S 174869247
11.4%
- 126474177
8.3%
C 119387797
7.8%
, 91145069
 
6.0%
91145069
 
6.0%
W 63900997
 
4.2%
K 63216663
 
4.1%
M 59637501
 
3.9%
Other values (31) 295348380
19.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 1530704461
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
' 256068052
16.7%
P 189511509
12.4%
S 174869247
11.4%
- 126474177
8.3%
C 119387797
7.8%
, 91145069
 
6.0%
91145069
 
6.0%
W 63900997
 
4.2%
K 63216663
 
4.1%
M 59637501
 
3.9%
Other values (31) 295348380
19.3%

frequency
Real number (ℝ)

the WIFI frequency band, that includes two frequency ranges within the wireless spectrum that are designated to carry WIFI

Distinct38
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2977.852404
Minimum2412
Maximum5825
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size562.9 MiB
2024-11-23T12:14:34.573502image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum2412
5-th percentile2412
Q12417
median2437
Q32462
95-th percentile5500
Maximum5825
Range3413
Interquartile range (IQR)45

Descriptive statistics

Standard deviation1134.209477
Coefficient of variation (CV)0.3808816969
Kurtosis0.6657964453
Mean2977.852404
Median Absolute Deviation (MAD)25
Skewness1.624012486
Sum1.098508758 × 1011
Variance1286431.137
MonotonicityNot monotonic
2024-11-23T12:14:34.682885image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
2412 8596010
23.3%
2437 6415242
17.4%
2462 5698406
15.4%
5180 1484337
 
4.0%
5220 1275308
 
3.5%
2417 1183188
 
3.2%
2432 1103147
 
3.0%
2422 1045411
 
2.8%
2457 950929
 
2.6%
2427 942533
 
2.6%
Other values (28) 8194784
22.2%
ValueCountFrequency (%)
2412 8596010
23.3%
2417 1183188
 
3.2%
2422 1045411
 
2.8%
2427 942533
 
2.6%
2432 1103147
 
3.0%
ValueCountFrequency (%)
5825 950
 
< 0.1%
5805 8096
 
< 0.1%
5785 16568
< 0.1%
5765 769
 
< 0.1%
5745 21867
0.1%

name
Text

the name assigned to the WIFI network

Distinct33943
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size2.7 GiB
2024-11-23T12:14:34.839095image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length64
Median length64
Mean length64
Min length64

Characters and Unicode

Total characters2360914880
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2935 ?
Unique (%)< 0.1%

Sample

1st row3dc7ddf8e0b15159d838aae365c9a2f56fe9f78da5162e6d92133cd20c3133bd
2nd row3dc7ddf8e0b15159d838aae365c9a2f56fe9f78da5162e6d92133cd20c3133bd
3rd rowbd26310c862b015c8e7f7338bb22c8954b92569ec8703f6ee698e57d49a4fa0e
4th row3dc7ddf8e0b15159d838aae365c9a2f56fe9f78da5162e6d92133cd20c3133bd
5th rowbd26310c862b015c8e7f7338bb22c8954b92569ec8703f6ee698e57d49a4fa0e
ValueCountFrequency (%)
f3cad6e99345bd8f7712200230f5fd88ccde37f698094fe074db4efe8f5117fd 2329355
 
6.3%
ef0c79ca0128874f83e93e02ae12ed0b77a8b4ec5f68f27a570cf418525a3724 1618487
 
4.4%
ea2bad812c6e3523d7dbac0b2fe3baabd456a7936fd028c823258e7199ca6dd1 1433360
 
3.9%
3dc7ddf8e0b15159d838aae365c9a2f56fe9f78da5162e6d92133cd20c3133bd 881892
 
2.4%
bd26310c862b015c8e7f7338bb22c8954b92569ec8703f6ee698e57d49a4fa0e 804720
 
2.2%
c870b8bf9719948325919170711ea8c00d4ab54e2a8f9d8a518171083896c1a3 536496
 
1.5%
33fa0c7edbcad6d151da9d5e349b5c5ad85ce13e01fc4c29cf500f2f85ea5ad2 526717
 
1.4%
3c8b0a6061fa3612c2e2190b508fa82833c0928a70af9908addf8263dc89a382 508322
 
1.4%
0f9054041aee653ebf03cfe9ecff9c44bf75df9a97d8b2c40ca3365237d46ced 463188
 
1.3%
6515de6b2ff936c0ca104b3f48900d7bf6afb94489835f2718d135be8ea54509 398494
 
1.1%
Other values (33933) 27388264
74.2%
2024-11-23T12:14:35.100809image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 159013092
 
6.7%
a 158561012
 
6.7%
f 156204381
 
6.6%
2 155126407
 
6.6%
e 155002593
 
6.6%
d 153421275
 
6.5%
3 150530738
 
6.4%
9 148056317
 
6.3%
c 147718823
 
6.3%
7 145320499
 
6.2%
Other values (6) 831959743
35.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2360914880
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
8 159013092
 
6.7%
a 158561012
 
6.7%
f 156204381
 
6.6%
2 155126407
 
6.6%
e 155002593
 
6.6%
d 153421275
 
6.5%
3 150530738
 
6.4%
9 148056317
 
6.3%
c 147718823
 
6.3%
7 145320499
 
6.2%
Other values (6) 831959743
35.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2360914880
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
8 159013092
 
6.7%
a 158561012
 
6.7%
f 156204381
 
6.6%
2 155126407
 
6.6%
e 155002593
 
6.6%
d 153421275
 
6.5%
3 150530738
 
6.4%
9 148056317
 
6.3%
c 147718823
 
6.3%
7 145320499
 
6.2%
Other values (6) 831959743
35.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2360914880
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
8 159013092
 
6.7%
a 158561012
 
6.7%
f 156204381
 
6.6%
2 155126407
 
6.6%
e 155002593
 
6.6%
d 153421275
 
6.5%
3 150530738
 
6.4%
9 148056317
 
6.3%
c 147718823
 
6.3%
7 145320499
 
6.2%
Other values (6) 831959743
35.2%

rssi
Real number (ℝ)

(Received Signal Strength Indicator) is an estimated measurement of how well a device can hear, detect, and receive signals from any wireless access point or Wi-Fi router. An RSSI closer to 0 is stronger, and closer to –100 is weaker.

Distinct104
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-83.2391975
Minimum-106
Maximum0
Zeros165
Zeros (%)< 0.1%
Negative36889130
Negative (%)> 99.9%
Memory size562.9 MiB
2024-11-23T12:14:35.226017image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum-106
5-th percentile-94
Q1-90
median-87
Q3-80
95-th percentile-58
Maximum0
Range106
Interquartile range (IQR)10

Descriptive statistics

Standard deviation10.88641683
Coefficient of variation (CV)-0.1307847403
Kurtosis3.587670167
Mean-83.2391975
Median Absolute Deviation (MAD)4
Skewness1.844862338
Sum-3070635312
Variance118.5140714
MonotonicityNot monotonic
2024-11-23T12:14:35.354784image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-90 5723071
15.5%
-89 3533916
 
9.6%
-88 2784686
 
7.5%
-87 1950412
 
5.3%
-86 1781121
 
4.8%
-91 1670180
 
4.5%
-92 1533793
 
4.2%
-85 1408350
 
3.8%
-84 1179590
 
3.2%
-93 1142766
 
3.1%
Other values (94) 14181410
38.4%
ValueCountFrequency (%)
-106 48
 
< 0.1%
-105 8
 
< 0.1%
-104 51
 
< 0.1%
-103 233
 
< 0.1%
-102 1124
< 0.1%
ValueCountFrequency (%)
0 165
< 0.1%
-4 3
 
< 0.1%
-5 20
 
< 0.1%
-6 15
 
< 0.1%
-7 48
 
< 0.1%

Correlations

2024-11-23T12:14:35.425903image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
frequencyrssiuserid
frequency1.000-0.0590.009
rssi-0.0591.0000.072
userid0.0090.0721.000