Overview

Dataset statistics

Number of variables8
Number of observations6732122
Missing cells1100
Missing cells (%)< 0.1%
Total size in memory1.6 GiB
Average record size in memory253.9 B

Variable types

Text4
Numeric3
DateTime1

Dataset

Description[unitless] Returns all WIFI networks detected by the smartphone. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
addressis a unique identifier assigned to a network interface controller (NIC) for use as a network address in communications within a network segment.
capabilitiesallows local area networks (LANs) to operate without cables and wiring
frequencythe WIFI frequency band, that includes two frequency ranges within the wireless spectrum that are designated to carry WIFI
namethe name assigned to the WIFI network
rssi(Received Signal Strength Indicator) is an estimated measurement of how well a device can hear, detect, and receive signals from any wireless access point or Wi-Fi router. An RSSI closer to 0 is stronger, and closer to –100 is weaker.

Alerts

experimentid has constant value "wenetDenmark"Constant
userid has 380185 (5.6%) zerosZeros

Reproduction

Analysis started2024-11-23 02:30:33.080289
Analysis finished2024-11-23 02:31:21.293046
Duration48.21 seconds
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size179.8 MiB
2024-11-23T03:31:21.398322image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters80785464
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetDenmark
2nd rowwenetDenmark
3rd rowwenetDenmark
4th rowwenetDenmark
5th rowwenetDenmark
ValueCountFrequency (%)
wenetdenmark 6732122
100.0%
2024-11-23T03:31:21.606261image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 20196366
25.0%
n 13464244
16.7%
w 6732122
 
8.3%
t 6732122
 
8.3%
D 6732122
 
8.3%
m 6732122
 
8.3%
a 6732122
 
8.3%
r 6732122
 
8.3%
k 6732122
 
8.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 80785464
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 20196366
25.0%
n 13464244
16.7%
w 6732122
 
8.3%
t 6732122
 
8.3%
D 6732122
 
8.3%
m 6732122
 
8.3%
a 6732122
 
8.3%
r 6732122
 
8.3%
k 6732122
 
8.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 80785464
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 20196366
25.0%
n 13464244
16.7%
w 6732122
 
8.3%
t 6732122
 
8.3%
D 6732122
 
8.3%
m 6732122
 
8.3%
a 6732122
 
8.3%
r 6732122
 
8.3%
k 6732122
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 80785464
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 20196366
25.0%
n 13464244
16.7%
w 6732122
 
8.3%
t 6732122
 
8.3%
D 6732122
 
8.3%
m 6732122
 
8.3%
a 6732122
 
8.3%
r 6732122
 
8.3%
k 6732122
 
8.3%

userid
Real number (ℝ)

ZEROS 

User id

Distinct17
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.01260568
Minimum0
Maximum27
Zeros380185
Zeros (%)5.6%
Negative0
Negative (%)0.0%
Memory size102.7 MiB
2024-11-23T03:31:21.714256image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q13
median6
Q317
95-th percentile23
Maximum27
Range27
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.466202217
Coefficient of variation (CV)0.8455543426
Kurtosis-1.42599807
Mean10.01260568
Median Absolute Deviation (MAD)4
Skewness0.5238303021
Sum67406083
Variance71.67657997
MonotonicityIncreasing
2024-11-23T03:31:21.817587image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
3 1940349
28.8%
23 1343189
20.0%
6 1336716
19.9%
17 1155375
17.2%
2 478348
 
7.1%
0 380185
 
5.6%
16 23969
 
0.4%
21 17133
 
0.3%
25 15986
 
0.2%
26 15943
 
0.2%
Other values (7) 24929
 
0.4%
ValueCountFrequency (%)
0 380185
 
5.6%
2 478348
 
7.1%
3 1940349
28.8%
6 1336716
19.9%
8 713
 
< 0.1%
ValueCountFrequency (%)
27 1896
 
< 0.1%
26 15943
 
0.2%
25 15986
 
0.2%
23 1343189
20.0%
22 14065
 
0.2%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct6723472
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size102.7 MiB
Minimum2020-11-16 07:00:03.103000
Maximum2020-12-11 21:59:58.477000
2024-11-23T03:31:21.935594image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T03:31:22.062100image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

address
Text

is a unique identifier assigned to a network interface controller (NIC) for use as a network address in communications within a network segment.

Distinct12932
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size513.6 MiB
2024-11-23T03:31:22.208566image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length64
Median length64
Mean length64
Min length64

Characters and Unicode

Total characters430855808
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique844 ?
Unique (%)< 0.1%

Sample

1st row605cfb2d9f172c7bc67a3033ba0eb9c3ccd961a2e7ca21cababad7bad395640d
2nd row605cfb2d9f172c7bc67a3033ba0eb9c3ccd961a2e7ca21cababad7bad395640d
3rd row605cfb2d9f172c7bc67a3033ba0eb9c3ccd961a2e7ca21cababad7bad395640d
4th row605cfb2d9f172c7bc67a3033ba0eb9c3ccd961a2e7ca21cababad7bad395640d
5th row605cfb2d9f172c7bc67a3033ba0eb9c3ccd961a2e7ca21cababad7bad395640d
ValueCountFrequency (%)
dccf94f7df5ef9d0b9a06d3a35ef12aeadb74ebcc5ffa6f67a98326d53768e90 509153
 
7.6%
8121176a7e0c776e9c84d8b0137b14e47b4e5047a644547efea87072b7811c8b 247906
 
3.7%
d6e8f50e46af64eb4621f7d89f50cec10680e85be711a891695e8870c4618173 243777
 
3.6%
4577456c9a31a1245937363cd5ca56f5a31661005058231a45c61bb768c66ff7 182997
 
2.7%
103d6e3015bfc39530c68e10fd5909125d4a2a34dc69255b056d621d7ad46ef4 162392
 
2.4%
8f51637dae41e0dfd0b33441e040cb6176ea5408704577ded1409aa2ea814174 146532
 
2.2%
c0df6c858d229375c86da93fac165f076a001b52eb84d061c6186c2c7f992c74 142551
 
2.1%
9d1e7e8daba12e0340d8807b957bb954eb73d1dcf5dc7cadd127c7231d3642b8 122362
 
1.8%
dc29410d367ce71811bd82e927d539a5361ff108b1f8bdcfb22a215acb8a24bd 114292
 
1.7%
a8de8f44324d531f761b70e2b725fa09f9f0f182e43018fba325db4ecc904115 108793
 
1.6%
Other values (12922) 4751367
70.6%
2024-11-23T03:31:22.450897image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 29171239
 
6.8%
1 28531437
 
6.6%
7 28216384
 
6.5%
4 28062669
 
6.5%
0 27921812
 
6.5%
e 27422054
 
6.4%
d 27413354
 
6.4%
5 27102756
 
6.3%
f 26611769
 
6.2%
a 26564438
 
6.2%
Other values (6) 153837896
35.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 430855808
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
6 29171239
 
6.8%
1 28531437
 
6.6%
7 28216384
 
6.5%
4 28062669
 
6.5%
0 27921812
 
6.5%
e 27422054
 
6.4%
d 27413354
 
6.4%
5 27102756
 
6.3%
f 26611769
 
6.2%
a 26564438
 
6.2%
Other values (6) 153837896
35.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 430855808
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
6 29171239
 
6.8%
1 28531437
 
6.6%
7 28216384
 
6.5%
4 28062669
 
6.5%
0 27921812
 
6.5%
e 27422054
 
6.4%
d 27413354
 
6.4%
5 27102756
 
6.3%
f 26611769
 
6.2%
a 26564438
 
6.2%
Other values (6) 153837896
35.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 430855808
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
6 29171239
 
6.8%
1 28531437
 
6.6%
7 28216384
 
6.5%
4 28062669
 
6.5%
0 27921812
 
6.5%
e 27422054
 
6.4%
d 27413354
 
6.4%
5 27102756
 
6.3%
f 26611769
 
6.2%
a 26564438
 
6.2%
Other values (6) 153837896
35.7%

capabilities
Text

allows local area networks (LANs) to operate without cables and wiring

Distinct229
Distinct (%)< 0.1%
Missing1100
Missing (%)< 0.1%
Memory size371.7 MiB
2024-11-23T03:31:22.552052image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length126
Median length98
Mean length41.86799999
Min length7

Characters and Unicode

Total characters281814429
Distinct characters36
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row['WPA2-PSK-CCMP', 'ESS', 'WPS']
2nd row['WPA2-PSK-CCMP', 'ESS', 'WPS']
3rd row['WPA2-PSK-CCMP', 'ESS', 'WPS']
4th row['WPA2-PSK-CCMP', 'ESS', 'WPS']
5th row['WPA2-PSK-CCMP', 'ESS', 'WPS']
ValueCountFrequency (%)
ess 6727904
29.6%
wps 3602685
15.8%
wpa2-psk-ccmp 3426603
15.1%
rsn-psk-ccmp 2855716
12.6%
wfa-ht 1319875
 
5.8%
wpa2-psk-ccmp+tkip 712547
 
3.1%
wpa-psk-ccmp+tkip 699598
 
3.1%
rsn-psk-ccmp+tkip 529806
 
2.3%
wpa2-psk-tkip+ccmp 417370
 
1.8%
wpa-psk-tkip+ccmp 404822
 
1.8%
Other values (43) 2049223
 
9.0%
2024-11-23T03:31:22.804046image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
' 45492298
16.1%
P 34456077
12.2%
S 31218260
11.1%
- 22564496
 
8.0%
C 20721078
 
7.4%
, 16015127
 
5.7%
16015127
 
5.7%
K 13403949
 
4.8%
W 11736624
 
4.2%
M 10354354
 
3.7%
Other values (26) 59837039
21.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 281814429
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
' 45492298
16.1%
P 34456077
12.2%
S 31218260
11.1%
- 22564496
 
8.0%
C 20721078
 
7.4%
, 16015127
 
5.7%
16015127
 
5.7%
K 13403949
 
4.8%
W 11736624
 
4.2%
M 10354354
 
3.7%
Other values (26) 59837039
21.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 281814429
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
' 45492298
16.1%
P 34456077
12.2%
S 31218260
11.1%
- 22564496
 
8.0%
C 20721078
 
7.4%
, 16015127
 
5.7%
16015127
 
5.7%
K 13403949
 
4.8%
W 11736624
 
4.2%
M 10354354
 
3.7%
Other values (26) 59837039
21.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 281814429
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
' 45492298
16.1%
P 34456077
12.2%
S 31218260
11.1%
- 22564496
 
8.0%
C 20721078
 
7.4%
, 16015127
 
5.7%
16015127
 
5.7%
K 13403949
 
4.8%
W 11736624
 
4.2%
M 10354354
 
3.7%
Other values (26) 59837039
21.2%

frequency
Real number (ℝ)

the WIFI frequency band, that includes two frequency ranges within the wireless spectrum that are designated to carry WIFI

Distinct36
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4023.710827
Minimum2412
Maximum5805
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size102.7 MiB
2024-11-23T03:31:22.925026image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum2412
5-th percentile2412
Q12442
median5180
Q35300
95-th percentile5580
Maximum5805
Range3393
Interquartile range (IQR)2858

Descriptive statistics

Standard deviation1470.487076
Coefficient of variation (CV)0.3654554563
Kurtosis-1.948085694
Mean4023.710827
Median Absolute Deviation (MAD)480
Skewness-0.1309870183
Sum2.708811218 × 1010
Variance2162332.242
MonotonicityNot monotonic
2024-11-23T03:31:23.058382image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
5540 851731
12.7%
2462 700220
 
10.4%
2412 677756
 
10.1%
2437 506615
 
7.5%
5180 425136
 
6.3%
5240 385935
 
5.7%
5220 290282
 
4.3%
2472 282668
 
4.2%
5200 281471
 
4.2%
5260 278818
 
4.1%
Other values (26) 2051490
30.5%
ValueCountFrequency (%)
2412 677756
10.1%
2417 83625
 
1.2%
2422 122435
 
1.8%
2427 131302
 
2.0%
2432 132383
 
2.0%
ValueCountFrequency (%)
5805 149650
2.2%
5785 11195
 
0.2%
5765 3566
 
0.1%
5745 7035
 
0.1%
5700 7222
 
0.1%

name
Text

the name assigned to the WIFI network

Distinct8096
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size513.6 MiB
2024-11-23T03:31:23.199261image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length64
Median length64
Mean length64
Min length64

Characters and Unicode

Total characters430855808
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique506 ?
Unique (%)< 0.1%

Sample

1st rowf9a426b1a333ea1ede29cbb41cc864e4bf1564d45f4f4ce560d0b89d2a56ea37
2nd rowf9a426b1a333ea1ede29cbb41cc864e4bf1564d45f4f4ce560d0b89d2a56ea37
3rd rowf9a426b1a333ea1ede29cbb41cc864e4bf1564d45f4f4ce560d0b89d2a56ea37
4th rowf9a426b1a333ea1ede29cbb41cc864e4bf1564d45f4f4ce560d0b89d2a56ea37
5th rowf9a426b1a333ea1ede29cbb41cc864e4bf1564d45f4f4ce560d0b89d2a56ea37
ValueCountFrequency (%)
2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27 795704
 
11.8%
fca569d2db8931ec7143045fcb7786f4a7cee5edf2e5942f2de40bcde103da9d 526338
 
7.8%
464a86174e1e242f0bc1df2bc7c1692bb4d929af7b86df3735fd75217ee55bb1 254398
 
3.8%
ae5462da1592283124a2308a42867d0c4c75089380950d6af44347784d362308 247906
 
3.7%
736bdef492b43906461e3fe1e4713592e343e82e50fa2697e7c1a36aa5255c2c 182997
 
2.7%
bf9ea07f15c100d18e6a96d713b8f52b32292f0cb9b1f40e555583f1feeb67e2 162392
 
2.4%
be7c05a6622deecd6cf2282b33923c543cd90d744fa7f93b60d6c3e10ee1cf37 147126
 
2.2%
f56cb17c2c3e3f75b152732782646453fc66b065083c3b30a2d052344729f90a 146532
 
2.2%
f6a4d260894be3fe6efb87ade1952ae9d6446208f72a6a062c7f43baddf94991 123980
 
1.8%
4a4c84f0d48e9ca09bd35e4ff9088c4b7278b09903b37bfd3902e464edc824e9 123237
 
1.8%
Other values (8086) 4021512
59.7%
2024-11-23T03:31:23.440503image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 31270072
 
7.3%
e 28998490
 
6.7%
7 28098124
 
6.5%
4 27970822
 
6.5%
6 27623076
 
6.4%
a 27551101
 
6.4%
9 27304725
 
6.3%
b 26934731
 
6.3%
3 26592761
 
6.2%
d 26592253
 
6.2%
Other values (6) 151919653
35.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 430855808
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 31270072
 
7.3%
e 28998490
 
6.7%
7 28098124
 
6.5%
4 27970822
 
6.5%
6 27623076
 
6.4%
a 27551101
 
6.4%
9 27304725
 
6.3%
b 26934731
 
6.3%
3 26592761
 
6.2%
d 26592253
 
6.2%
Other values (6) 151919653
35.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 430855808
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 31270072
 
7.3%
e 28998490
 
6.7%
7 28098124
 
6.5%
4 27970822
 
6.5%
6 27623076
 
6.4%
a 27551101
 
6.4%
9 27304725
 
6.3%
b 26934731
 
6.3%
3 26592761
 
6.2%
d 26592253
 
6.2%
Other values (6) 151919653
35.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 430855808
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 31270072
 
7.3%
e 28998490
 
6.7%
7 28098124
 
6.5%
4 27970822
 
6.5%
6 27623076
 
6.4%
a 27551101
 
6.4%
9 27304725
 
6.3%
b 26934731
 
6.3%
3 26592761
 
6.2%
d 26592253
 
6.2%
Other values (6) 151919653
35.3%

rssi
Real number (ℝ)

(Received Signal Strength Indicator) is an estimated measurement of how well a device can hear, detect, and receive signals from any wireless access point or Wi-Fi router. An RSSI closer to 0 is stronger, and closer to –100 is weaker.

Distinct81
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-89.20148328
Minimum-102
Maximum-22
Zeros0
Zeros (%)0.0%
Negative6732122
Negative (%)100.0%
Memory size102.7 MiB
2024-11-23T03:31:23.573315image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum-102
5-th percentile-94
Q1-92
median-90
Q3-87
95-th percentile-82
Maximum-22
Range80
Interquartile range (IQR)5

Descriptive statistics

Standard deviation4.438957034
Coefficient of variation (CV)-0.04976326481
Kurtosis16.51708478
Mean-89.20148328
Median Absolute Deviation (MAD)2
Skewness2.565676407
Sum-600515268
Variance19.70433955
MonotonicityNot monotonic
2024-11-23T03:31:23.713303image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-90 960485
14.3%
-89 751207
11.2%
-91 688804
10.2%
-92 661649
9.8%
-93 599535
8.9%
-88 501693
7.5%
-94 482521
7.2%
-87 377553
 
5.6%
-86 315436
 
4.7%
-85 290510
 
4.3%
Other values (71) 1102729
16.4%
ValueCountFrequency (%)
-102 11
 
< 0.1%
-101 25
 
< 0.1%
-100 147
 
< 0.1%
-99 2589
< 0.1%
-98 2802
< 0.1%
ValueCountFrequency (%)
-22 4
 
< 0.1%
-23 1
 
< 0.1%
-24 35
< 0.1%
-25 4
 
< 0.1%
-26 2
 
< 0.1%

Correlations

2024-11-23T03:31:23.804710image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
frequencyrssiuserid
frequency1.000-0.038-0.195
rssi-0.0381.000-0.088
userid-0.195-0.0881.000