Overview

Dataset statistics

Number of variables: 8
Number of observations: 1409600
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 348.0 MiB
Average record size in memory: 258.9 B
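
The average record size follows from the total memory size and the number of observations. A minimal sketch of that arithmetic, assuming MiB means 2**20 bytes; the function and constant names are illustrative and not part of the report:

```python
# Figures copied from the dataset statistics above.
TOTAL_SIZE_MIB = 348.0
N_OBSERVATIONS = 1_409_600

BYTES_PER_MIB = 1024 ** 2  # assumption: 1 MiB = 2**20 bytes


def average_record_size(total_mib: float, n_rows: int) -> float:
    """Return the average number of bytes held in memory per row."""
    return total_mib * BYTES_PER_MIB / n_rows


avg_bytes = average_record_size(TOTAL_SIZE_MIB, N_OBSERVATIONS)
print(f"{avg_bytes:.1f} B per record")
```

The result rounds to the 258.9 B listed in the statistics.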

Variable types

Text: 4
Numeric: 3
DateTime: 1

Dataset

Description: [unitless] Returns all Wi-Fi networks detected by the smartphone. To make observations comparable across sensors, the sampling frequency was reduced to one minute. For each categorical variable, the first non-missing value is reported.
Creator: Andrea Bontempelli, Matteo Busso, Roy Alia Asiku
Author: Andrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright: (c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentid: Experiment Id
userid: User id
timestamp: time of the observation, formatted as month (2 digits), day (2), hour (2), minute (2), second (2), decimals (3)
address: a unique identifier assigned to a network interface controller (NIC) for use as a network address in communications within a network segment (the MAC address); values are stored as 64-character hexadecimal strings
capabilities: the capabilities advertised by the detected network, stored as a list of flags such as 'WPA2-PSK-CCMP', 'ESS' and 'WPS'
frequency: the Wi-Fi channel frequency in MHz, falling in one of the two ranges of the wireless spectrum designated to carry Wi-Fi (2.4 GHz and 5 GHz)
name: the name assigned to the Wi-Fi network, stored as a 64-character hexadecimal string
rssi: Received Signal Strength Indicator, an estimate of how well a device can hear, detect, and receive signals from a wireless access point or Wi-Fi router. An RSSI closer to 0 is stronger; closer to -100 is weaker.

Alerts

experimentid has constant value "wenetIndia" [Constant]
frequency is highly correlated overall with rssi and 1 other field (userid) [High correlation]
rssi is highly correlated overall with frequency and 1 other field (userid) [High correlation]
userid is highly correlated overall with frequency and 1 other field (rssi) [High correlation]
userid has 772157 (54.8%) zeros [Zeros]
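
The percentage in the zeros alert is the zero count divided by the number of observations. A small sketch of that check, using the two counts from the report; the helper name is hypothetical:

```python
def share_percent(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * count / total


ZEROS = 772_157          # rows where userid equals 0
N_OBSERVATIONS = 1_409_600

zero_share = share_percent(ZEROS, N_OBSERVATIONS)
print(f"userid zeros: {zero_share:.1f}%")
```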

Reproduction

Analysis started: 2024-11-22 13:01:25.889709
Analysis finished: 2024-11-22 13:01:36.930935
Duration: 11.04 seconds
Software version: ydata-profiling v4.8.3
Download configuration: config.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 35.0 MiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 14096000
Distinct characters: 8
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: wenetIndia
2nd row: wenetIndia
3rd row: wenetIndia
4th row: wenetIndia
5th row: wenetIndia

Value       Count    Frequency (%)
wenetindia  1409600  100.0%

Most occurring characters

Value  Count    Frequency (%)
e      2819200  20.0%
n      2819200  20.0%
w      1409600  10.0%
t      1409600  10.0%
I      1409600  10.0%
d      1409600  10.0%
i      1409600  10.0%
a      1409600  10.0%
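
The character counts above can be reproduced from the single constant value. A minimal sketch, assuming every one of the 1409600 rows holds the string "wenetIndia" (as the constant-value alert and the sample rows indicate); the function name is illustrative:

```python
from collections import Counter

VALUE = "wenetIndia"   # the only value of experimentid
N_ROWS = 1_409_600     # number of observations in the report


def character_table(value: str, n_rows: int) -> list[tuple[str, int, float]]:
    """Return (character, total count, share in %) rows, most frequent first."""
    per_value = Counter(value)
    total_chars = len(value) * n_rows
    return [
        (char, count * n_rows, 100.0 * count * n_rows / total_chars)
        for char, count in per_value.most_common()
    ]


table = character_table(VALUE, N_ROWS)
for char, count, share in table:
    print(f"{char}  {count}  {share:.1f}%")
```

Upper-case "I" and lower-case "i" are counted separately, which is why the value has 8 distinct characters across its 10 positions.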

Most occurring categories, scripts and blocks

Value      Count     Frequency (%)
(unknown)  14096000  100.0%

The category, script and block breakdowns each report a single (unknown) group; the most frequent characters per category, per script and per block are identical to the table of most occurring characters.

userid
Real number (ℝ)

HIGH CORRELATION  ZEROS 

User id

Distinct: 14
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.631676362
Minimum: 0
Maximum: 57
Zeros: 772157
Zeros (%): 54.8%
Negative: 0
Negative (%): 0.0%
Memory size: 21.5 MiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 0
Q3: 17
95th percentile: 44
Maximum: 57
Range: 57
Interquartile range (IQR): 17

Descriptive statistics

Standard deviation: 14.90240192
Coefficient of variation (CV): 1.547228267
Kurtosis: 2.635286839
Mean: 9.631676362
Median Absolute Deviation (MAD): 0
Skewness: 1.848083664
Sum: 13576811
Variance: 222.0815831
Monotonicity: Increasing
[Figure: histogram with fixed-size bins (bins=14)]
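
Two of the descriptive statistics are derived from the others: the coefficient of variation is the standard deviation divided by the mean, and the variance is the standard deviation squared. A short sketch that re-derives both from the reported userid figures; the names are illustrative:

```python
# Values copied from the userid statistics above.
MEAN = 9.631676362
STD = 14.90240192


def coefficient_of_variation(std: float, mean: float) -> float:
    """Return the standard deviation expressed relative to the mean."""
    if mean == 0:
        raise ValueError("CV is undefined for a zero mean")
    return std / mean


cv = coefficient_of_variation(STD, MEAN)
variance = STD ** 2
print(f"CV = {cv:.6f}, variance = {variance:.4f}")
```

Because userid is an identifier rather than a measurement, these figures describe how rows are distributed across users, not a physical quantity.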

Common values

Value             Count   Frequency (%)
0                 772157  54.8%
9                 236648  16.8%
17                198958  14.1%
43                56948   4.0%
57                51792   3.7%
44                32445   2.3%
12                28009   2.0%
26                21849   1.6%
35                5413    0.4%
8                 2082    0.1%
Other values (4)  3299    0.2%

Minimum 5 values

Value  Count   Frequency (%)
0      772157  54.8%
8      2082    0.1%
9      236648  16.8%
12     28009   2.0%
17     198958  14.1%

Maximum 5 values

Value  Count  Frequency (%)
57     51792  3.7%
49     1622   0.1%
44     32445  2.3%
43     56948  4.0%
40     81     < 0.1%

timestamp
Date

Time of the observation, formatted as month (2 digits), day (2), hour (2), minute (2), second (2), decimals (3)

Distinct: 1409245
Distinct (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 21.5 MiB
Minimum: 2021-07-12 08:00:05.314000
Maximum: 2021-08-12 14:40:48.155000
[Figure: histogram with fixed-size bins (bins=50)]
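
The minimum and maximum timestamps bound the collection window. A minimal sketch that parses the two values as printed in the report and measures the span between them, assuming both are expressed in the same time zone (the report does not say which):

```python
from datetime import datetime

# Minimum and maximum timestamps copied from the report.
first = datetime.fromisoformat("2021-07-12 08:00:05.314000")
last = datetime.fromisoformat("2021-08-12 14:40:48.155000")

span = last - first  # a datetime.timedelta
hours, remainder = divmod(span.seconds, 3600)
minutes = remainder // 60
print(f"Collection window: {span.days} days, {hours} h {minutes} min")
```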

address
Text

A unique identifier assigned to a network interface controller (NIC) for use as a network address in communications within a network segment (the MAC address).

Distinct: 595
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 107.5 MiB

Length

Max length: 64
Median length: 64
Mean length: 64
Min length: 64

Characters and Unicode

Total characters: 90214400
Distinct characters: 16
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
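
The length and character statistics above (every value exactly 64 characters long, 16 distinct characters) have the shape of a hex-encoded 256-bit digest. The report does not say how the addresses were pseudonymised; the sketch below assumes SHA-256 purely for illustration, and both the function names and the example MAC address are made up:

```python
import hashlib

HEX_DIGITS = set("0123456789abcdef")


def pseudonymise(identifier: str) -> str:
    """Return the SHA-256 digest of an identifier as 64 lowercase hex characters.

    SHA-256 is an assumption; the report does not name the hashing scheme.
    """
    return hashlib.sha256(identifier.encode("utf-8")).hexdigest()


def looks_like_sha256(value: str) -> bool:
    """Check the shape only: 64 characters, all lowercase hexadecimal."""
    return len(value) == 64 and set(value) <= HEX_DIGITS


example = pseudonymise("00:11:22:33:44:55")  # made-up MAC address
print(len(example), looks_like_sha256(example))
```

A shape check like this cannot confirm which algorithm was used; it only rules out values that could not be such a digest.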

Unique

Unique: 81
Unique (%): < 0.1%

Sample

1st row: c6e05c297aafb929b82fdce46161a85fa6b8c165bdf488a1705339e03e448e6b
2nd row: 1e7dbd3f775652c65e896be1f6800b16d6cf1a690f5504b43e63c79ba39224d6
3rd row: c6e05c297aafb929b82fdce46161a85fa6b8c165bdf488a1705339e03e448e6b
4th row: 2a3d1a7488e88a81dc667411bf1256b5cfc1b217a584eb371dbf8b4fd2c6f156
5th row: 1e7dbd3f775652c65e896be1f6800b16d6cf1a690f5504b43e63c79ba39224d6

Value                                                             Count   Frequency (%)
1e7dbd3f775652c65e896be1f6800b16d6cf1a690f5504b43e63c79ba39224d6  229336  16.3%
c6e05c297aafb929b82fdce46161a85fa6b8c165bdf488a1705339e03e448e6b  150974  10.7%
10cb68d4bba94cf2a450f04cebdd3fd1472119b4c2e23cffcfde3c2342df7a05  148025  10.5%
66d4413841419dd441229a2816cb7beb6c8675dc9e6e5d11095840c2d616b33e  143898  10.2%
772ffcaf9760d6479a9ed866c5faa4d7cb676f3d4e2de171659d412eadd64800  133902  9.5%
2a3d1a7488e88a81dc667411bf1256b5cfc1b217a584eb371dbf8b4fd2c6f156  100390  7.1%
65a6b9f991c535f62ed351146cd42c00513520d4752ab2bd0827a5b09ed76aff  69316   4.9%
a3d689bd790c8c0c7afbd69eac4286722d2fb5ffeda6f03e6a2c4d3a412d54fb  36583   2.6%
29ae08a4058a54e03fb306fd4e756209299fb883a7e53a92d259785928227a78  26387   1.9%
b211153b19ff531bc52218afa3d7ed3fca1a22faf763777c70331562c1878ff1  25720   1.8%
Other values (585)                                                345069  24.5%

Most occurring characters

Value             Count     Frequency (%)
6                 7562824   8.4%
d                 6578580   7.3%
1                 6409048   7.1%
4                 6053265   6.7%
f                 6013991   6.7%
2                 6004556   6.7%
b                 5727921   6.3%
c                 5657279   6.3%
5                 5467161   6.1%
9                 5167068   5.7%
Other values (6)  29572707  32.8%

Most occurring categories, scripts and blocks

Value      Count     Frequency (%)
(unknown)  90214400  100.0%

The category, script and block breakdowns each report a single (unknown) group; the most frequent characters per category, per script and per block are identical to the table of most occurring characters.

capabilities
Text

The capabilities advertised by the detected network, stored as a list of flags such as 'WPA2-PSK-CCMP', 'ESS' and 'WPS'.

Distinct: 82
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 87.2 MiB

Length

Max length: 254
Median length: 47
Mean length: 48.8575078
Min length: 7

Characters and Unicode

Total characters: 68869543
Distinct characters: 30
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: ['WPA2-PSK-CCMP', 'RSN-PSK-CCMP', 'ESS', 'WPS']
2nd row: ['WPA2-PSK-CCMP', 'RSN-PSK-CCMP', 'ESS', 'WPS']
3rd row: ['WPA2-PSK-CCMP', 'RSN-PSK-CCMP', 'ESS', 'WPS']
4th row: ['WPA2-PSK-CCMP', 'RSN-PSK-CCMP', 'ESS', 'WPS']
5th row: ['WPA2-PSK-CCMP', 'RSN-PSK-CCMP', 'ESS', 'WPS']

Value               Count    Frequency (%)
ess                 1409600  24.6%
wpa2-psk-ccmp       1292362  22.5%
wps                 1171520  20.4%
rsn-psk-ccmp        1034479  18.0%
wfa-ht              200030   3.5%
wfa-vht             158022   2.8%
wpa-psk-tkip        80065    1.4%
partial             76052    1.3%
wpa-psk-ccmp+tkip   70440    1.2%
wpa2-psk-ccmp+tkip  64064    1.1%
Other values (18)   180143   3.1%
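
Each capabilities value is the text form of a list, so counting individual flags requires parsing it first. A minimal sketch, assuming every value is a valid Python list literal like the sample rows; the function name is illustrative, and the second input row is made up for the example:

```python
import ast
from collections import Counter


def parse_capabilities(raw: str) -> list[str]:
    """Parse the string form of a list of capability flags."""
    flags = ast.literal_eval(raw)  # safe: evaluates literals only, never code
    if not isinstance(flags, list):
        raise ValueError(f"expected a list, got {type(flags).__name__}")
    return [str(flag) for flag in flags]


rows = [
    "['WPA2-PSK-CCMP', 'RSN-PSK-CCMP', 'ESS', 'WPS']",  # sample row from the report
    "['ESS']",                                           # made-up row for illustration
]

# Lower-case the flags so the keys match the word table in the report.
flag_counts = Counter(
    flag.lower() for raw in rows for flag in parse_capabilities(raw)
)
print(flag_counts.most_common())
```

Counting flags rather than whole strings is what lets a flag such as ess reach a count equal to the number of observations.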

Most occurring characters

Value              Count     Frequency (%)
'                  11473554  16.7%
P                  8548220   12.4%
S                  7831635   11.4%
-                  5793577   8.4%
C                  5256596   7.6%
,                  4327177   6.3%
(space)            4327177   6.3%
W                  3122008   4.5%
K                  3070697   4.5%
M                  2628168   3.8%
Other values (20)  12490734  18.1%

Most occurring categories, scripts and blocks

Value      Count     Frequency (%)
(unknown)  68869543  100.0%

The category, script and block breakdowns each report a single (unknown) group; the most frequent characters per category, per script and per block are identical to the table of most occurring characters.

frequency
Real number (ℝ)

HIGH CORRELATION 

The Wi-Fi channel frequency in MHz, falling in one of the two ranges of the wireless spectrum designated to carry Wi-Fi (2.4 GHz and 5 GHz).

Distinct: 24
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4315.261725
Minimum: 2412
Maximum: 5825
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 21.5 MiB

Quantile statistics

Minimum: 2412
5th percentile: 2412
Q1: 2457
Median: 5180
Q3: 5785
95th percentile: 5785
Maximum: 5825
Range: 3413
Interquartile range (IQR): 3328

Descriptive statistics

Standard deviation: 1560.739531
Coefficient of variation (CV): 0.3616789967
Kurtosis: -1.829303608
Mean: 4315.261725
Median Absolute Deviation (MAD): 605
Skewness: -0.3311095279
Sum: 6082792928
Variance: 2435907.885
Monotonicity: Not monotonic
[Figure: histogram with fixed-size bins (bins=24)]

Common values

Value              Count   Frequency (%)
5785               412272  29.2%
2457               242137  17.2%
5180               225711  16.0%
5745               169821  12.0%
2437               85093   6.0%
2412               75147   5.3%
2462               47993   3.4%
2452               33928   2.4%
5220               27525   2.0%
2432               24846   1.8%
Other values (14)  65127   4.6%

Minimum 5 values

Value  Count  Frequency (%)
2412   75147  5.3%
2417   13627  1.0%
2422   8867   0.6%
2427   18979  1.3%
2432   24846  1.8%

Maximum 5 values

Value  Count   Frequency (%)
5825   18      < 0.1%
5805   19      < 0.1%
5785   412272  29.2%
5745   169821  12.0%
5300   2       < 0.1%
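
The frequency values cluster around 2.4 GHz and 5 GHz, so they can be grouped by band and channel. The sketch below uses the standard IEEE 802.11 channel numbering, which is general Wi-Fi knowledge and not something the report states; it assumes the values are channel centre frequencies in MHz, and the function names are illustrative:

```python
def wifi_band(freq_mhz: int) -> str:
    """Classify a channel centre frequency (MHz) into a Wi-Fi band."""
    if 2400 <= freq_mhz < 2500:
        return "2.4 GHz"
    if 5000 <= freq_mhz < 5900:
        return "5 GHz"
    return "other"


def wifi_channel(freq_mhz: int) -> int:
    """Map a centre frequency to its IEEE 802.11 channel number."""
    if 2412 <= freq_mhz <= 2472:
        return (freq_mhz - 2407) // 5   # channels 1-13, spaced 5 MHz apart
    if 5000 <= freq_mhz < 5900:
        return (freq_mhz - 5000) // 5   # e.g. 5180 MHz is channel 36
    raise ValueError(f"unsupported frequency: {freq_mhz} MHz")


for freq in (2412, 2457, 5180, 5785):
    print(freq, wifi_band(freq), wifi_channel(freq))
```

Treating frequency as a band or channel label rather than a continuous number may be more informative than the mean and variance, since only 24 distinct values occur.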

name
Text

The name assigned to the Wi-Fi network, stored as a 64-character hexadecimal string.

Distinct: 486
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 107.5 MiB

Length

Max length: 64
Median length: 64
Mean length: 64
Min length: 64

Characters and Unicode

Total characters: 90214400
Distinct characters: 16
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 64
Unique (%): < 0.1%

Sample

1st row: 2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27
2nd row: 9594f53a3d193f187fe45509c4d5c3ca6476ade80efb2b852d319617f99d69d5
3rd row: 2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27
4th row: 2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27
5th row: 9594f53a3d193f187fe45509c4d5c3ca6476ade80efb2b852d319617f99d69d5

Value                                                             Count   Frequency (%)
2a72f9e6961b786b0a9797003a4ce2ea88d6a7f57b30f5b298b6b4e85a5dcc27  342326  24.3%
9594f53a3d193f187fe45509c4d5c3ca6476ade80efb2b852d319617f99d69d5  229336  16.3%
dd09b5a7ca9a988843a7d93b9d56857f8cc52d29108d8759a4f279badf5b8bee  217341  15.4%
4a414ac3ba4c44985b38bb03cb0c449d99a708b3d4a5eb6e641e11d1bcf9dd75  143898  10.2%
9aae9f941f21ac53c20547ee2500865530401837935e9647f0f68bba0546ed80  133902  9.5%
74ffed59db12afd67e1cb25a997a48db6796272b762e7e139a8a4abf48f549dc  36583   2.6%
434911ed373feeeb6070d33500cc7a20774e87e165cba2a7f37974a1757688d9  26393   1.9%
097f33f9af04d4b9c0d19c5e96ebf1d8f475808a3033a6cd05f5d3f5c992d686  25720   1.8%
a02f9f3d9d90854bd3054584f1e37b25d52b622e436696032b87131d31174636  21446   1.5%
2dae6a8273df53c8821a86bd9de2d42e84cf633aab7d99835a936b738ac4687b  18947   1.3%
Other values (476)                                                213708  15.2%

Most occurring characters

Value             Count     Frequency (%)
9                 8210255   9.1%
5                 6899629   7.6%
a                 6738057   7.5%
8                 6518463   7.2%
7                 6421907   7.1%
b                 6246838   6.9%
d                 5761245   6.4%
4                 5526628   6.1%
3                 5047893   5.6%
6                 5005651   5.5%
Other values (6)  27837834  30.9%

Most occurring categories, scripts and blocks

Value      Count     Frequency (%)
(unknown)  90214400  100.0%

The category, script and block breakdowns each report a single (unknown) group; the most frequent characters per category, per script and per block are identical to the table of most occurring characters.

rssi
Real number (ℝ)

HIGH CORRELATION 

Received Signal Strength Indicator (RSSI): an estimate of how well a device can hear, detect, and receive signals from a wireless access point or Wi-Fi router. An RSSI closer to 0 is stronger; closer to -100 is weaker.
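
Because the RSSI scale is logarithmic, a gap of a few units corresponds to a large difference in received power. A minimal sketch of the conversion, assuming the values are expressed in dBm (the report marks the variable as unitless, so this is an assumption); the function names are illustrative:

```python
def dbm_to_milliwatts(dbm: float) -> float:
    """Convert a power level in dBm to milliwatts: P = 10 ** (dBm / 10)."""
    return 10 ** (dbm / 10)


def power_ratio(stronger_dbm: float, weaker_dbm: float) -> float:
    """Return how many times more power the stronger reading represents."""
    return dbm_to_milliwatts(stronger_dbm) / dbm_to_milliwatts(weaker_dbm)


# A 20 dB gap corresponds to roughly a hundredfold difference in power.
ratio = power_ratio(-60, -80)
print(f"-60 dBm carries about {ratio:.0f}x the power of -80 dBm")
```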

Distinct: 100
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -76.59347616
Minimum: -99
Maximum: 0
Zeros: 4
Zeros (%): < 0.1%
Negative: 1409596
Negative (%): > 99.9%
Memory size: 21.5 MiB

Quantile statistics

Minimum: -99
5th percentile: -90
Q1: -87
Median: -84
Q3: -68
95th percentile: -42
Maximum: 0
Range: 99
Interquartile range (IQR): 19

Descriptive statistics

Standard deviation: 14.93668714
Coefficient of variation (CV): -0.1950125244
Kurtosis: 0.5816614356
Mean: -76.59347616
Median Absolute Deviation (MAD): 5
Skewness: 1.260988344
Sum: -107966164
Variance: 223.1046228
Monotonicity: Not monotonic
[Figure: histogram with fixed-size bins (bins=50)]

Common values

Value              Count   Frequency (%)
-87                143018  10.1%
-88                131267  9.3%
-89                117667  8.3%
-86                115535  8.2%
-85                87960   6.2%
-90                67052   4.8%
-84                58318   4.1%
-83                43244   3.1%
-82                32469   2.3%
-81                25449   1.8%
Other values (90)  587621  41.7%

Minimum 5 values

Value  Count  Frequency (%)
-99    4      < 0.1%
-98    35     < 0.1%
-97    33     < 0.1%
-96    150    < 0.1%
-95    186    < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
0      4      < 0.1%
-1     4      < 0.1%
-2     3      < 0.1%
-3     48     < 0.1%
-4     12     < 0.1%

Correlations

           frequency  rssi    userid
frequency   1.000     -0.503  -0.602
rssi       -0.503      1.000   0.660
userid     -0.602      0.660   1.000
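
The high-correlation alerts can be related back to this matrix by listing the pairs whose absolute coefficient exceeds a cut-off. A minimal sketch; the 0.5 threshold is an assumption (the report does not state the value used), and the function name is illustrative:

```python
from itertools import combinations

NAMES = ["frequency", "rssi", "userid"]
MATRIX = [
    [1.000, -0.503, -0.602],
    [-0.503, 1.000, 0.660],
    [-0.602, 0.660, 1.000],
]
THRESHOLD = 0.5  # assumed cut-off; not stated in the report


def high_pairs(names, matrix, threshold):
    """Return (name_a, name_b, r) for every distinct pair with |r| >= threshold."""
    return [
        (names[i], names[j], matrix[i][j])
        for i, j in combinations(range(len(names)), 2)
        if abs(matrix[i][j]) >= threshold
    ]


pairs = high_pairs(NAMES, MATRIX, THRESHOLD)
for a, b, r in pairs:
    print(f"{a} ~ {b}: {r:+.3f}")
```

Under that assumption all three pairs qualify, which is consistent with each of the three numeric variables carrying a high-correlation alert. Since userid is an identifier, its coefficients reflect which users contributed which readings rather than a numeric relationship.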