Overview

Dataset statistics

Number of variables6
Number of observations114248
Missing cells0
Missing cells (%)0.0%
Total size in memory19.5 MiB
Average record size in memory179.0 B

Variable types

Text3
Numeric1
DateTime1
Boolean1

Dataset

Description[unitless] Returns information related to the WIFI network to which the phone is connected to, if connected will also report the WIFI network id. To compare each sensor observation, the frequency was reduced to one minute. The first non-missing name is reported for each of the categorical variables.
CreatorAndrea Bontempelli, Matteo Busso, Roy Alia Asiku
AuthorAndrea Bontempelli, Matteo Busso, Fausto Giunchiglia
URL
Copyright(c) University of Trento - Knowledge Diversity 2023

Variable descriptions

experimentidExperiment Id
useridUser id
timestampshow month(2), day(2), hour(2), minute(2), second(2), decimals(3)
bssid(Basic Service Set Identifier) Special SSID used to define a wireless computer network configured to communicate directly with each other without an access point.
isconnectedReturn if the phone is connected to the WIFI
ssid(Service Set Identifier) ID or unique identifier of a digital network (Wi-Fi or WLAN)

Alerts

experimentid has constant value "wenetItaly"Constant
isconnected is highly imbalanced (99.9%)Imbalance

Reproduction

Analysis started2024-11-23 11:27:41.207209
Analysis finished2024-11-23 11:27:42.202609
Duration1 second
Software versionydata-profiling v4.8.3
Download configurationconfig.json

Variables

experimentid
Text

CONSTANT 

Experiment Id

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.0 MiB
2024-11-23T12:27:42.298832image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters1142480
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowwenetItaly
2nd rowwenetItaly
3rd rowwenetItaly
4th rowwenetItaly
5th rowwenetItaly
ValueCountFrequency (%)
wenetitaly 114248
100.0%
2024-11-23T12:27:42.517662image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 228496
20.0%
t 228496
20.0%
w 114248
10.0%
n 114248
10.0%
I 114248
10.0%
a 114248
10.0%
l 114248
10.0%
y 114248
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 1142480
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 228496
20.0%
t 228496
20.0%
w 114248
10.0%
n 114248
10.0%
I 114248
10.0%
a 114248
10.0%
l 114248
10.0%
y 114248
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 1142480
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 228496
20.0%
t 228496
20.0%
w 114248
10.0%
n 114248
10.0%
I 114248
10.0%
a 114248
10.0%
l 114248
10.0%
y 114248
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 1142480
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 228496
20.0%
t 228496
20.0%
w 114248
10.0%
n 114248
10.0%
I 114248
10.0%
a 114248
10.0%
l 114248
10.0%
y 114248
10.0%

userid
Real number (ℝ)

User id

Distinct217
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean122.5404646
Minimum0
Maximum265
Zeros937
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size892.7 KiB
2024-11-23T12:27:42.659850image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5
Q118
median138
Q3191
95-th percentile255
Maximum265
Range265
Interquartile range (IQR)173

Descriptive statistics

Standard deviation88.60205449
Coefficient of variation (CV)0.7230432395
Kurtosis-1.483531
Mean122.5404646
Median Absolute Deviation (MAD)81
Skewness0.003767345016
Sum14000003
Variance7850.32406
MonotonicityIncreasing
2024-11-23T12:27:42.782276image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18 17378
 
15.2%
191 5860
 
5.1%
78 5850
 
5.1%
250 5359
 
4.7%
5 5239
 
4.6%
219 4918
 
4.3%
182 4495
 
3.9%
255 2497
 
2.2%
162 2319
 
2.0%
3 2304
 
2.0%
Other values (207) 58029
50.8%
ValueCountFrequency (%)
0 937
0.8%
1 14
 
< 0.1%
2 12
 
< 0.1%
3 2304
2.0%
4 12
 
< 0.1%
ValueCountFrequency (%)
265 4
 
< 0.1%
264 471
 
0.4%
263 1864
1.6%
262 109
 
0.1%
260 7
 
< 0.1%

timestamp
Date

show month(2), day(2), hour(2), minute(2), second(2), decimals(3)

Distinct114244
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size892.7 KiB
Minimum2020-11-16 07:02:16.403000
Maximum2020-12-11 21:59:35.810000
2024-11-23T12:27:42.903693image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-11-23T12:27:43.117454image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

bssid
Text

(Basic Service Set Identifier) Special SSID used to define a wireless computer network configured to communicate directly with each other without an access point.

Distinct2595
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size7.8 MiB
2024-11-23T12:27:43.291876image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length64
Median length64
Mean length64
Min length64

Characters and Unicode

Total characters7311872
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1030 ?
Unique (%)0.9%

Sample

1st row5f2ec65f9e70ea00d131a3dd3acd66bc118b0c339c43b7aa80e6e8e44f01d455
2nd rowab84e10c4cdfbafc53adca8e89db0a0e8cf22d8f223c2f2ceaddeacf2e4695de
3rd row5f2ec65f9e70ea00d131a3dd3acd66bc118b0c339c43b7aa80e6e8e44f01d455
4th rowab84e10c4cdfbafc53adca8e89db0a0e8cf22d8f223c2f2ceaddeacf2e4695de
5th row5f2ec65f9e70ea00d131a3dd3acd66bc118b0c339c43b7aa80e6e8e44f01d455
ValueCountFrequency (%)
eb57d76adf8de2cc9cc2921988feef2a3dd4d4fc4bfd5462bbc0589ca6237d11 3714
 
3.3%
b099a2ca18adb58456384ad54911cc0c785175f57106f7b690db18c5b191d476 3498
 
3.1%
daa114c15be1f5b95d419e130a451452948771a12d7bd8d1931beb9c612c20c3 2677
 
2.3%
0b33ec420ae5eb98d87f83a6f81ad4d78b99847796f5fefd3d3dedc58cf596b7 2673
 
2.3%
4b4ff394a7a995e40f660facbf4dd4db366489834575e74e59de5c2f78eb54b0 2488
 
2.2%
2d0a5849ec4055b48242e36db28593d15c87665bbb6d519e6bee7f557014717c 2486
 
2.2%
444a8f5ede2d8cc822673529437c8045889e7f06629a398be49c8426a256f7e0 2422
 
2.1%
02aff3913f0ae10d2ba49ca93683c555d43b4c483a26756c5e8001b95a35076f 2416
 
2.1%
72cf2d48d96beab992b7816ff711bc0f5dba0fd7f16c472daebbb172eb51939e 2402
 
2.1%
0d22707bbd8be66e4cecc414c31facaceb2adb105f2d8aaf8c5de7a472a9e9cf 2073
 
1.8%
Other values (2585) 87399
76.5%
2024-11-23T12:27:43.549002image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 501189
 
6.9%
5 499442
 
6.8%
9 476212
 
6.5%
e 464672
 
6.4%
d 464356
 
6.4%
7 459696
 
6.3%
8 458702
 
6.3%
c 457918
 
6.3%
a 456588
 
6.2%
1 451892
 
6.2%
Other values (6) 2621205
35.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 7311872
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
4 501189
 
6.9%
5 499442
 
6.8%
9 476212
 
6.5%
e 464672
 
6.4%
d 464356
 
6.4%
7 459696
 
6.3%
8 458702
 
6.3%
c 457918
 
6.3%
a 456588
 
6.2%
1 451892
 
6.2%
Other values (6) 2621205
35.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 7311872
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
4 501189
 
6.9%
5 499442
 
6.8%
9 476212
 
6.5%
e 464672
 
6.4%
d 464356
 
6.4%
7 459696
 
6.3%
8 458702
 
6.3%
c 457918
 
6.3%
a 456588
 
6.2%
1 451892
 
6.2%
Other values (6) 2621205
35.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 7311872
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
4 501189
 
6.9%
5 499442
 
6.8%
9 476212
 
6.5%
e 464672
 
6.4%
d 464356
 
6.4%
7 459696
 
6.3%
8 458702
 
6.3%
c 457918
 
6.3%
a 456588
 
6.2%
1 451892
 
6.2%
Other values (6) 2621205
35.8%

isconnected
Boolean

IMBALANCE 

Return if the phone is connected to the WIFI

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size111.7 KiB
True
114240 
False
 
8
ValueCountFrequency (%)
True 114240
> 99.9%
False 8
 
< 0.1%
2024-11-23T12:27:43.658449image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

ssid
Text

(Service Set Identifier) ID or unique identifier of a digital network (Wi-Fi or WLAN)

Distinct427
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.8 MiB
2024-11-23T12:27:43.752885image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length64
Median length64
Mean length64
Min length64

Characters and Unicode

Total characters7311872
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)0.1%

Sample

1st row822fdae77a423476651f5a80e9ce3667684917851f5411b9d7e9e885de2bdb9a
2nd row822fdae77a423476651f5a80e9ce3667684917851f5411b9d7e9e885de2bdb9a
3rd row822fdae77a423476651f5a80e9ce3667684917851f5411b9d7e9e885de2bdb9a
4th row822fdae77a423476651f5a80e9ce3667684917851f5411b9d7e9e885de2bdb9a
5th row822fdae77a423476651f5a80e9ce3667684917851f5411b9d7e9e885de2bdb9a
ValueCountFrequency (%)
d5bf6bd0a7796aa61000a368e13003a103f8e7e7fcaf9c3ace406b1c5c8ed46e 18945
 
16.6%
822fdae77a423476651f5a80e9ce3667684917851f5411b9d7e9e885de2bdb9a 8072
 
7.1%
91fcdc904deaace2c802c439e8912a6404a11b42023fa9a85a151becada7cda2 5350
 
4.7%
24fb8e36cb87143e38dcfe61b2e9c3070b95e9e7dceb4068ff42a7485496bc2b 5239
 
4.6%
c8c065e81b029712098fe21a5f77db42eb6f2a3cbdee08528d588d56159b1d57 4838
 
4.2%
2b01a9a813372175bac66c71c9e07c65891c39d8bdf2126b6f5730fcd54dcd60 4143
 
3.6%
51930f8f1f0eca735d11c25aa8e08bcb83825447db8046ad288082b663b253e1 3627
 
3.2%
b7cd3908e041c588ac3b359abbd0efd8812f440800f7aefd993544b934b1e30e 3487
 
3.1%
913afd539bd1cf5b0714c37a8ef080319b6c61af1994815188450a0462b157dc 2315
 
2.0%
3cf5236935da1c1321f6f2f751d3e4847115ffb4d4fac7c1830d99f2c95a2da4 2248
 
2.0%
Other values (417) 55984
49.0%
2024-11-23T12:27:43.967848image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 522746
 
7.1%
1 503012
 
6.9%
e 497576
 
6.8%
8 497087
 
6.8%
a 481123
 
6.6%
6 476022
 
6.5%
3 463710
 
6.3%
c 463281
 
6.3%
b 456069
 
6.2%
d 451584
 
6.2%
Other values (6) 2499662
34.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 7311872
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 522746
 
7.1%
1 503012
 
6.9%
e 497576
 
6.8%
8 497087
 
6.8%
a 481123
 
6.6%
6 476022
 
6.5%
3 463710
 
6.3%
c 463281
 
6.3%
b 456069
 
6.2%
d 451584
 
6.2%
Other values (6) 2499662
34.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 7311872
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 522746
 
7.1%
1 503012
 
6.9%
e 497576
 
6.8%
8 497087
 
6.8%
a 481123
 
6.6%
6 476022
 
6.5%
3 463710
 
6.3%
c 463281
 
6.3%
b 456069
 
6.2%
d 451584
 
6.2%
Other values (6) 2499662
34.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 7311872
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 522746
 
7.1%
1 503012
 
6.9%
e 497576
 
6.8%
8 497087
 
6.8%
a 481123
 
6.6%
6 476022
 
6.5%
3 463710
 
6.3%
c 463281
 
6.3%
b 456069
 
6.2%
d 451584
 
6.2%
Other values (6) 2499662
34.2%

Correlations

2024-11-23T12:27:44.043855image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
isconnecteduserid
isconnected1.0000.003
userid0.0031.000