Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 290618 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 35.4 MiB |
Average record size in memory | 127.6 B |
Variable types
Text | 3 |
---|---|
Numeric | 2 |
DateTime | 1 |
Dataset
Variable descriptions
experimentid | Experiment Id |
---|---|
userid | User id |
timestamp | Timestamp of the observation; shows month (2 digits), day (2), hour (2), minute (2), second (2), and decimals (3) |
cellid | The cell id |
dbm | The received signal strength in dBm (decibel-milliwatts). |
type | The technology type of the network (lte, wcdma, gsm, etc…) |
Alerts
experimentid has constant value "wenetDenmark" | Constant |
---|---|
dbm is highly skewed (γ1 = 23.17735007) | Skewed |
userid has 29840 (10.3%) zeros | Zeros |
Reproduction
Analysis started | 2024-11-23 02:34:20.810686 |
---|---|
Analysis finished | 2024-11-23 02:34:22.650984 |
Duration | 1.84 seconds |
Software version | ydata-profiling v4.8.3 |
Download configuration | config.json |
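A comparable report could probably be regenerated along the following lines. This is a minimal sketch, not the configuration used for this analysis: the CSV path, the output path, and the report title are hypothetical, and the settings stored in config.json are not reproduced here, so inferred variable types may differ from those listed above. It assumes `pandas` and `ydata-profiling` (ideally the v4.8.3 release named above) are installed.

```python
def build_profile(csv_path, html_path="profile.html"):
    """Profile a CSV file with ydata-profiling and write an HTML report.

    csv_path and html_path are hypothetical; the source does not name the
    input file. The column name "timestamp" comes from the variable list.
    """
    # Imported inside the function so that merely defining it does not
    # require the third-party packages to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the timestamp column as datetimes so it is profiled as a date.
    df = pd.read_csv(csv_path, parse_dates=["timestamp"])

    # Default settings; the original config.json is not applied here.
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
    return report
```

Calling `build_profile("signal.csv")` with a file that has the six columns described above would write `profile.html` next to the script.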
experimentid
Text
Experiment Id
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.5 MiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 3487416 |
---|---|
Distinct characters | 9 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | wenetDenmark |
---|---|
2nd row | wenetDenmark |
3rd row | wenetDenmark |
4th row | wenetDenmark |
5th row | wenetDenmark |
Value | Count | Frequency (%) |
wenetdenmark | 290618 | 100.0% |
Most occurring characters
Value | Count | Frequency (%) |
e | 871854 | 25.0% |
n | 581236 | 16.7% |
w | 290618 | 8.3% |
t | 290618 | 8.3% |
D | 290618 | 8.3% |
m | 290618 | 8.3% |
a | 290618 | 8.3% |
r | 290618 | 8.3% |
k | 290618 | 8.3% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 3487416 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 3487416 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 3487416 |
userid
Numeric
User id
Distinct | 17 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.27714044 |
Minimum | 0 |
---|---|
Maximum | 27 |
Zeros | 29840 |
Zeros (%) | 10.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 3 |
median | 6 |
Q3 | 23 |
95-th percentile | 26 |
Maximum | 27 |
Range | 27 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 9.847369968 |
---|---|
Coefficient of variation (CV) | 0.8732151579 |
Kurtosis | -1.629274916 |
Mean | 11.27714044 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.3511866412 |
Sum | 3277340 |
Variance | 96.9706953 |
Monotonicity | Increasing |
Histogram with fixed size bins (bins=17)
Common values
Value | Count | Frequency (%) |
3 | 50197 | 17.3% |
6 | 46982 | 16.2% |
23 | 43375 | 14.9% |
2 | 38469 | 13.2% |
0 | 29840 | 10.3% |
26 | 29309 | 10.1% |
17 | 19396 | 6.7% |
20 | 14536 | 5.0% |
25 | 8638 | 3.0% |
18 | 3443 | 1.2% |
Other values (7) | 6433 | 2.2% |
Minimum 5 values
Value | Count | Frequency (%) |
0 | 29840 | 10.3% |
2 | 38469 | 13.2% |
3 | 50197 | 17.3% |
6 | 46982 | 16.2% |
8 | 1476 | 0.5% |
Maximum 5 values
Value | Count | Frequency (%) |
27 | 1336 | 0.5% |
26 | 29309 | 10.1% |
25 | 8638 | 3.0% |
23 | 43375 | 14.9% |
22 | 211 | 0.1% |
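The Frequency (%) figures in these tables appear to be each count divided by the 290618 observations, rounded to one decimal, with very small and very large shares shown as "< 0.1%" and "> 99.9%". The helper below is a sketch of that rule as inferred from the figures in this report; it is not taken from the ydata-profiling source, and the name `format_share` is made up for illustration.

```python
def format_share(count, total):
    """Format count/total as a percentage string with one decimal place.

    Inferred rule (an assumption): non-zero shares that round to 0.0 are
    shown as "< 0.1%", and incomplete shares that round to 100.0 are shown
    as "> 99.9%".
    """
    pct = 100 * count / total
    rounded = round(pct, 1)
    if count > 0 and rounded == 0.0:
        return "< 0.1%"
    if count < total and rounded == 100.0:
        return "> 99.9%"
    return f"{rounded:.1f}%"


N_OBSERVATIONS = 290618  # "Number of observations" in the dataset statistics

zeros_share = format_share(29840, N_OBSERVATIONS)       # userid zeros
distinct_share = format_share(17, N_OBSERVATIONS)       # userid distinct values
timestamp_share = format_share(290610, N_OBSERVATIONS)  # timestamp distinct values
```

With the counts quoted in this report, the three calls give the same strings as the corresponding Zeros (%) and Distinct (%) cells.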
timestamp
Date
Timestamp of the observation; shows month (2 digits), day (2), hour (2), minute (2), second (2), and decimals (3)
Distinct | 290610 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 MiB |
Minimum | 2020-11-16 07:00:00.391000 |
---|---|
Maximum | 2020-12-11 21:59:42.994000 |
Histogram with fixed size bins (bins=50)
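The minimum and maximum above can be parsed with the standard library to work out how long the collection window was. This is a small sketch that assumes the raw values use the same `YYYY-MM-DD HH:MM:SS.ffffff` layout as the report prints; the underlying file may store them differently.

```python
from datetime import datetime

# Layout of the timestamps as printed in this report (an assumption about
# the raw data, which is not shown here).
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def parse_timestamp(text):
    """Parse one timestamp string into a datetime object."""
    return datetime.strptime(text, TIMESTAMP_FORMAT)


first = parse_timestamp("2020-11-16 07:00:00.391000")  # reported minimum
last = parse_timestamp("2020-12-11 21:59:42.994000")   # reported maximum

# Length of the observation window as a timedelta.
span = last - first
```

Here `span.days` is 25, so the observations cover a little under 26 days.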
cellid
Text
The cell id
Distinct | 1576 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.0 MiB |
Length
Max length | 64 |
---|---|
Median length | 64 |
Mean length | 64 |
Min length | 64 |
Characters and Unicode
Total characters | 18599552 |
---|---|
Distinct characters | 16 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 629 |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 |
---|---|
2nd row | 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 |
3rd row | 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 |
4th row | 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 |
5th row | 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 |
Value | Count | Frequency (%) |
577d62dbdbd880d5a686c8f5a444a6a02d197f8197896b64428177e632006377 | 198202 | 68.2% |
44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 | 37236 | 12.8% |
fbfe7b357545b694b593418c0252cd12f226714e7062daefbd7f940860260fa2 | 4377 | 1.5% |
0ea3a3e82671bf011f353c0fd7e18b33dc5be36f455628ade0b112499c207dee | 4374 | 1.5% |
ebb91593d8fa7dec356cb2ae99d305b92ad42b21cf14e61dfb45483591040eb8 | 3766 | 1.3% |
07f849465564f588324a6e67562fe2ecc53fc8b7741a1616a019e54b7c053eef | 2752 | 0.9% |
9ecc0bc4a79c0ae2c59a2b6c066f016b98f3e6830a90982e70de95886c99ada9 | 2177 | 0.7% |
bbc08f7dfa3ab56176b4b7e12c51a16d33c140f6caca26baaaf301b14b240f18 | 2162 | 0.7% |
1ecb31ff9603ac95499399a242e0c51714b57a6edcf8ad1186634819d042d593 | 1829 | 0.6% |
249243aae618c6531967517bdf2283e5615d8733e3164b21b427cf7f1b488020 | 1818 | 0.6% |
Other values (1566) | 31925 | 11.0% |
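Every cellid is 64 characters long and the column uses only 16 distinct characters, which is consistent with a lowercase hexadecimal encoding of a 256-bit value such as a hash digest. The report does not say how the ids were produced, so that reading is an assumption. A format check under that assumption could look like this:

```python
import re

# 64 lowercase hexadecimal characters. This pattern is an assumption drawn
# from the length and character statistics above.
HEX64 = re.compile(r"[0-9a-f]{64}")


def looks_like_hex_id(cell_id):
    """Return True if cell_id is exactly 64 lowercase hex characters."""
    return HEX64.fullmatch(cell_id) is not None


# First sample row shown in this report.
sample_id = "44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220"
sample_ok = looks_like_hex_id(sample_id)
```

A check like this can be run over the whole column to flag malformed ids before grouping by cell.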
Most occurring characters
Value | Count | Frequency (%) |
6 | 1975072 | 10.6% |
7 | 1904425 | 10.2% |
8 | 1709602 | 9.2% |
4 | 1548399 | 8.3% |
d | 1459968 | 7.8% |
2 | 1241700 | 6.7% |
0 | 1164455 | 6.3% |
a | 1072561 | 5.8% |
5 | 1000222 | 5.4% |
b | 973662 | 5.2% |
Other values (6) | 4549486 | 24.5% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 18599552 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 18599552 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 18599552 |
dbm
Numeric
The received signal strength in dBm (decibel-milliwatts).
Distinct | 98 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3975377.983 |
Minimum | -141 |
---|---|
Maximum | 2147483647 |
Zeros | 23 |
Zeros (%) | < 0.1% |
Negative | 290057 |
Negative (%) | 99.8% |
Memory size | 2.2 MiB |
Quantile statistics
Minimum | -141 |
---|---|
5-th percentile | -120 |
Q1 | -113 |
median | -106 |
Q3 | -93 |
95-th percentile | -75 |
Maximum | 2147483647 |
Range | 2147483788 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 92311998.84 |
---|---|
Coefficient of variation (CV) | 23.22093628 |
Kurtosis | 535.1932392 |
Mean | 3975377.983 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 23.17735007 |
Sum | 1.155316399 × 10¹² |
Variance | 8.52150513 × 10¹⁵ |
Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%) |
-113 | 60333 | 20.8% |
-120 | 28310 | 9.7% |
-97 | 8934 | 3.1% |
-101 | 8258 | 2.8% |
-103 | 7962 | 2.7% |
-109 | 7747 | 2.7% |
-107 | 7622 | 2.6% |
-105 | 7558 | 2.6% |
-111 | 7324 | 2.5% |
-95 | 7284 | 2.5% |
Other values (88) | 139286 | 47.9% |
Minimum 5 values
Value | Count | Frequency (%) |
-141 | 1 | < 0.1% |
-140 | 236 | 0.1% |
-139 | 51 | < 0.1% |
-138 | 31 | < 0.1% |
-137 | 48 | < 0.1% |
Maximum 5 values
Value | Count | Frequency (%) |
2147483647 | 538 | 0.2% |
0 | 23 | < 0.1% |
-40 | 1 | < 0.1% |
-47 | 1 | < 0.1% |
-48 | 12 | < 0.1% |
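The maximum, 2147483647, equals 2^31 − 1, the largest signed 32-bit integer. Its 538 occurrences look like a placeholder for "no reading" and not like real signal strengths, which would explain the extreme mean, variance, and skewness reported for dbm. The report itself does not confirm this, so treat it as an assumption. The sketch below separates such values before computing statistics; the function names and the toy sample are illustrative, and the skewness shown is the plain population formula, which may differ slightly from the estimator the profiler uses.

```python
INT32_MAX = 2**31 - 1  # 2147483647, assumed to mark an unavailable reading


def split_sentinel(values, sentinel=INT32_MAX):
    """Return (valid_values, number_of_sentinel_values)."""
    valid = [v for v in values if v != sentinel]
    return valid, len(values) - len(valid)


def population_skewness(values):
    """Third standardized moment: m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Toy sample for illustration only, not rows from the dataset.
sample = [-113, -120, -97, -101, -106, INT32_MAX]
valid, n_sentinel = split_sentinel(sample)

skew_with_sentinel = population_skewness(sample)
skew_without_sentinel = population_skewness(valid)
```

In the toy sample a single placeholder value is enough to push the skewness far to the right; once it is removed the remaining values are roughly symmetric. The 23 zero readings may deserve the same scrutiny, but the report gives no basis for deciding.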
type
Text
The technology type of the network (lte, wcdma, gsm, etc…)
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.2 MiB |
Length
Max length | 5 |
---|---|
Median length | 3 |
Mean length | 3.638542692 |
Min length | 3 |
Characters and Unicode
Total characters | 1057426 |
---|---|
Distinct characters | 10 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | lte |
---|---|
2nd row | lte |
3rd row | lte |
4th row | lte |
5th row | lte |
Value | Count | Frequency (%) |
lte | 132328 | 45.5% |
wcdma | 92786 | 31.9% |
gsm | 65504 | 22.5% |
Most occurring characters
Value | Count | Frequency (%) |
m | 158290 | 15.0% |
l | 132328 | 12.5% |
t | 132328 | 12.5% |
e | 132328 | 12.5% |
w | 92786 | 8.8% |
c | 92786 | 8.8% |
d | 92786 | 8.8% |
a | 92786 | 8.8% |
g | 65504 | 6.2% |
s | 65504 | 6.2% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1057426 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1057426 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1057426 |
Correlations
 | dbm | userid |
---|---|---|
dbm | 1.000 | 0.065 |
userid | 0.065 | 1.000 |