Dataset statistics
| Number of variables | 6 |
|---|---|
| Number of observations | 290618 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Total size in memory | 35.4 MiB |
| Average record size in memory | 127.6 B |
Variable types
| Text | 3 |
|---|---|
| Numeric | 2 |
| DateTime | 1 |
Dataset
Variable descriptions
| experimentid | Experiment Id |
|---|---|
| userid | User id |
| timestamp | show month(2), day(2), hour(2), minute(2), second(2), decimals(3) |
| cellid | The cell id |
| dbm | (DeciBel-Milliwatts)The received signal strength. |
| type | The technology type of the network (lte, wcdma, gsm, etc…) |
experimentid has constant value "wenetDenmark" | Constant |
dbm is highly skewed (γ1 = 23.17735007) | Skewed |
userid has 29840 (10.3%) zeros | Zeros |
Reproduction
| Analysis started | 2024-11-23 02:34:20.810686 |
|---|---|
| Analysis finished | 2024-11-23 02:34:22.650984 |
| Duration | 1.84 second |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 3487416 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | wenetDenmark |
|---|---|
| 2nd row | wenetDenmark |
| 3rd row | wenetDenmark |
| 4th row | wenetDenmark |
| 5th row | wenetDenmark |
| Value | Count | Frequency (%) |
| wenetdenmark | 290618 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 871854 | |
| n | 581236 | |
| w | 290618 | 8.3% |
| t | 290618 | 8.3% |
| D | 290618 | 8.3% |
| m | 290618 | 8.3% |
| a | 290618 | 8.3% |
| r | 290618 | 8.3% |
| k | 290618 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3487416 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 871854 | |
| n | 581236 | |
| w | 290618 | 8.3% |
| t | 290618 | 8.3% |
| D | 290618 | 8.3% |
| m | 290618 | 8.3% |
| a | 290618 | 8.3% |
| r | 290618 | 8.3% |
| k | 290618 | 8.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3487416 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 871854 | |
| n | 581236 | |
| w | 290618 | 8.3% |
| t | 290618 | 8.3% |
| D | 290618 | 8.3% |
| m | 290618 | 8.3% |
| a | 290618 | 8.3% |
| r | 290618 | 8.3% |
| k | 290618 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3487416 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 871854 | |
| n | 581236 | |
| w | 290618 | 8.3% |
| t | 290618 | 8.3% |
| D | 290618 | 8.3% |
| m | 290618 | 8.3% |
| a | 290618 | 8.3% |
| r | 290618 | 8.3% |
| k | 290618 | 8.3% |
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.27714044 |
| Minimum | 0 |
|---|---|
| Maximum | 27 |
| Zeros | 29840 |
| Zeros (%) | 10.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 6 |
| Q3 | 23 |
| 95-th percentile | 26 |
| Maximum | 27 |
| Range | 27 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 9.847369968 |
|---|---|
| Coefficient of variation (CV) | 0.8732151579 |
| Kurtosis | -1.629274916 |
| Mean | 11.27714044 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.3511866412 |
| Sum | 3277340 |
| Variance | 96.9706953 |
| Monotonicity | Increasing |
Histogram with fixed size bins (bins=17)
| Value | Count | Frequency (%) |
| 3 | 50197 | |
| 6 | 46982 | |
| 23 | 43375 | |
| 2 | 38469 | |
| 0 | 29840 | |
| 26 | 29309 | |
| 17 | 19396 | 6.7% |
| 20 | 14536 | 5.0% |
| 25 | 8638 | 3.0% |
| 18 | 3443 | 1.2% |
| Other values (7) | 6433 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 29840 | |
| 2 | 38469 | |
| 3 | 50197 | |
| 6 | 46982 | |
| 8 | 1476 | 0.5% |
| Value | Count | Frequency (%) |
| 27 | 1336 | 0.5% |
| 26 | 29309 | |
| 25 | 8638 | 3.0% |
| 23 | 43375 | |
| 22 | 211 | 0.1% |
timestamp
Date
show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
| Distinct | 290610 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| Minimum | 2020-11-16 07:00:00.391000 |
|---|---|
| Maximum | 2020-12-11 21:59:42.994000 |
Histogram with fixed size bins (bins=50)
cellid
Text
The cell id
| Distinct | 1576 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.0 MiB |
Length
| Max length | 64 |
|---|---|
| Median length | 64 |
| Mean length | 64 |
| Min length | 64 |
Characters and Unicode
| Total characters | 18599552 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 629 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 |
|---|---|
| 2nd row | 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 |
| 3rd row | 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 |
| 4th row | 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 |
| 5th row | 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 |
| Value | Count | Frequency (%) |
| 577d62dbdbd880d5a686c8f5a444a6a02d197f8197896b64428177e632006377 | 198202 | |
| 44e5ce649ed7651f28a9cb5e544db44ec24024a88f573191f30679bf62eb0220 | 37236 | 12.8% |
| fbfe7b357545b694b593418c0252cd12f226714e7062daefbd7f940860260fa2 | 4377 | 1.5% |
| 0ea3a3e82671bf011f353c0fd7e18b33dc5be36f455628ade0b112499c207dee | 4374 | 1.5% |
| ebb91593d8fa7dec356cb2ae99d305b92ad42b21cf14e61dfb45483591040eb8 | 3766 | 1.3% |
| 07f849465564f588324a6e67562fe2ecc53fc8b7741a1616a019e54b7c053eef | 2752 | 0.9% |
| 9ecc0bc4a79c0ae2c59a2b6c066f016b98f3e6830a90982e70de95886c99ada9 | 2177 | 0.7% |
| bbc08f7dfa3ab56176b4b7e12c51a16d33c140f6caca26baaaf301b14b240f18 | 2162 | 0.7% |
| 1ecb31ff9603ac95499399a242e0c51714b57a6edcf8ad1186634819d042d593 | 1829 | 0.6% |
| 249243aae618c6531967517bdf2283e5615d8733e3164b21b427cf7f1b488020 | 1818 | 0.6% |
| Other values (1566) | 31925 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 1975072 | |
| 7 | 1904425 | |
| 8 | 1709602 | 9.2% |
| 4 | 1548399 | 8.3% |
| d | 1459968 | 7.8% |
| 2 | 1241700 | 6.7% |
| 0 | 1164455 | 6.3% |
| a | 1072561 | 5.8% |
| 5 | 1000222 | 5.4% |
| b | 973662 | 5.2% |
| Other values (6) | 4549486 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 18599552 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 6 | 1975072 | |
| 7 | 1904425 | |
| 8 | 1709602 | 9.2% |
| 4 | 1548399 | 8.3% |
| d | 1459968 | 7.8% |
| 2 | 1241700 | 6.7% |
| 0 | 1164455 | 6.3% |
| a | 1072561 | 5.8% |
| 5 | 1000222 | 5.4% |
| b | 973662 | 5.2% |
| Other values (6) | 4549486 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 18599552 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 6 | 1975072 | |
| 7 | 1904425 | |
| 8 | 1709602 | 9.2% |
| 4 | 1548399 | 8.3% |
| d | 1459968 | 7.8% |
| 2 | 1241700 | 6.7% |
| 0 | 1164455 | 6.3% |
| a | 1072561 | 5.8% |
| 5 | 1000222 | 5.4% |
| b | 973662 | 5.2% |
| Other values (6) | 4549486 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 18599552 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 6 | 1975072 | |
| 7 | 1904425 | |
| 8 | 1709602 | 9.2% |
| 4 | 1548399 | 8.3% |
| d | 1459968 | 7.8% |
| 2 | 1241700 | 6.7% |
| 0 | 1164455 | 6.3% |
| a | 1072561 | 5.8% |
| 5 | 1000222 | 5.4% |
| b | 973662 | 5.2% |
| Other values (6) | 4549486 |
| Distinct | 98 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3975377.983 |
| Minimum | -141 |
|---|---|
| Maximum | 2147483647 |
| Zeros | 23 |
| Zeros (%) | < 0.1% |
| Negative | 290057 |
| Negative (%) | 99.8% |
| Memory size | 2.2 MiB |
Quantile statistics
| Minimum | -141 |
|---|---|
| 5-th percentile | -120 |
| Q1 | -113 |
| median | -106 |
| Q3 | -93 |
| 95-th percentile | -75 |
| Maximum | 2147483647 |
| Range | 2147483788 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 92311998.84 |
|---|---|
| Coefficient of variation (CV) | 23.22093628 |
| Kurtosis | 535.1932392 |
| Mean | 3975377.983 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 23.17735007 |
| Sum | 1.155316399 × 1012 |
| Variance | 8.52150513 × 1015 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -113 | 60333 | |
| -120 | 28310 | 9.7% |
| -97 | 8934 | 3.1% |
| -101 | 8258 | 2.8% |
| -103 | 7962 | 2.7% |
| -109 | 7747 | 2.7% |
| -107 | 7622 | 2.6% |
| -105 | 7558 | 2.6% |
| -111 | 7324 | 2.5% |
| -95 | 7284 | 2.5% |
| Other values (88) | 139286 |
| Value | Count | Frequency (%) |
| -141 | 1 | < 0.1% |
| -140 | 236 | |
| -139 | 51 | < 0.1% |
| -138 | 31 | < 0.1% |
| -137 | 48 | < 0.1% |
| Value | Count | Frequency (%) |
| 2147483647 | 538 | |
| 0 | 23 | < 0.1% |
| -40 | 1 | < 0.1% |
| -47 | 1 | < 0.1% |
| -48 | 12 | < 0.1% |
type
Text
The technology type of the network (lte, wcdma, gsm, etc…)
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.638542692 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1057426 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | lte |
|---|---|
| 2nd row | lte |
| 3rd row | lte |
| 4th row | lte |
| 5th row | lte |
| Value | Count | Frequency (%) |
| lte | 132328 | |
| wcdma | 92786 | |
| gsm | 65504 |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 158290 | |
| l | 132328 | |
| t | 132328 | |
| e | 132328 | |
| w | 92786 | |
| c | 92786 | |
| d | 92786 | |
| a | 92786 | |
| g | 65504 | |
| s | 65504 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1057426 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| m | 158290 | |
| l | 132328 | |
| t | 132328 | |
| e | 132328 | |
| w | 92786 | |
| c | 92786 | |
| d | 92786 | |
| a | 92786 | |
| g | 65504 | |
| s | 65504 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1057426 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| m | 158290 | |
| l | 132328 | |
| t | 132328 | |
| e | 132328 | |
| w | 92786 | |
| c | 92786 | |
| d | 92786 | |
| a | 92786 | |
| g | 65504 | |
| s | 65504 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1057426 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| m | 158290 | |
| l | 132328 | |
| t | 132328 | |
| e | 132328 | |
| w | 92786 | |
| c | 92786 | |
| d | 92786 | |
| a | 92786 | |
| g | 65504 | |
| s | 65504 |
| dbm | userid | |
|---|---|---|
| dbm | 1.000 | 0.065 |
| userid | 0.065 | 1.000 |