Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 8347758 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 851.6 MiB |
Average record size in memory | 107.0 B |
Variable types
Text | 2 |
---|---|
Numeric | 7 |
DateTime | 1 |
Dataset
Variable descriptions
experimentid | Experiment Id |
---|---|
userid | User id |
timestamp | show month(2), day(2), hour(2), minute(2), second(2), decimals(3) |
accuracy | The GPS accuracy in meters |
bearing | The compass direction from the current position the intended destination. Bearing is measured in degrees and calculated clockwise from true north (e.g., the bearing for the direction of east is 090°) |
latitude | Geographic coordinate that specifies the N/S position. Latitude is an angle which ranges from 0° at the Equator to 90° at the poles. It is expressed in sexadecimal notation. |
longitude | Geographic coordinate that specifies the E/W position. Longitude is an angle which ranges from 0° at the prime Meridian to 180°. It is expressed in sexadecimal notation |
altitude | Elevation above sea level in meters. |
provider | It indicates whether the coordinates were found using the network/Wi-Fi It indicates whether the coordinates were found using GPS |
speed | The speed of the device, measured in meters/second over ground |
experimentid has constant value "wenetDenmark" | Constant |
accuracy is highly overall correlated with bearing | High correlation |
altitude is highly overall correlated with bearing and 1 other fields | High correlation |
bearing is highly overall correlated with accuracy and 2 other fields | High correlation |
latitude is highly overall correlated with userid | High correlation |
speed is highly overall correlated with altitude and 1 other fields | High correlation |
userid is highly overall correlated with latitude | High correlation |
latitude is highly skewed (γ1 = -20.43444326) | Skewed |
userid has 355760 (4.3%) zeros | Zeros |
bearing has 3165485 (37.9%) zeros | Zeros |
speed has 3043352 (36.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-11-24 10:26:55.908496 |
---|---|
Analysis finished | 2024-11-24 10:28:14.514396 |
Duration | 1 minute and 18.61 seconds |
Software version | ydata-profiling v4.8.3 |
Download configuration | config.json |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 222.9 MiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 100173096 |
---|---|
Distinct characters | 9 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | wenetDenmark |
---|---|
2nd row | wenetDenmark |
3rd row | wenetDenmark |
4th row | wenetDenmark |
5th row | wenetDenmark |
Value | Count | Frequency (%) |
wenetdenmark | 8347758 |
Most occurring characters
Value | Count | Frequency (%) |
e | 25043274 | |
n | 16695516 | |
w | 8347758 | 8.3% |
t | 8347758 | 8.3% |
D | 8347758 | 8.3% |
m | 8347758 | 8.3% |
a | 8347758 | 8.3% |
r | 8347758 | 8.3% |
k | 8347758 | 8.3% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 100173096 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
e | 25043274 | |
n | 16695516 | |
w | 8347758 | 8.3% |
t | 8347758 | 8.3% |
D | 8347758 | 8.3% |
m | 8347758 | 8.3% |
a | 8347758 | 8.3% |
r | 8347758 | 8.3% |
k | 8347758 | 8.3% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 100173096 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
e | 25043274 | |
n | 16695516 | |
w | 8347758 | 8.3% |
t | 8347758 | 8.3% |
D | 8347758 | 8.3% |
m | 8347758 | 8.3% |
a | 8347758 | 8.3% |
r | 8347758 | 8.3% |
k | 8347758 | 8.3% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 100173096 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
e | 25043274 | |
n | 16695516 | |
w | 8347758 | 8.3% |
t | 8347758 | 8.3% |
D | 8347758 | 8.3% |
m | 8347758 | 8.3% |
a | 8347758 | 8.3% |
r | 8347758 | 8.3% |
k | 8347758 | 8.3% |
Distinct | 17 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.25540367 |
Minimum | 0 |
---|---|
Maximum | 27 |
Zeros | 355760 |
Zeros (%) | 4.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 127.4 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2 |
Q1 | 2 |
median | 6 |
Q3 | 23 |
95-th percentile | 23 |
Maximum | 27 |
Range | 27 |
Interquartile range (IQR) | 21 |
Descriptive statistics
Standard deviation | 9.820145598 |
---|---|
Coefficient of variation (CV) | 0.8724827549 |
Kurtosis | -1.795675052 |
Mean | 11.25540367 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 0.2430803855 |
Sum | 93957386 |
Variance | 96.43525957 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
2 | 2879409 | |
23 | 2574351 | |
17 | 888516 | 10.6% |
3 | 763339 | 9.1% |
6 | 540901 | 6.5% |
0 | 355760 | 4.3% |
26 | 122914 | 1.5% |
25 | 87108 | 1.0% |
27 | 56672 | 0.7% |
20 | 30343 | 0.4% |
Other values (7) | 48445 | 0.6% |
Value | Count | Frequency (%) |
0 | 355760 | 4.3% |
2 | 2879409 | |
3 | 763339 | 9.1% |
6 | 540901 | 6.5% |
8 | 3550 | < 0.1% |
Value | Count | Frequency (%) |
27 | 56672 | 0.7% |
26 | 122914 | 1.5% |
25 | 87108 | 1.0% |
23 | 2574351 | |
22 | 403 | < 0.1% |
timestamp
Date
show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
Distinct | 8335509 |
---|---|
Distinct (%) | 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 127.4 MiB |
Minimum | 2020-11-16 07:00:01.596000 |
---|---|
Maximum | 2020-12-11 21:59:59.410000 |
Distinct | 58219 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 56.12208393 |
Minimum | 0 |
---|---|
Maximum | 19491.63086 |
Zeros | 65 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 127.4 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 3.790092468 |
Q1 | 11.79199982 |
median | 16.07999992 |
Q3 | 23.59000015 |
95-th percentile | 500 |
Maximum | 19491.63086 |
Range | 19491.63086 |
Interquartile range (IQR) | 11.79800034 |
Descriptive statistics
Standard deviation | 160.84333 |
---|---|
Coefficient of variation (CV) | 2.865954339 |
Kurtosis | 242.8493051 |
Mean | 56.12208393 |
Median Absolute Deviation (MAD) | 5.420000076 |
Skewness | 8.021189504 |
Sum | 468493575.1 |
Variance | 25870.5768 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10.72000027 | 507904 | 6.1% |
9.648000717 | 397815 | 4.8% |
11.79199982 | 383980 | 4.6% |
500 | 291305 | 3.5% |
12.86400032 | 277304 | 3.3% |
13.93600082 | 203121 | 2.4% |
15.00800037 | 148263 | 1.8% |
3.21600008 | 133966 | 1.6% |
4.288000107 | 96780 | 1.2% |
3 | 87054 | 1.0% |
Other values (58209) | 5820266 |
Value | Count | Frequency (%) |
0 | 65 | < 0.1% |
0.75 | 47 | < 0.1% |
1 | 17275 | 0.2% |
1.5 | 24593 | 0.3% |
2 | 66553 |
Value | Count | Frequency (%) |
19491.63086 | 1 | |
15895.68066 | 1 | |
14766.28711 | 1 | |
13052.87207 | 1 | |
11670.86719 | 1 |
bearing
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
The compass direction from the current position the intended destination. Bearing is measured in degrees and calculated clockwise from true north (e.g., the bearing for the direction of east is 090°)
Distinct | 22006 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.71276922 |
Minimum | -1 |
---|---|
Maximum | 359.98 |
Zeros | 3165485 |
Zeros (%) | 37.9% |
Negative | 3868586 |
Negative (%) | 46.3% |
Memory size | 127.4 MiB |
Quantile statistics
Minimum | -1 |
---|---|
5-th percentile | -1 |
Q1 | -1 |
median | 0 |
Q3 | 0 |
95-th percentile | 247.5 |
Maximum | 359.98 |
Range | 360.98 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 78.21396772 |
---|---|
Coefficient of variation (CV) | 2.724013386 |
Kurtosis | 6.085894308 |
Mean | 28.71276922 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.693662012 |
Sum | 239687249 |
Variance | 6117.424746 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-1 | 3868586 | |
0 | 3165485 | |
57.69 | 1704 | < 0.1% |
215.4 | 1182 | < 0.1% |
181.8 | 1053 | < 0.1% |
234.4 | 1047 | < 0.1% |
78.7 | 1013 | < 0.1% |
256 | 1007 | < 0.1% |
171.5 | 989 | < 0.1% |
82.1 | 981 | < 0.1% |
Other values (21996) | 1304711 | 15.6% |
Value | Count | Frequency (%) |
-1 | 3868586 | |
0 | 3165485 | |
0.01 | 22 | < 0.1% |
0.06 | 5 | < 0.1% |
0.07 | 29 | < 0.1% |
Value | Count | Frequency (%) |
359.98 | 1 | < 0.1% |
359.95 | 9 | |
359.94 | 5 | |
359.93 | 2 | < 0.1% |
359.92 | 1 | < 0.1% |
latitude
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Geographic coordinate that specifies the N/S position. Latitude is an angle which ranges from 0° at the Equator to 90° at the poles. It is expressed in sexadecimal notation.
Distinct | 2400 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 55.67934835 |
Minimum | 49.1523 |
---|---|
Maximum | 57.073 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 127.4 MiB |
Quantile statistics
Minimum | 49.1523 |
---|---|
5-th percentile | 55.6498 |
Q1 | 55.6738 |
median | 55.6836 |
Q3 | 55.7049 |
95-th percentile | 55.7068 |
Maximum | 57.073 |
Range | 7.9207 |
Interquartile range (IQR) | 0.0311 |
Descriptive statistics
Standard deviation | 0.2782766924 |
---|---|
Coefficient of variation (CV) | 0.004997843916 |
Kurtosis | 439.5341958 |
Mean | 55.67934835 |
Median Absolute Deviation (MAD) | 0.0176 |
Skewness | -20.43444326 |
Sum | 464797725.6 |
Variance | 0.07743791753 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
55.705 | 1107618 | 13.3% |
55.7049 | 1030127 | 12.3% |
55.6739 | 909351 | 10.9% |
55.6738 | 669453 | 8.0% |
55.6977 | 345131 | 4.1% |
55.6836 | 338150 | 4.1% |
55.6835 | 236385 | 2.8% |
55.6976 | 235314 | 2.8% |
55.6894 | 196035 | 2.3% |
55.7068 | 172727 | 2.1% |
Other values (2390) | 3107467 |
Value | Count | Frequency (%) |
49.1523 | 2 | < 0.1% |
49.1585 | 3 | < 0.1% |
49.1586 | 9 | |
49.1591 | 6 | |
49.1592 | 6 |
Value | Count | Frequency (%) |
57.073 | 1 | < 0.1% |
57.0728 | 4 | < 0.1% |
57.0726 | 4 | < 0.1% |
57.0724 | 14 | |
57.0722 | 12 |
longitude
Real number (ℝ)
Geographic coordinate that specifies the E/W position. Longitude is an angle which ranges from 0° at the prime Meridian to 180°. It is expressed in sexadecimal notation
Distinct | 4380 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12.49972573 |
Minimum | 9.218 |
---|---|
Maximum | 12.6574 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 127.4 MiB |
Quantile statistics
Minimum | 9.218 |
---|---|
5-th percentile | 12.3587 |
Q1 | 12.5168 |
median | 12.5567 |
Q3 | 12.5804 |
95-th percentile | 12.5922 |
Maximum | 12.6574 |
Range | 3.4394 |
Interquartile range (IQR) | 0.0636 |
Descriptive statistics
Standard deviation | 0.3270433062 |
---|---|
Coefficient of variation (CV) | 0.02616403856 |
Kurtosis | 53.67855847 |
Mean | 12.49972573 |
Median Absolute Deviation (MAD) | 0.0319 |
Skewness | -7.306708426 |
Sum | 104344685.5 |
Variance | 0.1069573241 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12.5568 | 626481 | 7.5% |
12.5922 | 570335 | 6.8% |
12.5168 | 478753 | 5.7% |
12.5921 | 425770 | 5.1% |
12.5569 | 413810 | 5.0% |
12.557 | 389087 | 4.7% |
12.592 | 369964 | 4.4% |
12.5566 | 257748 | 3.1% |
12.5472 | 256489 | 3.1% |
12.5567 | 216578 | 2.6% |
Other values (4370) | 4342743 |
Value | Count | Frequency (%) |
9.218 | 2 | < 0.1% |
9.2237 | 12 | |
9.2239 | 6 | |
9.2243 | 6 | |
9.2254 | 6 |
Value | Count | Frequency (%) |
12.6574 | 91 | < 0.1% |
12.6573 | 277 | |
12.6572 | 104 | < 0.1% |
12.6571 | 73 | < 0.1% |
12.657 | 144 |
Distinct | 80078 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.96541956 |
Minimum | -775.0223 |
---|---|
Maximum | 1592 |
Zeros | 3313 |
Zeros (%) | < 0.1% |
Negative | 3918952 |
Negative (%) | 46.9% |
Memory size | 127.4 MiB |
Quantile statistics
Minimum | -775.0223 |
---|---|
5-th percentile | -1 |
Q1 | -1 |
median | 34 |
Q3 | 54 |
95-th percentile | 72 |
Maximum | 1592 |
Range | 2367.0223 |
Interquartile range (IQR) | 55 |
Descriptive statistics
Standard deviation | 33.52588441 |
---|---|
Coefficient of variation (CV) | 1.157445151 |
Kurtosis | 30.21041423 |
Mean | 28.96541956 |
Median Absolute Deviation (MAD) | 35 |
Skewness | 2.058292561 |
Sum | 241796312.9 |
Variance | 1123.984926 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-1 | 3869594 | |
52 | 215888 | 2.6% |
53 | 164625 | 2.0% |
54 | 162402 | 1.9% |
51 | 159207 | 1.9% |
50 | 144514 | 1.7% |
55 | 133617 | 1.6% |
56 | 121845 | 1.5% |
57 | 119441 | 1.4% |
49 | 112472 | 1.3% |
Other values (80068) | 3144153 |
Value | Count | Frequency (%) |
-775.0223 | 6 | |
-651.6061 | 8 | |
-558.2589 | 1 | < 0.1% |
-490.9298 | 8 | |
-432.1291 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1592 | 10 | |
1105.0639 | 8 | |
1027.3932 | 8 | |
1011.5348 | 8 | |
986.5196 | 7 |
provider
Text
It indicates whether the coordinates were found using the network/Wi-Fi It indicates whether the coordinates were found using GPS
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 182.8 MiB |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 6.965081403 |
Min length | 3 |
Characters and Unicode
Total characters | 58142814 |
---|---|
Distinct characters | 13 |
Distinct categories | 1 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | network |
---|---|
2nd row | passive |
3rd row | passive |
4th row | passive |
5th row | network |
Value | Count | Frequency (%) |
passive | 8109094 | |
network | 165791 | 2.0% |
gps | 72873 | 0.9% |
Most occurring characters
Value | Count | Frequency (%) |
s | 16291061 | |
e | 8274885 | |
p | 8181967 | |
a | 8109094 | |
i | 8109094 | |
v | 8109094 | |
n | 165791 | 0.3% |
t | 165791 | 0.3% |
w | 165791 | 0.3% |
o | 165791 | 0.3% |
Other values (3) | 404455 | 0.7% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 58142814 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
s | 16291061 | |
e | 8274885 | |
p | 8181967 | |
a | 8109094 | |
i | 8109094 | |
v | 8109094 | |
n | 165791 | 0.3% |
t | 165791 | 0.3% |
w | 165791 | 0.3% |
o | 165791 | 0.3% |
Other values (3) | 404455 | 0.7% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 58142814 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
s | 16291061 | |
e | 8274885 | |
p | 8181967 | |
a | 8109094 | |
i | 8109094 | |
v | 8109094 | |
n | 165791 | 0.3% |
t | 165791 | 0.3% |
w | 165791 | 0.3% |
o | 165791 | 0.3% |
Other values (3) | 404455 | 0.7% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 58142814 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
s | 16291061 | |
e | 8274885 | |
p | 8181967 | |
a | 8109094 | |
i | 8109094 | |
v | 8109094 | |
n | 165791 | 0.3% |
t | 165791 | 0.3% |
w | 165791 | 0.3% |
o | 165791 | 0.3% |
Other values (3) | 404455 | 0.7% |
speed
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
The speed of the device, measured in meters/second over ground
Distinct | 2535 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.4649905785 |
Minimum | -1 |
---|---|
Maximum | 81.5 |
Zeros | 3043352 |
Zeros (%) | 36.5% |
Negative | 3868586 |
Negative (%) | 46.3% |
Memory size | 127.4 MiB |
Quantile statistics
Minimum | -1 |
---|---|
5-th percentile | -0.009999999776 |
Q1 | -0.009999999776 |
median | 0 |
Q3 | 0 |
95-th percentile | 3.730000019 |
Maximum | 81.5 |
Range | 82.5 |
Interquartile range (IQR) | 0.009999999776 |
Descriptive statistics
Standard deviation | 2.09217548 |
---|---|
Coefficient of variation (CV) | 4.499393272 |
Kurtosis | 189.388445 |
Mean | 0.4649905785 |
Median Absolute Deviation (MAD) | 0.009999999776 |
Skewness | 11.24471986 |
Sum | 3881628.821 |
Variance | 4.377198241 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
-0.009999999776 | 3702795 | |
0 | 3043352 | |
-1 | 165791 | 2.0% |
0.3899999857 | 11024 | 0.1% |
0.2199999988 | 11006 | 0.1% |
0.5199999809 | 10693 | 0.1% |
0.4499999881 | 10588 | 0.1% |
1.159999967 | 10417 | 0.1% |
0.1899999976 | 10023 | 0.1% |
0.2599999905 | 9980 | 0.1% |
Other values (2525) | 1362089 | 16.3% |
Value | Count | Frequency (%) |
-1 | 165791 | 2.0% |
-0.009999999776 | 3702795 | |
0 | 3043352 | |
0.009999999776 | 7747 | 0.1% |
0.01999999955 | 5031 | 0.1% |
Value | Count | Frequency (%) |
81.5 | 18 | |
59.04000092 | 4 | < 0.1% |
58.40999985 | 2 | < 0.1% |
57.83000183 | 4 | < 0.1% |
53.95999908 | 2 | < 0.1% |
accuracy | altitude | bearing | latitude | longitude | speed | userid | |
---|---|---|---|---|---|---|---|
accuracy | 1.000 | -0.488 | -0.503 | -0.209 | 0.260 | -0.494 | 0.328 |
altitude | -0.488 | 1.000 | 0.826 | 0.290 | -0.275 | 0.815 | -0.402 |
bearing | -0.503 | 0.826 | 1.000 | 0.164 | -0.301 | 0.980 | -0.272 |
latitude | -0.209 | 0.290 | 0.164 | 1.000 | -0.116 | 0.144 | -0.697 |
longitude | 0.260 | -0.275 | -0.301 | -0.116 | 1.000 | -0.299 | 0.329 |
speed | -0.494 | 0.815 | 0.980 | 0.144 | -0.299 | 1.000 | -0.241 |
userid | 0.328 | -0.402 | -0.272 | -0.697 | 0.329 | -0.241 | 1.000 |