Dataset statistics
| Number of variables | 5 |
|---|---|
| Number of observations | 256875 |
| Missing cells | 74713 |
| Missing cells (%) | 5.8% |
| Total size in memory | 12.6 MiB |
| Average record size in memory | 51.3 B |
Variable types
| Text | 2 |
|---|---|
| Numeric | 1 |
| DateTime | 1 |
| Boolean | 1 |
Dataset
Variable descriptions
| experimentid | Experiment Id |
|---|---|
| userid | User id |
| timestamp | show month(2), day(2), hour(2), minute(2), second(2), decimals(3) |
| source | The charge source name |
| status | Return if the battery is charging |
experimentid has constant value "wenetItaly" | Constant |
source has 74713 (29.1%) missing values | Missing |
Reproduction
| Analysis started | 2024-11-23 06:05:17.978546 |
|---|---|
| Analysis finished | 2024-11-23 06:05:19.205048 |
| Duration | 1.23 second |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 2568750 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | wenetItaly |
|---|---|
| 2nd row | wenetItaly |
| 3rd row | wenetItaly |
| 4th row | wenetItaly |
| 5th row | wenetItaly |
| Value | Count | Frequency (%) |
| wenetitaly | 256875 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 513750 | |
| t | 513750 | |
| w | 256875 | |
| n | 256875 | |
| I | 256875 | |
| a | 256875 | |
| l | 256875 | |
| y | 256875 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2568750 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 513750 | |
| t | 513750 | |
| w | 256875 | |
| n | 256875 | |
| I | 256875 | |
| a | 256875 | |
| l | 256875 | |
| y | 256875 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2568750 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 513750 | |
| t | 513750 | |
| w | 256875 | |
| n | 256875 | |
| I | 256875 | |
| a | 256875 | |
| l | 256875 | |
| y | 256875 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2568750 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 513750 | |
| t | 513750 | |
| w | 256875 | |
| n | 256875 | |
| I | 256875 | |
| a | 256875 | |
| l | 256875 | |
| y | 256875 |
userid
Real number (ℝ)
User id
| Distinct | 219 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 128.0284925 |
| Minimum | 0 |
|---|---|
| Maximum | 265 |
| Zeros | 45 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 59 |
| median | 126 |
| Q3 | 212 |
| 95-th percentile | 254 |
| Maximum | 265 |
| Range | 265 |
| Interquartile range (IQR) | 153 |
Descriptive statistics
| Standard deviation | 82.18000846 |
|---|---|
| Coefficient of variation (CV) | 0.6418884335 |
| Kurtosis | -1.279889334 |
| Mean | 128.0284925 |
| Median Absolute Deviation (MAD) | 72 |
| Skewness | 0.09631016219 |
| Sum | 32887319 |
| Variance | 6753.553791 |
| Monotonicity | Increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 75 | 21808 | 8.5% |
| 217 | 20598 | 8.0% |
| 126 | 19847 | 7.7% |
| 2 | 15422 | 6.0% |
| 252 | 11052 | 4.3% |
| 65 | 5384 | 2.1% |
| 18 | 4960 | 1.9% |
| 258 | 4537 | 1.8% |
| 103 | 4288 | 1.7% |
| 163 | 3675 | 1.4% |
| Other values (209) | 145304 |
| Value | Count | Frequency (%) |
| 0 | 45 | < 0.1% |
| 1 | 725 | 0.3% |
| 2 | 15422 | |
| 3 | 152 | 0.1% |
| 4 | 367 | 0.1% |
| Value | Count | Frequency (%) |
| 265 | 1670 | |
| 264 | 8 | < 0.1% |
| 263 | 1383 | |
| 262 | 254 | 0.1% |
| 260 | 51 | < 0.1% |
timestamp
Date
show month(2), day(2), hour(2), minute(2), second(2), decimals(3)
| Distinct | 256859 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.0 MiB |
| Minimum | 2020-11-16 07:01:50.738000 |
|---|---|
| Maximum | 2020-12-11 21:58:01.238000 |
Histogram with fixed size bins (bins=50)
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 74713 |
| Missing (%) | 29.1% |
| Memory size | 4.0 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 11.59010661 |
| Min length | 11 |
Characters and Unicode
| Total characters | 2111277 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | charging_unknown |
|---|---|
| 2nd row | charging_ac |
| 3rd row | charging_ac |
| 4th row | charging_ac |
| 5th row | charging_ac |
| Value | Count | Frequency (%) |
| charging_ac | 142381 | |
| charging_usb | 14106 | 7.7% |
| charging_unknown | 14013 | 7.7% |
| charging_wifi | 11662 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| g | 364324 | |
| c | 324543 | |
| a | 324543 | |
| n | 224201 | |
| i | 205486 | |
| h | 182162 | |
| r | 182162 | |
| _ | 182162 | |
| u | 28119 | 1.3% |
| w | 25675 | 1.2% |
| Other values (5) | 67900 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2111277 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| g | 364324 | |
| c | 324543 | |
| a | 324543 | |
| n | 224201 | |
| i | 205486 | |
| h | 182162 | |
| r | 182162 | |
| _ | 182162 | |
| u | 28119 | 1.3% |
| w | 25675 | 1.2% |
| Other values (5) | 67900 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2111277 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| g | 364324 | |
| c | 324543 | |
| a | 324543 | |
| n | 224201 | |
| i | 205486 | |
| h | 182162 | |
| r | 182162 | |
| _ | 182162 | |
| u | 28119 | 1.3% |
| w | 25675 | 1.2% |
| Other values (5) | 67900 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2111277 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| g | 364324 | |
| c | 324543 | |
| a | 324543 | |
| n | 224201 | |
| i | 205486 | |
| h | 182162 | |
| r | 182162 | |
| _ | 182162 | |
| u | 28119 | 1.3% |
| w | 25675 | 1.2% |
| Other values (5) | 67900 | 3.2% |
status
Boolean
Return if the battery is charging
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 251.0 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 182162 | |
| False | 74713 |
| status | userid | |
|---|---|---|
| status | 1.000 | -0.023 |
| userid | -0.023 | 1.000 |