Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 371 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 19 |
Duplicate rows (%) | 5.1% |
Total size in memory | 14.6 KiB |
Average record size in memory | 40.3 B |
Variable types
Numeric | 5 |
---|
Dataset has 19 (5.1%) duplicate rows | Duplicates |
Reproduction
Analysis started | 2021-05-28 13:53:33.106774 |
---|---|
Analysis finished | 2021-05-28 13:53:37.866704 |
Duration | 4.76 seconds |
Software version | pandas-profiling v2.11.0 |
Download configuration | config.yaml |
Bearing 1
Real number (ℝ≥0)
Distinct | 62 |
---|---|
Distinct (%) | 16.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.268463612 |
---|---|
Minimum | 1.3 |
Maximum | 14 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 3.0 KiB |
Quantile statistics
Minimum | 1.3 |
---|---|
5-th percentile | 4.1 |
Q1 | 4.7 |
median | 6.1 |
Q3 | 7 |
95-th percentile | 9.7 |
Maximum | 14 |
Range | 12.7 |
Interquartile range (IQR) | 2.3 |
Descriptive statistics
Standard deviation | 1.986277943 |
---|---|
Coefficient of variation (CV) | 0.3168683852 |
Kurtosis | 3.686907958 |
Mean | 6.268463612 |
Median Absolute Deviation (MAD) | 1.2 |
Skewness | 1.589447026 |
Sum | 2325.6 |
Variance | 3.945300066 |
Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
4.6 | 17 | 4.6% |
6.1 | 14 | 3.8% |
4.7 | 14 | 3.8% |
6.3 | 13 | 3.5% |
5.1 | 13 | 3.5% |
6.8 | 13 | 3.5% |
6.7 | 12 | 3.2% |
4.4 | 11 | 3.0% |
4.1 | 11 | 3.0% |
4.3 | 11 | 3.0% |
Other values (52) | 242 |
Value | Count | Frequency (%) |
1.3 | 1 | 0.3% |
3.6 | 2 | |
3.7 | 2 | |
3.8 | 4 | |
3.9 | 2 |
Value | Count | Frequency (%) |
14 | 5 | |
13 | 5 | |
12 | 4 | |
11 | 4 | |
9.9 | 1 | 0.3% |
Bearing 2
Real number (ℝ≥0)
Distinct | 60 |
---|---|
Distinct (%) | 16.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14.71239892 |
---|---|
Minimum | 1.3 |
Maximum | 47 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 3.0 KiB |
Quantile statistics
Minimum | 1.3 |
---|---|
5-th percentile | 7.5 |
Q1 | 10 |
median | 12 |
Q3 | 16 |
95-th percentile | 35.5 |
Maximum | 47 |
Range | 45.7 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 8.561728634 |
---|---|
Coefficient of variation (CV) | 0.5819396741 |
Kurtosis | 4.348909722 |
Mean | 14.71239892 |
Median Absolute Deviation (MAD) | 2.3 |
Skewness | 2.153116908 |
Sum | 5458.3 |
Variance | 73.3031972 |
Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
12 | 59 | |
11 | 52 | 14.0% |
13 | 32 | 8.6% |
14 | 20 | 5.4% |
10 | 18 | 4.9% |
16 | 15 | 4.0% |
8 | 10 | 2.7% |
15 | 10 | 2.7% |
18 | 10 | 2.7% |
17 | 9 | 2.4% |
Other values (50) | 136 |
Value | Count | Frequency (%) |
1.3 | 1 | 0.3% |
5.9 | 1 | 0.3% |
6.6 | 3 | |
6.9 | 2 | |
7 | 2 |
Value | Count | Frequency (%) |
47 | 2 | |
46 | 3 | |
45 | 2 | |
44 | 1 | 0.3% |
43 | 3 |
Bearing 3
Real number (ℝ≥0)
Distinct | 80 |
---|---|
Distinct (%) | 21.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.1410673854 |
---|---|
Minimum | 0.029 |
Maximum | 0.42 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 3.0 KiB |
Quantile statistics
Minimum | 0.029 |
---|---|
5-th percentile | 0.056 |
Q1 | 0.0865 |
median | 0.12 |
Q3 | 0.18 |
95-th percentile | 0.285 |
Maximum | 0.42 |
Range | 0.391 |
Interquartile range (IQR) | 0.0935 |
Descriptive statistics
Standard deviation | 0.07340383147 |
---|---|
Coefficient of variation (CV) | 0.5203458704 |
Kurtosis | 1.7163783 |
Mean | 0.1410673854 |
Median Absolute Deviation (MAD) | 0.041 |
Skewness | 1.237572961 |
Sum | 52.336 |
Variance | 0.005388122474 |
Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
0.12 | 33 | 8.9% |
0.13 | 27 | 7.3% |
0.15 | 19 | 5.1% |
0.11 | 16 | 4.3% |
0.16 | 16 | 4.3% |
0.2 | 15 | 4.0% |
0.1 | 13 | 3.5% |
0.14 | 13 | 3.5% |
0.21 | 12 | 3.2% |
0.19 | 10 | 2.7% |
Other values (70) | 197 |
Value | Count | Frequency (%) |
0.029 | 1 | |
0.037 | 2 | |
0.043 | 1 | |
0.044 | 1 | |
0.045 | 1 |
Value | Count | Frequency (%) |
0.42 | 3 | |
0.38 | 2 | |
0.37 | 2 | |
0.35 | 1 | 0.3% |
0.34 | 1 | 0.3% |
Axial Front
Real number (ℝ≥0)
Distinct | 75 |
---|---|
Distinct (%) | 20.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14.89326146 |
---|---|
Minimum | 0.4 |
Maximum | 40 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 3.0 KiB |
Quantile statistics
Minimum | 0.4 |
---|---|
5-th percentile | 3.7 |
Q1 | 7.65 |
median | 14 |
Q3 | 20 |
95-th percentile | 33 |
Maximum | 40 |
Range | 39.6 |
Interquartile range (IQR) | 12.35 |
Descriptive statistics
Standard deviation | 8.608605555 |
---|---|
Coefficient of variation (CV) | 0.5780201725 |
Kurtosis | -0.04253778304 |
Mean | 14.89326146 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.6401550473 |
Sum | 5525.4 |
Variance | 74.1080896 |
Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
20 | 37 | 10.0% |
14 | 26 | 7.0% |
16 | 20 | 5.4% |
19 | 18 | 4.9% |
21 | 17 | 4.6% |
15 | 17 | 4.6% |
10 | 15 | 4.0% |
13 | 15 | 4.0% |
33 | 13 | 3.5% |
23 | 10 | 2.7% |
Other values (65) | 183 |
Value | Count | Frequency (%) |
0.4 | 1 | |
0.5 | 2 | |
0.8 | 2 | |
1.5 | 1 | |
1.6 | 1 |
Value | Count | Frequency (%) |
40 | 3 | 0.8% |
37 | 5 | 1.3% |
33 | 13 | |
32 | 5 | 1.3% |
29 | 4 | 1.1% |
Radial Front
Real number (ℝ≥0)
Distinct | 72 |
---|---|
Distinct (%) | 19.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 57.71428571 |
---|---|
Minimum | 8.2 |
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 3.0 KiB |
Quantile statistics
Minimum | 8.2 |
---|---|
5-th percentile | 15.5 |
Q1 | 42 |
median | 52 |
Q3 | 83.5 |
95-th percentile | 97 |
Maximum | 100 |
Range | 91.8 |
Interquartile range (IQR) | 41.5 |
Descriptive statistics
Standard deviation | 25.01380129 |
---|---|
Coefficient of variation (CV) | 0.433407448 |
Kurtosis | -0.9940357449 |
Mean | 57.71428571 |
Median Absolute Deviation (MAD) | 16 |
Skewness | 0.09606051909 |
Sum | 21412 |
Variance | 625.6902548 |
Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%) |
94 | 22 | 5.9% |
46 | 19 | 5.1% |
90 | 18 | 4.9% |
44 | 16 | 4.3% |
45 | 11 | 3.0% |
87 | 11 | 3.0% |
76 | 11 | 3.0% |
97 | 10 | 2.7% |
61 | 10 | 2.7% |
52 | 10 | 2.7% |
Other values (62) | 233 |
Value | Count | Frequency (%) |
8.2 | 1 | 0.3% |
8.8 | 1 | 0.3% |
12 | 2 | |
13 | 2 | |
14 | 4 |
Value | Count | Frequency (%) |
100 | 1 | 0.3% |
99 | 9 | |
97 | 10 | |
96 | 1 | 0.3% |
95 | 2 | 0.5% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
Bearing 1 | Bearing 2 | Bearing 3 | Axial Front | Radial Front | |
---|---|---|---|---|---|
0 | 6.6 | 12.0 | 0.070 | 20.0 | 39.0 |
1 | 6.6 | 12.0 | 0.070 | 20.0 | 39.0 |
2 | 6.7 | 12.0 | 0.063 | 21.0 | 34.0 |
3 | 6.1 | 12.0 | 0.037 | 21.0 | 34.0 |
4 | 4.2 | 7.4 | 0.053 | 21.0 | 34.0 |
5 | 5.2 | 9.7 | 0.072 | 21.0 | 34.0 |
6 | 4.1 | 6.6 | 0.073 | 21.0 | 34.0 |
7 | 4.1 | 6.6 | 0.073 | 21.0 | 37.0 |
8 | 4.3 | 7.4 | 0.190 | 27.0 | 45.0 |
9 | 4.2 | 7.8 | 0.130 | 20.0 | 44.0 |
Last rows
Bearing 1 | Bearing 2 | Bearing 3 | Axial Front | Radial Front | |
---|---|---|---|---|---|
361 | 14.0 | 47.0 | 0.120 | 17.0 | 18.0 |
362 | 4.5 | 13.0 | 0.037 | 20.0 | 23.0 |
363 | 5.4 | 13.0 | 0.048 | 20.0 | 15.0 |
364 | 6.1 | 13.0 | 0.140 | 18.0 | 14.0 |
365 | 6.7 | 13.0 | 0.150 | 20.0 | 15.0 |
366 | 6.9 | 13.0 | 0.160 | 20.0 | 18.0 |
367 | 3.7 | 7.2 | 0.087 | 18.0 | 17.0 |
368 | 3.8 | 8.0 | 0.079 | 20.0 | 15.0 |
369 | 3.8 | 8.0 | 0.079 | 20.0 | 15.0 |
370 | 4.7 | 8.2 | 0.290 | 21.0 | 13.0 |
Most frequent
Bearing 1 | Bearing 2 | Bearing 3 | Axial Front | Radial Front | count | |
---|---|---|---|---|---|---|
15 | 7.8 | 12.0 | 0.120 | 33.0 | 94.0 | 4 |
0 | 3.8 | 8.0 | 0.079 | 20.0 | 15.0 | 2 |
1 | 3.9 | 9.0 | 0.190 | 6.4 | 40.0 | 2 |
2 | 4.2 | 7.0 | 0.150 | 20.0 | 43.0 | 2 |
3 | 4.4 | 11.0 | 0.260 | 7.7 | 46.0 | 2 |
4 | 4.6 | 13.0 | 0.180 | 7.4 | 54.0 | 2 |
5 | 4.7 | 12.0 | 0.150 | 25.0 | 56.0 | 2 |
6 | 5.1 | 12.0 | 0.200 | 9.9 | 43.0 | 2 |
7 | 5.1 | 14.0 | 0.220 | 11.0 | 46.0 | 2 |
8 | 5.8 | 11.0 | 0.073 | 19.0 | 47.0 | 2 |