Dataset statistics
Number of variables | 24 |
---|---|
Number of observations | 991346 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 26 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 181.5 MiB |
Average record size in memory | 192.0 B |
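The memory figures above can be cross-checked with simple arithmetic. The sketch below recomputes the average record size from the total, and then, assuming each cell occupies 8 bytes (an assumption for illustration; the report does not state the column dtypes), the per-column memory size that the variable sections report.

```python
MIB = 1024 ** 2  # bytes per mebibyte

n_rows = 991_346           # "Number of observations"
n_cols = 24                # "Number of variables"
total_bytes = 181.5 * MIB  # "Total size in memory"

# Average record size: total memory divided by the number of rows.
avg_record_bytes = total_bytes / n_rows

# Assumed storage model (not stated in the report): 8 bytes per cell.
assumed_bytes_per_cell = 8
column_mib = n_rows * assumed_bytes_per_cell / MIB

print(f"{avg_record_bytes:.1f} B per record")
print(f"{column_mib:.1f} MiB per column")
```

Under that assumption, 24 columns of 8 bytes give 192 bytes per row, which agrees with the reported average record size.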
Variable types
Categorical | 5 |
---|---|
Numeric | 19 |
Dataset has 26 (< 0.1%) duplicate rows | Duplicates |
height is highly overall correlated with weight and 2 other fields | High correlation |
weight is highly overall correlated with height and 3 other fields | High correlation |
waistline is highly overall correlated with weight | High correlation |
sight_left is highly overall correlated with sight_right | High correlation |
sight_right is highly overall correlated with sight_left | High correlation |
sbp is highly overall correlated with dbp | High correlation |
dbp is highly overall correlated with sbp | High correlation |
tot_chole is highly overall correlated with ldl_chole | High correlation |
ldl_chole is highly overall correlated with tot_chole | High correlation |
hemoglobin is highly overall correlated with height and 2 other fields | High correlation |
sgot_ast is highly overall correlated with sgot_alt | High correlation |
sgot_alt is highly overall correlated with sgot_ast and 1 other field | High correlation |
gamma_gtp is highly overall correlated with sgot_alt | High correlation |
sex is highly overall correlated with height and 3 other fields | High correlation |
hear_left is highly overall correlated with hear_right | High correlation |
hear_right is highly overall correlated with hear_left | High correlation |
smk_stat_type_cd is highly overall correlated with sex | High correlation |
hear_left is highly imbalanced (79.8%) | Imbalance |
hear_right is highly imbalanced (80.3%) | Imbalance |
waistline is highly skewed (γ1 = 26.78843978) | Skewed |
hdl_chole is highly skewed (γ1 = 104.5776351) | Skewed |
serum_creatinine is highly skewed (γ1 = 111.022058) | Skewed |
sgot_ast is highly skewed (γ1 = 150.4916897) | Skewed |
sgot_alt is highly skewed (γ1 = 50.03887229) | Skewed |
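Skewness values as large as those in the alerts above usually point to a handful of extreme values rather than a long smooth tail. As a minimal sketch, the function below computes the moment-based skewness g1 = m3 / m2^1.5 in plain Python; the exact estimator used by the profiling tool may differ slightly (for example a bias correction), and the toy data are hypothetical, not taken from the dataset.

```python
def skewness(values):
    """Moment-based (population) skewness: m3 / m2 ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Hypothetical toy data: a symmetric sample, then the same sample
# with a single placeholder-like value appended.
clean = [70, 75, 80, 85, 90] * 200
with_outlier = clean + [999]

print(skewness(clean))         # symmetric sample, skewness of zero
print(skewness(with_outlier))  # one extreme value dominates the third moment
```

A single extreme value among a thousand ordinary ones is enough to push the statistic far from zero, which is why the extreme-value tables for each skewed variable are worth inspecting.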
Reproduction
Analysis started | 2023-11-25 02:29:24.903586 |
---|---|
Analysis finished | 2023-11-25 02:31:32.778612 |
Duration | 2 minutes and 7.88 seconds |
Software version | ydata-profiling v4.6.1 |
Download configuration | config.json |
sex
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.6 MiB |
Common Values
Value | Count | Frequency (%) |
Male | 526415 | 53.1% |
Female | 464931 | 46.9% |
Most occurring characters
Value | Count | Frequency (%) |
e | 1456277 | 29.7% |
a | 991346 | 20.3% |
l | 991346 | 20.3% |
M | 526415 | 10.8% |
F | 464931 | 9.5% |
m | 464931 | 9.5% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 3903900 | 79.7% |
Uppercase Letter | 991346 | 20.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 1456277 | 37.3% |
a | 991346 | 25.4% |
l | 991346 | 25.4% |
m | 464931 | 11.9% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 526415 | 53.1% |
F | 464931 | 46.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 4895246 | 100.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 1456277 | 29.7% |
a | 991346 | 20.3% |
l | 991346 | 20.3% |
M | 526415 | 10.8% |
F | 464931 | 9.5% |
m | 464931 | 9.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4895246 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 1456277 | 29.7% |
a | 991346 | 20.3% |
l | 991346 | 20.3% |
M | 526415 | 10.8% |
F | 464931 | 9.5% |
m | 464931 | 9.5% |
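The character tables for sex follow directly from the two category counts, because every row contributes the letters of either "Male" or "Female". A minimal sketch of that derivation, using only the counts reported above (the variable names are illustrative):

```python
from collections import Counter

# Category counts from the "Common Values" table for sex.
value_counts = {"Male": 526415, "Female": 464931}

# Each character of a category label occurs once per row with that label.
char_counts = Counter()
for label, rows in value_counts.items():
    for ch in label:
        char_counts[ch] += rows

total_chars = sum(char_counts.values())
for ch, count in char_counts.most_common():
    print(ch, count, f"{100 * count / total_chars:.1f}%")
```

The letter "e" appears once in "Male" and twice in "Female", which is why it leads the table; the total character count matches the Latin/ASCII totals in the report.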
age
Real number (ℝ)
Distinct | 14 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 47.614491 |
Minimum | 20 |
---|---|
Maximum | 85 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 20 |
---|---|
5-th percentile | 25 |
Q1 | 35 |
median | 45 |
Q3 | 60 |
95-th percentile | 70 |
Maximum | 85 |
Range | 65 |
Interquartile range (IQR) | 25 |
Descriptive statistics
Standard deviation | 14.181339 |
---|---|
Coefficient of variation (CV) | 0.29783662 |
Kurtosis | -0.57561552 |
Mean | 47.614491 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 0.15365339 |
Sum | 47202435 |
Variance | 201.11038 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
40 | 130385 | 13.2% |
50 | 129434 | 13.1% |
45 | 118355 | 11.9% |
55 | 111223 | 11.2% |
60 | 106063 | 10.7% |
35 | 84726 | 8.5% |
30 | 77600 | 7.8% |
25 | 64370 | 6.5% |
65 | 52961 | 5.3% |
70 | 50666 | 5.1% |
Other values (4) | 65563 | 6.6% |
Value | Count | Frequency (%) |
20 | 21971 | 2.2% |
25 | 64370 | 6.5% |
30 | 77600 | 7.8% |
35 | 84726 | 8.5% |
40 | 130385 | 13.2% |
45 | 118355 | 11.9% |
50 | 129434 | 13.1% |
55 | 111223 | 11.2% |
60 | 106063 | 10.7% |
65 | 52961 | 5.3% |
Value | Count | Frequency (%) |
85 | 3291 | 0.3% |
80 | 14968 | 1.5% |
75 | 25333 | 2.6% |
70 | 50666 | 5.1% |
65 | 52961 | 5.3% |
60 | 106063 | 10.7% |
55 | 111223 | 11.2% |
50 | 129434 | 13.1% |
45 | 118355 | 11.9% |
40 | 130385 | 13.2% |
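Because age takes only 14 distinct values, the three frequency tables above together list every value and its count, so the summary statistics can be rebuilt from them. The sketch below does this in plain Python; it assumes the sample (n - 1) convention for variance, which is an assumption rather than something the report states.

```python
# Value -> count, assembled from the common, minimum and maximum tables.
age_counts = {
    20: 21971, 25: 64370, 30: 77600, 35: 84726, 40: 130385,
    45: 118355, 50: 129434, 55: 111223, 60: 106063, 65: 52961,
    70: 50666, 75: 25333, 80: 14968, 85: 3291,
}

n = sum(age_counts.values())                       # number of observations
total = sum(v * c for v, c in age_counts.items())  # "Sum"
mean = total / n                                   # "Mean"

# Assumed convention: sample variance with n - 1 in the denominator.
variance = sum(c * (v - mean) ** 2 for v, c in age_counts.items()) / (n - 1)
std = variance ** 0.5
cv = std / mean  # coefficient of variation

print(n, total)
print(f"mean={mean:.6f} std={std:.6f} cv={cv:.6f}")
```

The count and sum reproduce the reported values exactly, which confirms that the "Other values (4)" row covers ages 20, 75, 80 and 85.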
height
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 162.24063 |
Minimum | 130 |
---|---|
Maximum | 190 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 130 |
---|---|
5-th percentile | 145 |
Q1 | 155 |
median | 160 |
Q3 | 170 |
95-th percentile | 175 |
Maximum | 190 |
Range | 60 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 9.2829575 |
---|---|
Coefficient of variation (CV) | 0.057217219 |
Kurtosis | -0.53564034 |
Mean | 162.24063 |
Median Absolute Deviation (MAD) | 5 |
Skewness | -0.02273717 |
Sum | 1.608366 × 10⁸ |
Variance | 86.173299 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
160 | 181809 | 18.3% |
165 | 178228 | 18.0% |
170 | 166328 | 16.8% |
155 | 165678 | 16.7% |
150 | 107929 | 10.9% |
175 | 98850 | 10.0% |
145 | 39176 | 4.0% |
180 | 35970 | 3.6% |
140 | 9100 | 0.9% |
185 | 6588 | 0.7% |
Other values (3) | 1690 | 0.2% |
Value | Count | Frequency (%) |
130 | 86 | < 0.1% |
135 | 1241 | 0.1% |
140 | 9100 | 0.9% |
145 | 39176 | 4.0% |
150 | 107929 | 10.9% |
155 | 165678 | 16.7% |
160 | 181809 | 18.3% |
165 | 178228 | 18.0% |
170 | 166328 | 16.8% |
175 | 98850 | 10.0% |
Value | Count | Frequency (%) |
190 | 363 | < 0.1% |
185 | 6588 | 0.7% |
180 | 35970 | 3.6% |
175 | 98850 | 10.0% |
170 | 166328 | 16.8% |
165 | 178228 | 18.0% |
160 | 181809 | 18.3% |
155 | 165678 | 16.7% |
150 | 107929 | 10.9% |
145 | 39176 | 4.0% |
weight
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 24 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 63.28405 |
Minimum | 25 |
---|---|
Maximum | 140 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 25 |
---|---|
5-th percentile | 45 |
Q1 | 55 |
median | 60 |
Q3 | 70 |
95-th percentile | 85 |
Maximum | 140 |
Range | 115 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 12.514241 |
---|---|
Coefficient of variation (CV) | 0.19774715 |
Kurtosis | 0.35922025 |
Mean | 63.28405 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 0.5765566 |
Sum | 62736390 |
Variance | 156.60622 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
60 | 151134 | 15.2% |
55 | 150415 | 15.2% |
65 | 141241 | 14.2% |
50 | 125079 | 12.6% |
70 | 122281 | 12.3% |
75 | 90207 | 9.1% |
45 | 63047 | 6.4% |
80 | 58176 | 5.9% |
85 | 33708 | 3.4% |
90 | 18250 | 1.8% |
Other values (14) | 37808 | 3.8% |
Value | Count | Frequency (%) |
25 | 9 | < 0.1% |
30 | 157 | < 0.1% |
35 | 1948 | 0.2% |
40 | 16639 | 1.7% |
45 | 63047 | 6.4% |
50 | 125079 | 12.6% |
55 | 150415 | 15.2% |
60 | 151134 | 15.2% |
65 | 141241 | 14.2% |
70 | 122281 | 12.3% |
Value | Count | Frequency (%) |
140 | 3 | < 0.1% |
135 | 5 | < 0.1% |
130 | 43 | < 0.1% |
125 | 80 | < 0.1% |
120 | 236 | < 0.1% |
115 | 573 | 0.1% |
110 | 1177 | 0.1% |
105 | 2454 | 0.2% |
100 | 4829 | 0.5% |
95 | 9655 | 1.0% |
waistline
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 737 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 81.233358 |
Minimum | 8 |
---|---|
Maximum | 999 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 8 |
---|---|
5-th percentile | 66 |
Q1 | 74.1 |
median | 81 |
Q3 | 87.8 |
95-th percentile | 97 |
Maximum | 999 |
Range | 991 |
Interquartile range (IQR) | 13.7 |
Descriptive statistics
Standard deviation | 11.850323 |
---|---|
Coefficient of variation (CV) | 0.14588001 |
Kurtosis | 2066.8122 |
Mean | 81.233358 |
Median Absolute Deviation (MAD) | 6.8 |
Skewness | 26.78844 |
Sum | 80530364 |
Variance | 140.43016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
80 | 37790 | 3.8% |
81 | 34603 | 3.5% |
82 | 34024 | 3.4% |
84 | 33913 | 3.4% |
86 | 32723 | 3.3% |
83 | 32282 | 3.3% |
76 | 31254 | 3.2% |
78 | 30832 | 3.1% |
85 | 30626 | 3.1% |
79 | 28853 | 2.9% |
Other values (727) | 664446 | 67.0% |
Value | Count | Frequency (%) |
8 | 1 | < 0.1% |
27 | 1 | < 0.1% |
30 | 2 | < 0.1% |
32 | 3 | < 0.1% |
35 | 2 | < 0.1% |
40 | 1 | < 0.1% |
42 | 1 | < 0.1% |
43 | 1 | < 0.1% |
48 | 1 | < 0.1% |
49 | 1 | < 0.1% |
Value | Count | Frequency (%) |
999 | 57 | < 0.1% |
149.1 | 1 | < 0.1% |
145 | 1 | < 0.1% |
140 | 1 | < 0.1% |
138 | 1 | < 0.1% |
136.8 | 1 | < 0.1% |
136 | 2 | < 0.1% |
135 | 1 | < 0.1% |
134 | 3 | < 0.1% |
133 | 1 | < 0.1% |
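The value 999 occurs 57 times and sits far above the next largest waistline (149.1), which suggests it may be a placeholder for a missing measurement. That reading is an assumption; the report itself does not say so. Under it, the sketch below estimates the mean with those rows excluded, using only the reported sum and counts (the helper name is hypothetical).

```python
def mean_without_sentinel(total_sum, n_rows, sentinel, sentinel_count):
    """Mean after dropping rows equal to `sentinel` (treated as missing)."""
    kept_sum = total_sum - sentinel * sentinel_count
    kept_rows = n_rows - sentinel_count
    return kept_sum / kept_rows

reported_sum = 80_530_364  # "Sum" for waistline (rounded in the report)
n_rows = 991_346           # "Number of observations"

adjusted = mean_without_sentinel(
    reported_sum, n_rows, sentinel=999, sentinel_count=57
)
print(f"{adjusted:.2f}")
```

The mean moves only slightly below the reported 81.233358, since 57 rows are a tiny share of the data; the higher moments (skewness 26.78844, kurtosis 2066.8122) are far more sensitive to such values.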
sight_left
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 24 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.98083434 |
Minimum | 0.1 |
---|---|
Maximum | 9.9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 0.1 |
---|---|
5-th percentile | 0.4 |
Q1 | 0.7 |
median | 1 |
Q3 | 1.2 |
95-th percentile | 1.5 |
Maximum | 9.9 |
Range | 9.8 |
Interquartile range (IQR) | 0.5 |
Descriptive statistics
Standard deviation | 0.60594863 |
---|---|
Coefficient of variation (CV) | 0.61778897 |
Kurtosis | 144.94968 |
Mean | 0.98083434 |
Median Absolute Deviation (MAD) | 0.2 |
Skewness | 9.994626 |
Sum | 972346.2 |
Variance | 0.36717375 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 201418 | 20.3% |
1.2 | 188460 | 19.0% |
1.5 | 121713 | 12.3% |
0.9 | 105297 | 10.6% |
0.8 | 99913 | 10.1% |
0.7 | 83749 | 8.4% |
0.6 | 53644 | 5.4% |
0.5 | 51895 | 5.2% |
0.4 | 30744 | 3.1% |
0.3 | 20388 | 2.1% |
Other values (14) | 34125 | 3.4% |
Value | Count | Frequency (%) |
0.1 | 9503 | 1.0% |
0.2 | 12255 | 1.2% |
0.3 | 20388 | 2.1% |
0.4 | 30744 | 3.1% |
0.5 | 51895 | 5.2% |
0.6 | 53644 | 5.4% |
0.7 | 83749 | 8.4% |
0.8 | 99913 | 10.1% |
0.9 | 105297 | 10.6% |
1 | 201418 | 20.3% |
Value | Count | Frequency (%) |
9.9 | 3118 | 0.3% |
2.5 | 7 | < 0.1% |
2.2 | 2 | < 0.1% |
2.1 | 3 | < 0.1% |
2 | 8452 | 0.9% |
1.9 | 32 | < 0.1% |
1.8 | 25 | < 0.1% |
1.7 | 14 | < 0.1% |
1.6 | 371 | < 0.1% |
1.5 | 121713 | 12.3% |
sight_right
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 24 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.97842913 |
Minimum | 0.1 |
---|---|
Maximum | 9.9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 0.1 |
---|---|
5-th percentile | 0.4 |
Q1 | 0.7 |
median | 1 |
Q3 | 1.2 |
95-th percentile | 1.5 |
Maximum | 9.9 |
Range | 9.8 |
Interquartile range (IQR) | 0.5 |
Descriptive statistics
Standard deviation | 0.60477411 |
---|---|
Coefficient of variation (CV) | 0.61810722 |
Kurtosis | 145.92255 |
Mean | 0.97842913 |
Median Absolute Deviation (MAD) | 0.2 |
Skewness | 10.033647 |
Sum | 969961.8 |
Variance | 0.36575173 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 204493 | 20.6% |
1.2 | 187266 | 18.9% |
1.5 | 120620 | 12.2% |
0.9 | 106186 | 10.7% |
0.8 | 98777 | 10.0% |
0.7 | 84168 | 8.5% |
0.6 | 53238 | 5.4% |
0.5 | 50803 | 5.1% |
0.4 | 31318 | 3.2% |
0.3 | 20090 | 2.0% |
Other values (14) | 34387 | 3.5% |
Value | Count | Frequency (%) |
0.1 | 10028 | 1.0% |
0.2 | 13002 | 1.3% |
0.3 | 20090 | 2.0% |
0.4 | 31318 | 3.2% |
0.5 | 50803 | 5.1% |
0.6 | 53238 | 5.4% |
0.7 | 84168 | 8.5% |
0.8 | 98777 | 10.0% |
0.9 | 106186 | 10.7% |
1 | 204493 | 20.6% |
Value | Count | Frequency (%) |
9.9 | 3111 | 0.3% |
2.5 | 10 | < 0.1% |
2.2 | 1 | < 0.1% |
2.1 | 10 | < 0.1% |
2 | 7363 | 0.7% |
1.9 | 21 | < 0.1% |
1.8 | 32 | < 0.1% |
1.7 | 24 | < 0.1% |
1.6 | 390 | < 0.1% |
1.5 | 120620 | 12.2% |
hear_left
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.6 MiB |
Common Values
Value | Count | Frequency (%) |
1.0 | 960124 | 96.9% |
2.0 | 31222 | 3.1% |
Most occurring characters
Value | Count | Frequency (%) |
. | 991346 | 33.3% |
0 | 991346 | 33.3% |
1 | 960124 | 32.3% |
2 | 31222 | 1.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1982692 | 66.7% |
Other Punctuation | 991346 | 33.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 991346 | 50.0% |
1 | 960124 | 48.4% |
2 | 31222 | 1.6% |
Other Punctuation
Value | Count | Frequency (%) |
. | 991346 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2974038 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 991346 | 33.3% |
0 | 991346 | 33.3% |
1 | 960124 | 32.3% |
2 | 31222 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2974038 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 991346 | 33.3% |
0 | 991346 | 33.3% |
1 | 960124 | 32.3% |
2 | 31222 | 1.0% |
hear_right
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.6 MiB |
Common Values
Value | Count | Frequency (%) |
1.0 | 961134 | 97.0% |
2.0 | 30212 | 3.0% |
Most occurring characters
Value | Count | Frequency (%) |
. | 991346 | 33.3% |
0 | 991346 | 33.3% |
1 | 961134 | 32.3% |
2 | 30212 | 1.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1982692 | 66.7% |
Other Punctuation | 991346 | 33.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 991346 | 50.0% |
1 | 961134 | 48.5% |
2 | 30212 | 1.5% |
Other Punctuation
Value | Count | Frequency (%) |
. | 991346 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2974038 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 991346 | 33.3% |
0 | 991346 | 33.3% |
1 | 961134 | 32.3% |
2 | 30212 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2974038 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 991346 | 33.3% |
0 | 991346 | 33.3% |
1 | 961134 | 32.3% |
2 | 30212 | 1.0% |
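The imbalance percentages in the alerts for hear_left and hear_right (79.8% and 80.3%) are consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below is an illustrative reimplementation under that assumption, not the profiling library's own code; the function name is hypothetical.

```python
from math import log2

def imbalance_score(counts):
    """1 - H / log2(k), where H is the Shannon entropy of the class shares.

    Returns 0 for perfectly balanced classes and approaches 1 as a
    single class dominates. Assumes at least two classes.
    """
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * log2(p) for p in shares)
    return 1 - entropy / log2(len(counts))

hear_left = imbalance_score([960124, 31222])
hear_right = imbalance_score([961134, 30212])
print(f"{hear_left:.1%} {hear_right:.1%}")
```

A perfectly balanced binary variable would score 0, so values near 0.8 indicate that one class (here the value 1.0) accounts for almost all rows.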
sbp
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 171 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 122.4325 |
Minimum | 67 |
---|---|
Maximum | 273 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 67 |
---|---|
5-th percentile | 100 |
Q1 | 112 |
median | 120 |
Q3 | 131 |
95-th percentile | 148 |
Maximum | 273 |
Range | 206 |
Interquartile range (IQR) | 19 |
Descriptive statistics
Standard deviation | 14.543148 |
---|---|
Coefficient of variation (CV) | 0.11878503 |
Kurtosis | 0.99663922 |
Mean | 122.4325 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 0.48206032 |
Sum | 1.2137297 × 10⁸ |
Variance | 211.50315 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
120 | 78786 | 7.9% |
110 | 72193 | 7.3% |
130 | 71714 | 7.2% |
118 | 40078 | 4.0% |
100 | 30829 | 3.1% |
138 | 24426 | 2.5% |
119 | 24166 | 2.4% |
128 | 23766 | 2.4% |
124 | 22224 | 2.2% |
116 | 22177 | 2.2% |
Other values (161) | 580987 | 58.6% |
Value | Count | Frequency (%) |
67 | 1 | < 0.1% |
70 | 3 | < 0.1% |
72 | 1 | < 0.1% |
73 | 4 | < 0.1% |
74 | 3 | < 0.1% |
75 | 8 | < 0.1% |
76 | 7 | < 0.1% |
77 | 6 | < 0.1% |
78 | 11 | < 0.1% |
79 | 6 | < 0.1% |
Value | Count | Frequency (%) |
273 | 1 | < 0.1% |
270 | 1 | < 0.1% |
255 | 1 | < 0.1% |
253 | 1 | < 0.1% |
244 | 1 | < 0.1% |
241 | 1 | < 0.1% |
240 | 1 | < 0.1% |
238 | 1 | < 0.1% |
236 | 1 | < 0.1% |
235 | 1 | < 0.1% |
dbp
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 127 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 76.052627 |
Minimum | 32 |
---|---|
Maximum | 185 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 32 |
---|---|
5-th percentile | 60 |
Q1 | 70 |
median | 76 |
Q3 | 82 |
95-th percentile | 92 |
Maximum | 185 |
Range | 153 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 9.8893654 |
---|---|
Coefficient of variation (CV) | 0.13003318 |
Kurtosis | 0.89150383 |
Mean | 76.052627 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0.4000338 |
Sum | 75394468 |
Variance | 97.799547 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
80 | 123156 | 12.4% |
70 | 111699 | 11.3% |
78 | 44628 | 4.5% |
60 | 41253 | 4.2% |
72 | 33644 | 3.4% |
75 | 32575 | 3.3% |
76 | 31976 | 3.2% |
74 | 31773 | 3.2% |
82 | 27195 | 2.7% |
90 | 25959 | 2.6% |
Other values (117) | 487488 | 49.2% |
Value | Count | Frequency (%) |
32 | 1 | < 0.1% |
33 | 1 | < 0.1% |
34 | 1 | < 0.1% |
36 | 2 | < 0.1% |
37 | 3 | < 0.1% |
38 | 1 | < 0.1% |
39 | 3 | < 0.1% |
40 | 14 | < 0.1% |
41 | 7 | < 0.1% |
42 | 12 | < 0.1% |
Value | Count | Frequency (%) |
185 | 1 | < 0.1% |
181 | 1 | < 0.1% |
180 | 1 | < 0.1% |
170 | 1 | < 0.1% |
164 | 1 | < 0.1% |
163 | 1 | < 0.1% |
160 | 2 | < 0.1% |
156 | 2 | < 0.1% |
154 | 2 | < 0.1% |
153 | 2 | < 0.1% |
blds
Real number (ℝ)
Distinct | 498 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 100.42445 |
Minimum | 25 |
---|---|
Maximum | 852 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 25 |
---|---|
5-th percentile | 79 |
Q1 | 88 |
median | 96 |
Q3 | 105 |
95-th percentile | 137 |
Maximum | 852 |
Range | 827 |
Interquartile range (IQR) | 17 |
Descriptive statistics
Standard deviation | 24.17996 |
---|---|
Coefficient of variation (CV) | 0.24077762 |
Kurtosis | 40.470487 |
Mean | 100.42445 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 4.6173775 |
Sum | 99555374 |
Variance | 584.67045 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
93 | 35243 | 3.6% |
92 | 35227 | 3.6% |
95 | 35190 | 3.5% |
94 | 35173 | 3.5% |
91 | 34389 | 3.5% |
96 | 33814 | 3.4% |
90 | 33754 | 3.4% |
97 | 32981 | 3.3% |
89 | 32178 | 3.2% |
98 | 31902 | 3.2% |
Other values (488) | 651495 | 65.7% |
Value | Count | Frequency (%) |
25 | 1 | < 0.1% |
30 | 1 | < 0.1% |
32 | 1 | < 0.1% |
33 | 2 | < 0.1% |
34 | 2 | < 0.1% |
36 | 2 | < 0.1% |
37 | 1 | < 0.1% |
38 | 4 | < 0.1% |
39 | 1 | < 0.1% |
40 | 1 | < 0.1% |
Value | Count | Frequency (%) |
852 | 1 | < 0.1% |
801 | 1 | < 0.1% |
800 | 1 | < 0.1% |
784 | 1 | < 0.1% |
769 | 1 | < 0.1% |
741 | 1 | < 0.1% |
685 | 1 | < 0.1% |
663 | 1 | < 0.1% |
638 | 1 | < 0.1% |
629 | 2 | < 0.1% |
tot_chole
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 474 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 195.55702 |
Minimum | 30 |
---|---|
Maximum | 2344 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 30 |
---|---|
5-th percentile | 137 |
Q1 | 169 |
median | 193 |
Q3 | 219 |
95-th percentile | 261 |
Maximum | 2344 |
Range | 2314 |
Interquartile range (IQR) | 50 |
Descriptive statistics
Standard deviation | 38.660155 |
---|---|
Coefficient of variation (CV) | 0.19769249 |
Kurtosis | 49.462386 |
Mean | 195.55702 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 1.5568817 |
Sum | 1.9386467 × 10⁸ |
Variance | 1494.6076 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
199 | 11079 | 1.1% |
184 | 10873 | 1.1% |
189 | 10857 | 1.1% |
190 | 10825 | 1.1% |
188 | 10796 | 1.1% |
197 | 10775 | 1.1% |
187 | 10746 | 1.1% |
192 | 10746 | 1.1% |
196 | 10723 | 1.1% |
186 | 10717 | 1.1% |
Other values (464) | 883209 | 89.1% |
Value | Count | Frequency (%) |
30 | 1 | < 0.1% |
45 | 1 | < 0.1% |
54 | 1 | < 0.1% |
55 | 1 | < 0.1% |
57 | 3 | < 0.1% |
58 | 1 | < 0.1% |
59 | 1 | < 0.1% |
60 | 1 | < 0.1% |
62 | 1 | < 0.1% |
63 | 2 | < 0.1% |
Value | Count | Frequency (%) |
2344 | 1 | < 0.1% |
2196 | 1 | < 0.1% |
2067 | 1 | < 0.1% |
2046 | 1 | < 0.1% |
2033 | 1 | < 0.1% |
1815 | 1 | < 0.1% |
1736 | 1 | < 0.1% |
1619 | 1 | < 0.1% |
1605 | 1 | < 0.1% |
1575 | 1 | < 0.1% |
hdl_chole
Real number (ℝ)
SKEWED
 
Distinct | 223 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 56.9368 |
Minimum | 1 |
---|---|
Maximum | 8110 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 36 |
Q1 | 46 |
median | 55 |
Q3 | 66 |
95-th percentile | 84 |
Maximum | 8110 |
Range | 8109 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 17.238479 |
---|---|
Coefficient of variation (CV) | 0.30276515 |
Kurtosis | 48094.155 |
Mean | 56.9368 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 104.57764 |
Sum | 56444069 |
Variance | 297.16516 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50 | 29602 | 3.0% |
52 | 28335 | 2.9% |
53 | 28323 | 2.9% |
51 | 28126 | 2.8% |
54 | 27952 | 2.8% |
49 | 27869 | 2.8% |
48 | 27428 | 2.8% |
55 | 27092 | 2.7% |
56 | 26827 | 2.7% |
47 | 26476 | 2.7% |
Other values (213) | 713316 | 72.0% |
Value | Count | Frequency (%) |
1 | 3 | < 0.1% |
2 | 7 | < 0.1% |
3 | 3 | < 0.1% |
4 | 5 | < 0.1% |
5 | 2 | < 0.1% |
6 | 6 | < 0.1% |
7 | 12 | < 0.1% |
8 | 6 | < 0.1% |
9 | 11 | < 0.1% |
10 | 15 | < 0.1% |
Value | Count | Frequency (%) |
8110 | 1 | < 0.1% |
1206 | 1 | < 0.1% |
933 | 1 | < 0.1% |
797 | 1 | < 0.1% |
727 | 1 | < 0.1% |
701 | 1 | < 0.1% |
697 | 1 | < 0.1% |
677 | 1 | < 0.1% |
658 | 1 | < 0.1% |
636 | 1 | < 0.1% |
ldl_chole
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 432 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 113.03769 |
Minimum | 1 |
---|---|
Maximum | 5119 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 60 |
Q1 | 89 |
median | 111 |
Q3 | 135 |
95-th percentile | 172 |
Maximum | 5119 |
Range | 5118 |
Interquartile range (IQR) | 46 |
Descriptive statistics
Standard deviation | 35.842812 |
---|---|
Coefficient of variation (CV) | 0.31708726 |
Kurtosis | 481.28298 |
Mean | 113.03769 |
Median Absolute Deviation (MAD) | 23 |
Skewness | 5.2517394 |
Sum | 1.1205946 × 10⁸ |
Variance | 1284.7072 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
109 | 11824 | 1.2% |
104 | 11795 | 1.2% |
107 | 11782 | 1.2% |
110 | 11773 | 1.2% |
102 | 11740 | 1.2% |
112 | 11656 | 1.2% |
115 | 11631 | 1.2% |
108 | 11611 | 1.2% |
105 | 11607 | 1.2% |
106 | 11597 | 1.2% |
Other values (422) | 874330 | 88.2% |
Value | Count | Frequency (%) |
1 | 81 | < 0.1% |
2 | 13 | < 0.1% |
3 | 13 | < 0.1% |
4 | 11 | < 0.1% |
5 | 20 | < 0.1% |
6 | 23 | < 0.1% |
7 | 29 | < 0.1% |
8 | 40 | < 0.1% |
9 | 31 | < 0.1% |
10 | 39 | < 0.1% |
Value | Count | Frequency (%) |
5119 | 1 | < 0.1% |
2254 | 1 | < 0.1% |
2114 | 1 | < 0.1% |
2111 | 1 | < 0.1% |
2043 | 1 | < 0.1% |
2026 | 1 | < 0.1% |
1933 | 1 | < 0.1% |
1798 | 1 | < 0.1% |
1750 | 1 | < 0.1% |
1696 | 1 | < 0.1% |
triglyceride
Real number (ℝ)
Distinct | 1657 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 132.14175 |
Minimum | 1 |
---|---|
Maximum | 9490 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 46 |
Q1 | 73 |
median | 106 |
Q3 | 159 |
95-th percentile | 297 |
Maximum | 9490 |
Range | 9489 |
Interquartile range (IQR) | 86 |
Descriptive statistics
Standard deviation | 102.19698 |
---|---|
Coefficient of variation (CV) | 0.77338906 |
Kurtosis | 175.38524 |
Mean | 132.14175 |
Median Absolute Deviation (MAD) | 39 |
Skewness | 6.5293729 |
Sum | 1.309982 × 10⁸ |
Variance | 10444.224 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
72 | 8236 | 0.8% |
78 | 8207 | 0.8% |
79 | 8178 | 0.8% |
69 | 8139 | 0.8% |
70 | 8131 | 0.8% |
76 | 8122 | 0.8% |
68 | 8120 | 0.8% |
82 | 8102 | 0.8% |
75 | 8096 | 0.8% |
77 | 8095 | 0.8% |
Other values (1647) | 909920 | 91.8% |
Value | Count | Frequency (%) |
1 | 4 | < 0.1% |
2 | 1 | < 0.1% |
3 | 1 | < 0.1% |
4 | 2 | < 0.1% |
5 | 2 | < 0.1% |
6 | 2 | < 0.1% |
7 | 10 | < 0.1% |
8 | 7 | < 0.1% |
9 | 11 | < 0.1% |
10 | 8 | < 0.1% |
Value | Count | Frequency (%) |
9490 | 1 | < 0.1% |
6430 | 1 | < 0.1% |
6173 | 1 | < 0.1% |
5236 | 1 | < 0.1% |
4164 | 1 | < 0.1% |
4000 | 1 | < 0.1% |
3858 | 1 | < 0.1% |
3848 | 1 | < 0.1% |
3830 | 1 | < 0.1% |
3771 | 1 | < 0.1% |
hemoglobin
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 190 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14.229824 |
Minimum | 1 |
---|---|
Maximum | 25 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 11.7 |
Q1 | 13.2 |
median | 14.3 |
Q3 | 15.4 |
95-th percentile | 16.6 |
Maximum | 25 |
Range | 24 |
Interquartile range (IQR) | 2.2 |
Descriptive statistics
Standard deviation | 1.5849287 |
---|---|
Coefficient of variation (CV) | 0.11138077 |
Kurtosis | 0.71137942 |
Mean | 14.229824 |
Median Absolute Deviation (MAD) | 1.1 |
Skewness | -0.3839878 |
Sum | 14106679 |
Variance | 2.5119991 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
13.5 | 23297 | 2.4% |
14 | 23108 | 2.3% |
13.6 | 23093 | 2.3% |
13.4 | 22946 | 2.3% |
13.8 | 22781 | 2.3% |
13.3 | 22734 | 2.3% |
13.9 | 22635 | 2.3% |
15 | 22600 | 2.3% |
13.7 | 22591 | 2.3% |
14.8 | 22181 | 2.2% |
Other values (180) | 763380 | 77.0% |
Value | Count | Frequency (%) |
1 | 3 | < 0.1% |
2.8 | 1 | < 0.1% |
3.7 | 3 | < 0.1% |
3.8 | 1 | < 0.1% |
3.9 | 3 | < 0.1% |
4 | 4 | < 0.1% |
4.1 | 2 | < 0.1% |
4.2 | 4 | < 0.1% |
4.3 | 3 | < 0.1% |
4.4 | 2 | < 0.1% |
Value | Count | Frequency (%) |
25 | 2 | < 0.1% |
24.2 | 1 | < 0.1% |
23.9 | 1 | < 0.1% |
23.6 | 1 | < 0.1% |
23.3 | 1 | < 0.1% |
22.7 | 1 | < 0.1% |
22.1 | 1 | < 0.1% |
22 | 1 | < 0.1% |
21.8 | 1 | < 0.1% |
21.7 | 2 | < 0.1% |
urine_protein
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.0942244 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 2 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.43772355 |
---|---|
Coefficient of variation (CV) | 0.40003087 |
Kurtosis | 36.899552 |
Mean | 1.0942244 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.6724908 |
Sum | 1084755 |
Variance | 0.19160191 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 935175 | 94.3% |
2 | 30850 | 3.1% |
3 | 16405 | 1.7% |
4 | 6427 | 0.6% |
5 | 1977 | 0.2% |
6 | 512 | 0.1% |
Value | Count | Frequency (%) |
1 | 935175 | 94.3% |
2 | 30850 | 3.1% |
3 | 16405 | 1.7% |
4 | 6427 | 0.6% |
5 | 1977 | 0.2% |
6 | 512 | 0.1% |
Value | Count | Frequency (%) |
6 | 512 | 0.1% |
5 | 1977 | 0.2% |
4 | 6427 | 0.6% |
3 | 16405 | 1.7% |
2 | 30850 | 3.1% |
1 | 935175 | 94.3% |
serum_creatinine
Real number (ℝ)
SKEWED
 
Distinct | 183 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.86046668 |
Minimum | 0.1 |
---|---|
Maximum | 98 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 0.1 |
---|---|
5-th percentile | 0.6 |
Q1 | 0.7 |
median | 0.8 |
Q3 | 1 |
95-th percentile | 1.2 |
Maximum | 98 |
Range | 97.9 |
Interquartile range (IQR) | 0.3 |
Descriptive statistics
Standard deviation | 0.48053042 |
---|---|
Coefficient of variation (CV) | 0.55845326 |
Kurtosis | 19089.83 |
Mean | 0.86046668 |
Median Absolute Deviation (MAD) | 0.1 |
Skewness | 111.02206 |
Sum | 853020.2 |
Variance | 0.23090948 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.8 | 194902 | 19.7% |
0.9 | 180626 | 18.2% |
0.7 | 164293 | 16.6% |
1 | 140743 | 14.2% |
0.6 | 109236 | 11.0% |
1.1 | 86355 | 8.7% |
1.2 | 40744 | 4.1% |
0.5 | 38932 | 3.9% |
1.3 | 15160 | 1.5% |
0.4 | 6050 | 0.6% |
Other values (173) | 14305 | 1.4% |
Value | Count | Frequency (%) |
0.1 | 425 | < 0.1% |
0.2 | 99 | < 0.1% |
0.3 | 597 | 0.1% |
0.4 | 6050 | 0.6% |
0.5 | 38932 | 3.9% |
0.6 | 109236 | 11.0% |
0.7 | 164293 | 16.6% |
0.8 | 194902 | 19.7% |
0.9 | 180626 | 18.2% |
1 | 140743 | 14.2% |
Value | Count | Frequency (%) |
98 | 2 | < 0.1% |
96 | 2 | < 0.1% |
95 | 1 | < 0.1% |
94 | 1 | < 0.1% |
93 | 1 | < 0.1% |
87 | 1 | < 0.1% |
85 | 1 | < 0.1% |
81 | 1 | < 0.1% |
80 | 1 | < 0.1% |
79 | 1 | < 0.1% |
sgot_ast
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 568 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 25.989308 |
Minimum | 1 |
---|---|
Maximum | 9999 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 7.6 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 15 |
Q1 | 19 |
median | 23 |
Q3 | 28 |
95-th percentile | 46 |
Maximum | 9999 |
Range | 9998 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 23.493386 |
---|---|
Coefficient of variation (CV) | 0.90396349 |
Kurtosis | 50432.651 |
Mean | 25.989308 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 150.49169 |
Sum | 25764397 |
Variance | 551.93919 |
Monotonicity | Not monotonic |