Overview

Dataset statistics

Number of variables9
Number of observations537
Missing cells0
Missing cells (%)0.0%
Total size in memory38.3 KiB
Average record size in memory73.0 B

Variable types

Numeric8
Boolean1

Alerts

Pregnancies has 76 (14.2%) zerosZeros
BloodPressure has 19 (3.5%) zerosZeros
SkinThickness has 154 (28.7%) zerosZeros
Insulin has 261 (48.6%) zerosZeros

Reproduction

Analysis started2026-03-12 11:28:35.345016
Analysis finished2026-03-12 11:28:35.365284
Duration0.02 seconds
Software versionydata-profiling vv4.18.1
Download configurationconfig.json

Variables

Pregnancies
Real number (ℝ)

Zeros 

Distinct17
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.726256983
Minimum0
Maximum17
Zeros76
Zeros (%)14.2%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2026-03-12T12:28:35.418645image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q36
95-th percentile10
Maximum17
Range17
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.262964891
Coefficient of variation (CV)0.8756682392
Kurtosis0.3572700088
Mean3.726256983
Median Absolute Deviation (MAD)2
Skewness0.9228620592
Sum2001
Variance10.64693988
MonotonicityNot monotonic
2026-03-12T12:28:35.554288image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
1106
19.7%
076
14.2%
265
12.1%
354
10.1%
445
8.4%
542
 
7.8%
639
 
7.3%
731
 
5.8%
826
 
4.8%
919
 
3.5%
Other values (7)34
 
6.3%
ValueCountFrequency (%)
076
14.2%
1106
19.7%
265
12.1%
354
10.1%
445
8.4%
ValueCountFrequency (%)
171
 
0.2%
151
 
0.2%
142
0.4%
133
0.6%
124
0.7%

Glucose
Real number (ℝ)

Distinct128
Distinct (%)23.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean120.849162
Minimum0
Maximum199
Zeros5
Zeros (%)0.9%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2026-03-12T12:28:35.657563image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile79
Q199
median117
Q3139
95-th percentile181
Maximum199
Range199
Interquartile range (IQR)40

Descriptive statistics

Standard deviation32.33952292
Coefficient of variation (CV)0.2676023762
Kurtosis1.116704041
Mean120.849162
Median Absolute Deviation (MAD)19
Skewness0.07338163583
Sum64896
Variance1045.844743
MonotonicityNot monotonic
2026-03-12T12:28:35.726856image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9915
 
2.8%
12512
 
2.2%
10011
 
2.0%
11411
 
2.0%
10810
 
1.9%
10610
 
1.9%
11110
 
1.9%
9510
 
1.9%
12810
 
1.9%
12210
 
1.9%
Other values (118)428
79.7%
ValueCountFrequency (%)
05
0.9%
441
 
0.2%
561
 
0.2%
572
 
0.4%
651
 
0.2%
ValueCountFrequency (%)
1991
 
0.2%
1981
 
0.2%
1973
0.6%
1963
0.6%
1952
0.4%

BloodPressure
Real number (ℝ)

Zeros 

Distinct44
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.68528864
Minimum0
Maximum122
Zeros19
Zeros (%)3.5%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2026-03-12T12:28:35.791647image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile44
Q164
median72
Q380
95-th percentile90
Maximum122
Range122
Interquartile range (IQR)16

Descriptive statistics

Standard deviation18.09437396
Coefficient of variation (CV)0.2596584489
Kurtosis5.802639348
Mean69.68528864
Median Absolute Deviation (MAD)8
Skewness-1.831636222
Sum37421
Variance327.406369
MonotonicityNot monotonic
2026-03-12T12:28:35.856873image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/
Histogram with fixed size bins (bins=44)
ValueCountFrequency (%)
7041
 
7.6%
7438
 
7.1%
6837
 
6.9%
8034
 
6.3%
6432
 
6.0%
7229
 
5.4%
7628
 
5.2%
7827
 
5.0%
6225
 
4.7%
6623
 
4.3%
Other values (34)223
41.5%
ValueCountFrequency (%)
019
3.5%
241
 
0.2%
302
 
0.4%
381
 
0.2%
401
 
0.2%
ValueCountFrequency (%)
1221
0.2%
1102
0.4%
1062
0.4%
1042
0.4%
1021
0.2%

SkinThickness
Real number (ℝ)

Zeros 

Distinct47
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.4320298
Minimum0
Maximum63
Zeros154
Zeros (%)28.7%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2026-03-12T12:28:35.917341image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median23
Q332
95-th percentile43.2
Maximum63
Range63
Interquartile range (IQR)32

Descriptive statistics

Standard deviation15.49071515
Coefficient of variation (CV)0.7581584063
Kurtosis-1.127757802
Mean20.4320298
Median Absolute Deviation (MAD)12
Skewness-0.02648576357
Sum10972
Variance239.9622558
MonotonicityNot monotonic
2026-03-12T12:28:35.980374image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
0154
28.7%
3022
 
4.1%
3221
 
3.9%
2318
 
3.4%
2715
 
2.8%
2514
 
2.6%
3914
 
2.6%
1913
 
2.4%
1813
 
2.4%
2813
 
2.4%
Other values (37)240
44.7%
ValueCountFrequency (%)
0154
28.7%
81
 
0.2%
104
 
0.7%
115
 
0.9%
126
 
1.1%
ValueCountFrequency (%)
631
 
0.2%
601
 
0.2%
521
 
0.2%
511
 
0.2%
503
0.6%

Insulin
Real number (ℝ)

Zeros 

Distinct153
Distinct (%)28.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean79.83612663
Minimum0
Maximum846
Zeros261
Zeros (%)48.6%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2026-03-12T12:28:36.043907image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median36
Q3129
95-th percentile291.4
Maximum846
Range846
Interquartile range (IQR)129

Descriptive statistics

Standard deviation115.1967297
Coefficient of variation (CV)1.442914812
Kurtosis8.017463042
Mean79.83612663
Median Absolute Deviation (MAD)36
Skewness2.358206734
Sum42872
Variance13270.28653
MonotonicityNot monotonic
2026-03-12T12:28:36.105531image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0261
48.6%
1408
 
1.5%
947
 
1.3%
1057
 
1.3%
1207
 
1.3%
1806
 
1.1%
1306
 
1.1%
1005
 
0.9%
1105
 
0.9%
1355
 
0.9%
Other values (143)220
41.0%
ValueCountFrequency (%)
0261
48.6%
141
 
0.2%
181
 
0.2%
221
 
0.2%
232
 
0.4%
ValueCountFrequency (%)
8461
0.2%
7441
0.2%
6001
0.2%
5431
0.2%
5401
0.2%

BMI
Real number (ℝ)

Distinct218
Distinct (%)40.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.97560521
Minimum0
Maximum67.1
Zeros5
Zeros (%)0.9%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2026-03-12T12:28:36.167583image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile21.8
Q126.8
median32
Q336.5
95-th percentile44.52
Maximum67.1
Range67.1
Interquartile range (IQR)9.7

Descriptive statistics

Standard deviation7.624495387
Coefficient of variation (CV)0.238447258
Kurtosis2.755199972
Mean31.97560521
Median Absolute Deviation (MAD)4.8
Skewness-0.1080921598
Sum17170.9
Variance58.1329299
MonotonicityNot monotonic
2026-03-12T12:28:36.231409image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3211
 
2.0%
31.69
 
1.7%
31.29
 
1.7%
32.48
 
1.5%
32.88
 
1.5%
30.17
 
1.3%
29.76
 
1.1%
39.46
 
1.1%
27.86
 
1.1%
25.96
 
1.1%
Other values (208)461
85.8%
ValueCountFrequency (%)
05
0.9%
18.23
0.6%
19.31
 
0.2%
19.41
 
0.2%
19.51
 
0.2%
ValueCountFrequency (%)
67.11
0.2%
59.41
0.2%
551
0.2%
52.91
0.2%
49.71
0.2%

DiabetesPedigreeFunction
Real number (ℝ)

Distinct402
Distinct (%)74.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4699199255
Minimum0.078
Maximum2.42
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2026-03-12T12:28:36.295613image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/

Quantile statistics

Minimum0.078
5-th percentile0.137
Q10.241
median0.374
Q30.612
95-th percentile1.1364
Maximum2.42
Range2.342
Interquartile range (IQR)0.371

Descriptive statistics

Standard deviation0.3420873633
Coefficient of variation (CV)0.7279694788
Kurtosis6.847192339
Mean0.4699199255
Median Absolute Deviation (MAD)0.167
Skewness2.158685873
Sum252.347
Variance0.1170237641
MonotonicityNot monotonic
2026-03-12T12:28:36.362553image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.2585
 
0.9%
0.2685
 
0.9%
0.2614
 
0.7%
0.263
 
0.6%
0.2993
 
0.6%
0.2353
 
0.6%
0.193
 
0.6%
0.2453
 
0.6%
0.2923
 
0.6%
0.2053
 
0.6%
Other values (392)502
93.5%
ValueCountFrequency (%)
0.0781
0.2%
0.0841
0.2%
0.0852
0.4%
0.0881
0.2%
0.0891
0.2%
ValueCountFrequency (%)
2.421
0.2%
2.3291
0.2%
2.2881
0.2%
2.1371
0.2%
1.8931
0.2%

Age
Real number (ℝ)

Distinct51
Distinct (%)9.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.0744879
Minimum21
Maximum81
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2026-03-12T12:28:36.424602image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/

Quantile statistics

Minimum21
5-th percentile21
Q124
median29
Q341
95-th percentile57
Maximum81
Range60
Interquartile range (IQR)17

Descriptive statistics

Standard deviation11.68531899
Coefficient of variation (CV)0.3533030967
Kurtosis0.8842483866
Mean33.0744879
Median Absolute Deviation (MAD)7
Skewness1.169307637
Sum17761
Variance136.54668
MonotonicityNot monotonic
2026-03-12T12:28:36.487143image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2252
 
9.7%
2146
 
8.6%
2535
 
6.5%
2431
 
5.8%
2326
 
4.8%
2824
 
4.5%
2624
 
4.5%
2722
 
4.1%
3118
 
3.4%
2918
 
3.4%
Other values (41)241
44.9%
ValueCountFrequency (%)
2146
8.6%
2252
9.7%
2326
4.8%
2431
5.8%
2535
6.5%
ValueCountFrequency (%)
811
0.2%
721
0.2%
701
0.2%
692
0.4%
681
0.2%

Outcome
Boolean

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
False
349 
True
188 
ValueCountFrequency (%)
False349
65.0%
True188
35.0%
2026-03-12T12:28:36.542316image/svg+xmlMatplotlib v3.7.5, https://matplotlib.org/