Retrospective Study
Copyright ©The Author(s) 2024.
World J Clin Cases. May 26, 2024; 12(15): 2506-2521
Published online May 26, 2024. doi: 10.12998/wjcc.v12.i15.2506
Table 1 Participant descriptive statistics and risk factors, percentage and mean (± SD)
Characteristics
None
Fatty liver
P value
N number34, 33531, 200
Age (yr)36.75 ± 12.3347.52 ± 12.8< 0.001
Income2.04 ± 1.471.61 ± 1.57< 0.001
Body fat (%)26.66 ± 5.5535.96 ± 6.83< 0.001
Systolic blood pressure (mmHg)111.83 ± 16.07124.52 ± 19.68< 0.001
Diastolic blood pressure (mmHg)66.81 ± 10.2173.8 ± 11.69< 0.001
Leukocyte (× 103/μL)5.93 ± 1.736.45 ± 1.73< 0.001
Hemoglobin (× 106/μL)13.09 ± 1.1413.36 ± 1.18< 0.001
Platelets (× 103/μL)248.96 ± 57.74264.39 ± 63.31< 0.001
Fasting plasma glucose (mg/dL)92.75 ± 10.64103.7 ± 25.6< 0.001
Total bilirubin (mg/dL)0.77 ± 0.320.73 ± 0.33< 0.001
Albumin (mg/dL)4.5 ± 0.264.45 ± 0.24< 0.001
Globulin (mg/dL)3.08 ± 0.363.15 ± 0.36< 0.001
Alkaline Phosphatase (IU/L)101.86 ± 49.29110.72 ± 59.94< 0.001
Serum glutamic oxaloacetic transaminase (mg/dL)19.88 ± 11.7124.61 ± 17.56< 0.001
Serum glutamic pyruvic transaminase (IU/L)17.68 ± 18.6228.72 ± 27.22< 0.001
Serum γ-glutamyl transpeptidase (IU/L)14.22 ± 13.9425.3 ± 30.09< 0.001
Lactate dehydrogenase (IU/L)241.42 ± 84.03246.65 ± 92.63< 0.001
Estimated glomerular filtration rate (mL/min/1.73 m2)89.6 ± 76.8984.33 ± 72.4< 0.001
Uric acid (mg/dL)4.88 ± 1.095.67 ± 1.34< 0.001
Triglyceride (mg/dL)78.81 ± 42.95139.02 ± 101.03< 0.001
High density lipoprotein cholesterol (mg/dL)61.73 ± 14.854.17 ± 13.47< 0.001
Low density lipoprotein cholesterol (mg/dL)107.97 ± 30.48125.76 ± 34.09< 0.001
Calcium (mg/dL)9.2 ± 0.399.29 ± 0.41< 0.001
Phosphorus (mg/dL)3.72 ± 0.443.7 ± 0.46< 0.001
Thyroid stimulating hormone (IU/mL)1.75 ± 3.251.94 ± 3.64< 0.001
C-reactive protein (mg/dL)0.19 ± 0.450.3 ± 0.51< 0.001
Forced expiratory volume in one second (L)2.2 ± 0.461.96 ± 0.53< 0.001
Drink area0.97 ± 6.51.38 ± 8.72< 0.001
Smoke area1.5 ± 7.151.57 ± 7.89< 0.001
Betel nut area0 ± 00.02 ± 1.2< 0.001
Sport area3.32 ± 5.993.96 ± 6.19< 0.001
Sleep time2.91 ± 0.592.87 ± 0.720.25
Marriage, n (%)
Unmarried13 (458)8 (438)< 0.001
Married19 (939)2 (1545)
Table 2 Comparison with SMAPE, RAE, RRSE, and RMSE between multiple linear regression and machine learning methods
NAFLD+ group with age
MAPE
SMAPE
RAE
RRSE
RMSE
Linear0.139    0.1320.8450.84213.959
SGB0.138    0.1310.8410.83413.825
XGBoost0.139    0.1320.8450.84213.946
Elasticnet0.139    0.1320.8450.84213.954
NAFLD- group with age    
Linear0.133    0.1280.8680.86214.671
SGB0.132    0.1260.8550.85714.59
XGboost0.132    0.1260.8530.85714.58
Elasticnet0.134    0.1280.8680.86214.673
NAFLD+ group without age    
Linear0.154    0.140.8720.89715.606
SGB0.153    0.1390.8650.88815.444
XGboost0.153    0.140.8690.89115.49
Elasticnet0.154    0.140.8720.89715.596
NAFLD- group without age    
Linear0.134    0.130.9050.90615.149
SGB0.133    0.1290.8950.89214.915
XGboost0.133    0.1290.8950.89314.916
Elasticnet0.134    0.130.9040.90515.119
Table 3 The average of the importance of risk factors derived from stochastic gradient boosting, random forest and extreme gradient boost, in NAFLD+ (Model 1, including age)
Variables
SGB
XGBoost
Elasticnet
Average
Rank
Age10010015.0871.69 1
Income0.14000.05
Body fat3.941.93.273.04
Systolic blood pressure1.370.671.011.02
Diastolic blood pressure2.670.671.81.71
Leukocyte0.720.334.321.79
Hemoglobin1000.33
Platelets3.641.620.311.86
Fasting plasma glucose2.921.180.661.59
Total bilirubin6.862.2940.2416.46 6
Albumin1.840.5449.8517.41 5
Globulin0.280.2917.836.13
Alkaline Phosphatase0.990.1500.38
Serum glutamic oxaloacetic transaminase1.630.3600.66
Serum glutamic pyruvic transaminase3.831.940.822.20
Serum γ-glutamyl transpeptidase1.891.330.481.23
Lactate dehydrogenase23.2123.250.8615.77
Uric acid27.0324.0560.4237.17 2
Triglyceride0.8400.020.29
High density lipoprotein cholesterol1.990.81.161.32
Low density lipoprotein cholesterol1.780.140.10.67
Calcium3.972.876523.95 4
Phosphorus0.790.3600.38
Thyroid stimulating hormone6.923.994.915.27
C-reactive protein0.610.226.992.61
Forced expiratory volume in one second6.633.4110036.68 3
Drink area0.1100.070.06
Smoke area0.25000.08
Betel nut area0000.00
Sport area0.450.20.280.31
Sleep time0.14000.05
Marriage001.990.66
Table 4 The average of the importance of risk factors derived from stochastic gradient boosting, random forest and extreme gradient boost, in NAFLD- (Model 1, including age)
Variables
SGB
XGBoost
Elasticnet
Average
Rank
Age10010015.1971.73 1
Income001.410.47
Body fat3.691.054.152.96
Systolic blood pressure0.460.090.750.43
Diastolic blood pressure4.193.092.963.41
Leukocyte1.210.344.522.02
Hemoglobin4.81111.575.79
Platelets2.611.060.21.29
Fasting plasma glucose0.420.210.170.27
Total bilirubin3.111.8419.248.06
Albumin2.281.3469.5324.38 4
Globulin0.420.123.031.19
Alkaline Phosphatase1.840.220.040.70
Serum glutamic oxaloacetic transaminase0.52000.17
Serum glutamic pyruvic transaminase3.791.941.122.28
Serum γ-glutamyl transpeptidase1.1600.380.51
Lactate dehydrogenase21.9918.240.9713.73 5
Uric acid26.6222.9976.3541.99 2
Triglyceride1.590.3100.63
High density lipoprotein cholesterol1.40.250.650.77
Low density lipoprotein cholesterol2.370.260.180.94
Calcium1.660.6429.9610.75 6
Phosphorus2.052.0710.985.03
Thyroid stimulating hormone11.868.695.028.52
C-reactive protein0.4202.731.05
Forced expiratory volume in one second5.063.0110036.02 3
Drink area00.2500.08
Smoke area0.370.170.790.44
Betel nut area0000.00
Sport area1.130.651.551.11
Sleep time0000.00
Marriage0.1305.511.88
Table 5 The average of the importance of risk factors derived from stochastic gradient boosting, random forest and extreme gradient boost, in NAFLD+ (Model 2, excluding age)
Variables
SGB
XGBoost
Elasticnet
Average
Rank
Income12.8815.998.0612.31
Body fat22.2516.833.8514.31 5
Systolic blood pressure11.299.240.416.98
Diastolic blood pressure9.176.851.195.74
Leukocyte1.730.581.91.40
Hemoglobin2.570.314.272.38
Platelets17.8314.870.3211.01
Fasting plasma glucose9.726.710.25.54
Total bilirubin9.63.5628.6513.94
Albumin12.059.8110040.62 3
Globulin1.851.2510.624.57
Alkaline Phosphatase4.280.990.061.78
Serum glutamic oxaloacetic transaminase3.453.051.452.65
Serum glutamic pyruvic transaminase16.2711.921.579.92
Serum γ-glutamyl transpeptidase1.140.650.280.69
Lactate dehydrogenase1001000.666.87 1
Uric acid50.1645.6630.1541.99 2
Triglyceride6.233.560.143.31
High density lipoprotein cholesterol0.860.820.050.58
Low density lipoprotein cholesterol9.66.730.465.60
Calcium12.489.0753.4825.01 4
Phosphorus0.791.871.481.38
Thyroid stimulating hormone16.4211.232.239.96
C-reactive protein002.110.70
Forced expiratory volume in one second39.3944.3238.1540.62 3
Drink area000.260.09
Smoke area0000.00
Betel nut area0000.00
Sport area3.953.831.493.09
Sleep time0.860.54.041.80
Marriage002.380.79
Table 6 The average of the importance of risk factors derived from stochastic gradient boosting, random forest and extreme gradient boost, in NAFLD- (Model 2, excluding age)
Variables
SGB
XGBoost
Elasticnet
Average
Rank
Income2.491.951.612.02
Body fat7.572.681.483.91
Systolic blood pressure28.6630.680.5919.98 6
Diastolic blood pressure18.4421.961.7314.04
Leukocyte9.075.266.636.99
Hemoglobin12.511.953.145.87
Platelets12.138.680.237.01
Fasting plasma glucose6.674.960.74.11
Total bilirubin9.075.163.375.87
Albumin21.9520.1610047.37 4
Globulin1.32000.44
Alkaline Phosphatase2.7500.020.92
Serum glutamic oxaloacetic transaminase4.063.151.592.93
Serum glutamic pyruvic transaminase9.096.481.745.77
Serum γ-glutamyl transpeptidase1.1100.110.41
Lactate dehydrogenase1001000.6366.88 1
Uric acid66.9263.2436.6855.61 2
Triglyceride12.3980.346.91
High density lipoprotein cholesterol2.640.670.171.16
Low density lipoprotein cholesterol14.1810.110.518.27
Calcium3.822.421.049.09
Phosphorus5.656.520.14.09
Thyroid stimulating hormone34.3224.162.5620.35 5
C-reactive protein2.68000.89
Forced expiratory volume in one second61.0264.426.9350.78 3
Drink area1.011.2100.74
Smoke area2.241.220.851.44
Betel nut area0000.00
Sport area13.3211.442.469.07
Sleep time0.790.4710.693.98
Marriage8.887.8530.5515.76