BPG is committed to discovery and dissemination of knowledge
Retrospective Study
Copyright: ©Author(s) 2026.
World J Hepatol. May 27, 2026; 18(5): 119798
Published online May 27, 2026. doi: 10.4254/wjh.v18.i5.119798
Table 1 Baseline characteristics of the training and validation datasets, n (%)/median (interquartile range)
Variables
Training set (n = 768)
Validation set (n = 330)
P values
Gender (male/female)Male 473 (61.6)Male 211 (63.9)0.50
Female 295 (38.4%)Female 119 (36.1%)
Age (years)36.00 (30.00, 44.25)37.00 (30.25, 45.75)0.56
Weight (kg)61.00 (53.00, 69.00)61.80 (54.00, 69.00)0.85
Height (cm)167.00 (160.00, 171.00)168.00 (160.00, 173.00)0.27
Hepatitis B virus DNA (log10 IU/mL)5.90 (3.78, 7.63)6.14 (3.56, 7.62)0.59
Hepatitis B surface antigen (log10 IU/mL)3.59 (3.07, 4.22)3.57 (3.04, 4.32)0.76
Hepatitis B surface antibody (mIU/mL)0.19 (0.00, 1.10)0.10 (0.00, 1.00)0.21
Hepatitis B e antigen (COI)0.59 (0.11, 700.00)0.50 (0.17, 700.00)0.86
Hepatitis B e antibody (COI)0.32 (0.01, 13.39)0.22 (0.01, 7.99)0.26
Hepatitis B core antibody (COI)7.95 (0.01, 9.27)7.66 (0.01, 9.35)0.71
White blood cell (109/L)5.30 (4.50, 6.20)5.30 (4.40, 6.10)0.84
Neutrophil Abs. count (109/L)2.90 (2.40, 3.60)3.00 (2.40, 3.70)0.45
Hemoglobin (g/L)145.00 (132.00, 155.25)146.00 (132.25, 156.00)0.67
Platelet (109/L)202.00 (168.00, 239.00)195.00 (165.00, 229.00)0.02
Prothrombin time (seconds)13.10 (12.62, 13.70)13.10 (12.60, 13.70)0.91
International normalized ratio1.01 (0.97, 1.06)1.02 (0.97, 1.06)0.62
Albumin (g/L)44.00 (42.00, 46.00)44.00 (42.00, 46.00)0.52
Globulin (g/L)31.00 (28.00, 34.00)30.00 (27.00, 34.00)0.62
Total bilirubin (μmol/L)16.05 (11.80, 21.50)15.50 (12.10, 21.10)0.88
Direct bilirubin (μmol/L)4.85 (3.00, 7.20)4.80 (2.90, 7.47)0.92
Alanine aminotransferase (U/L)68.00 (44.00, 116.00)68.00 (47.00, 118.00)0.65
Aspartate aminotransferase (U/L)49.00 (27.75, 94.00)44.00 (25.00, 89.75)0.17
Gamma-glutamyl transferase (U/L)33.00 (21.00, 62.00)31.50 (21.00, 56.00)0.44
Alkaline phosphatase (U/L)63.00 (33.00, 83.00)61.00 (31.25, 81.00)0.64
Alpha-fetoprotein (ng/mL)2.95 (1.75, 5.55)2.79 (1.82, 4.64)0.51
Liver stiffness measurement (kPa)7.40 (5.60, 10.60)7.30 (5.50, 10.20)0.70
Table 2 Detailed model performance metrics for liver fibrosis diagnostic models
Metric
Liver stiffness-platelet ratio index
Fibrosis-4
AST to platelet ratio index
Gamma-glutamyl transpeptidase to platelet ratio
S-Index
Hui model
Age-male-ALP-platelets risk score
AST to platelet and age-gender model
Training data
AUROC0.840.650.660.700.710.750.690.71
95%CI lower0.800.610.620.660.670.710.650.66
95%CI upper0.870.690.700.740.750.780.740.75
UA0.060.090.080.080.080.080.090.09
Matthews correlation coefficient0.460.210.200.260.260.330.270.31
Youden index (%)52.9620.3822.8728.5830.0037.8731.1333.25
PPV (%)47.5540.5632.0840.3936.5440.2938.0444.83
NPV (%)92.5380.4485.0483.4385.7888.0885.5284.33
Sensitivity (%)82.4538.8372.8754.7968.6272.8765.9655.32
Specificity (%)70.5281.5550.0073.7961.3865.0065.1777.93
Accuracy (%)73.4471.0955.6069.1463.1566.9365.3672.40
TP (%)20.189.5117.8413.4116.8017.8416.1513.54
Validation data
AUROC0.830.650.610.710.710.740.700.73
95%CI lower0.780.580.540.640.640.670.630.67
95%CI upper0.880.720.680.780.780.790.760.80
UA0.100.140.130.130.130.120.130.13
Matthews correlation coefficient0.430.230.080.300.260.290.260.30
Youden index (%)49.6523.459.2631.7430.0233.7530.0231.72
PPV (%)44.9741.6727.9144.0036.6037.5036.6044.79
NPV (%)92.2281.2278.9883.8485.8087.5785.8083.69
Sensitivity (%)82.72 43.2159.2654.3269.1474.0769.1453.09
Specificity (%)66.94 80.2450.0077.4260.8959.6860.8978.63
accuracy (%)70.82 71.1252.2871.7362.9263.2262.9272.34
TP (%)20.36 10.6414.5913.3717.0218.2417.0213.07
Table 3 Performance of machine learning and deep learning models with 26 features and liver stiffness-platelet ratio index + 26 features on imbalanced dataset
Models
Area under the receiver operating characteristic curve
Youden
Positive predictive value (%)
Negative predictive value (%)
Accuracy (%)
True positives (n)
False positive (n)
True negative (n)
False negative (n)
Sensitivity (%)
Specificity (%)
Using 26 features
LogisticRegression0.850.580.570.920.8041.4031.60134.0012.400.770.81
SVM0.830.550.550.910.7940.4033.40132.2013.400.750.80
NuSVC0.810.520.510.910.7641.4040.80124.8012.400.770.75
DecisionTree0.700.400.540.850.7729.8025.60140.0024.000.550.85
ExtraTree0.630.270.440.820.7224.8032.00133.6029.000.460.81
GaussianNB0.800.490.520.890.7738.2035.80129.8015.600.710.78
GradientBoosting0.880.630.600.930.8243.2029.00136.6010.600.800.82
HistGradientBoosting0.870.620.550.940.7945.2037.00128.608.600.840.78
AdaBoost0.860.610.540.950.7846.2041.60124.007.600.860.75
RandomForest0.870.620.580.930.8144.0032.00133.609.800.820.81
KNeighbors0.770.430.470.890.7238.0046.40119.2015.800.710.72
KAN0.830.550.560.910.7940.8034.20131.4013.000.760.79
NeuralNetwork0.840.570.550.920.7842.8036.80128.8011.000.800.78
Using liver stiffness-platelet ratio index + 26 features
LogisticRegression0.850.580.580.910.8140.8029.00136.6013.000.760.82
SVM0.830.550.540.910.7841.0034.80130.8012.800.760.79
NuSVC0.810.530.540.910.7740.4037.20128.4013.400.750.78
DecisionTree0.670.350.510.840.7626.8025.00140.6027.000.500.85
ExtraTree0.660.320.480.830.7526.4028.40137.2027.400.490.83
GaussianNB0.810.520.580.890.8037.2027.80137.8016.600.690.83
GradientBoosting0.870.620.560.940.8044.8034.80130.809.000.830.79
HistGradientBoosting0.870.610.530.940.7846.0041.40124.207.800.860.75
AdaBoost0.850.580.590.920.8042.0033.00132.6011.800.780.80
RandomForest0.860.640.620.930.8243.6028.80136.8010.200.810.83
KNeighbors0.780.420.470.880.7237.0044.20121.4016.800.690.73
KAN0.840.580.550.920.7942.6035.80129.8011.200.790.78
NeuralNetwork0.850.590.590.920.8141.0028.80136.8012.800.760.83


Write to the Help Desk