BPG is committed to discovery and dissemination of knowledge
Observational Study
Copyright: ©Author(s) 2026.
World J Psychiatry. Apr 19, 2026; 16(4): 116428
Published online Apr 19, 2026. doi: 10.5498/wjp.v16.i4.116428
Table 1 Participant demographics

Depression group
Non-depression group
P value
OR (95%CI)
Number543228
Age (years), mean ± SD15.20 ± 1.6915.30 ± 1.670.460
Gender
Male126 (23.20)119 (52.19)< 0.0013.61 (2.60-5.01)
Female417 (76.80)109 (47.81)
Education
Junior high school242 (44.57)148 (64.91)< 0.0012.30 (1.67-3.17)
High school301 (55.43)80 (35.09)
Table 2 Comparative performance of multimodal models, mean ± SD

Accuracy
Precision
Recall
F1 score
AUC-ROC
AUC-PR
XGBoost0.95 ± 0.030.96 ± 0.030.97 ± 0.02a0.97 ± 0.02a0.99 ± 0.011.00 ± 0.00
Random forest0.94 ± 0.030.95 ± 0.030.96 ± 0.030.95 ± 0.020.98 ± 0.010.99 ± 0.01
Logistic regression0.94 ± 0.030.96 ± 0.030.96 ± 0.030.96 ± 0.020.98 ± 0.010.99 ± 0.01
Support vector machine0.94 ± 0.030.96 ± 0.030.95 ± 0.030.95 ± 0.020.98 ± 0.010.99 ± 0.01
Artificial neural networks0.74 ± 0.03c0.73 ± 0.02c1.00 ± 0.00c0.85 ± 0.01c0.78 ± 0.08c0.85 ± 0.06c
Table 3 Comparative performance of bimodal models

Accuracy
Precision
Recall
F1 score
AUC-ROC
AUC-PR
XGBoost0.93 ± 0.03c0.94 ± 0.04c0.96 ± 0.030.95 ± 0.02a0.96 ± 0.04a0.98 ± 0.03a
Random forest0.91 ± 0.040.92 ± 0.050.97 ± 0.030.94 ± 0.030.96 ± 0.030.98 ± 0.02
Logistic regression0.90 ± 0.040.91 ± 0.040.95 ± 0.040.93 ± 0.030.95 ± 0.030.97 ± 0.02
Support vector machine0.89 ± 0.040.88 ± 0.040.99 ± 0.010.93 ± 0.020.95 ± 0.040.97 ± 0.03
Artificial neural networks0.71 ± 0.00c0.71 ± 0.04c1.00 ± 0.00c0.83 ± 0.00c0.83 ± 0.06c0.92 ± 0.03c
Table 4 Statistical comparison between multimodal and bimodal extreme gradient boosting models

Multimodal
Bimodal
t value
P value
Cohen’s d
Effect size
AUC-ROC0.99 ± 0.010.96 ± 0.04 4.520.001.17Large
AUC-PR0.99 ± 0.000.98 ± 0.033.870.001.04Large
Accuracy0.95 ± 0.030.93 ± 0.032.950.010.76Moderate-to-large
Precision0.96 ± 0.030.94 ± 0.042.180.030.56Moderate
Recall0.97 ± 0.020.96 ± 0.031.510.140.39Small-to-moderate
F1 score0.97 ± 0.020.95 ± 0.022.950.010.74Moderate-to-large
Table 5 Performance stability: Multimodal vs bimodal extreme gradient boosting models

Levene’s test P value
Interpretation
AUC-ROC0.00Significant difference
AUC-PR0.00Significant difference
Accuracy0.39No significant difference
Precision0.55No significant difference
Recall0.04Significant difference
F1 score0.49No significant difference