BPG is committed to discovery and dissemination of knowledge
Minireviews
©Author(s) (or their employer(s)) 2026.
Artif Intell Gastrointest Endosc. Mar 8, 2026; 7(1): 114426
Published online Mar 8, 2026. doi: 10.37126/aige.v7.i1.114426
Table 1 Machine learning models for the prediction of gastrointestinal bleed
Supervised machine learning
Classification task
Comment
Regression task
Comment
K-nearest neighboursData classification as per k-nearest neighbours, non-parametricGradient boosting modelA combination of weaker models (e.g., a decision tree) to create a stronger prediction model
XGBoost
LightGBM
CatBoost
Neural networkDeep learning method composed of interconnected layers of artificial neuronsSupport vector machineTechnique to categorise data points by finding an optimal hyperplane
ANN
CNN
Decision treeAn arranged tree in which internal nodes are attributes, branches are decisions, and leaves are outcomes or labelsRegression analysisUseful for predicting time-to-event outcomes, including covariates and event times, such as bleeding recurrence
Table 2 Evidence and efficacy of use of artificial intelligence in gastrointestinal bleed
Ref.
Primary focus
Data type
Study type
Study metrics
Key AI/ML method(s)
Comment
Raghareutai et al[31], 2025Pre-endoscopy risk stratificationDemograph, clinical and lab values, n = 1389Retrospective review of prospectively collected data, internal validationAUROC 0.74 to predict endoscopic intervention (0.81 in validation test set)linear discriminant analysisIdentify patients of AUGIB who need endoscopic intervention. Dynamic changes need multiple inputs of data
Shung et al[6], 2020Pre-endoscopy risk stratificationClinical and lab data, n = 1958Retrospective, multicentre, external validationML vs GBS; AUROC 0.90 vs 0.87 (external validation) sensitivity 100%, specificity 26%Gradient boosting machines (XGBoost)The ML model could identify low-risk patients who can be safely discharged
He et al[20], 2024Real-time endoscopic predictionEndoscopic images training data n = 3868, internal; validation data, n = 834; external validation data, n = 521Multicenter prospective; external validationAUC 0.80 in the validation data set, accuracy of 91.2%DCNN (image-based Forrest)The DCNN system showed more accurate and stable diagnostic performance than endoscopists in the prospective clinical comparison test
Bai et al[14], 2025Post-GI bleed mortality in cirrhosisClinical and lab data, n = 2467 cirrhoticsMulticenter prospective, international internal validation, no external validationAUC of 0.789 (up to 0.986 in LS-SVMR model)LS-SVMR (least squares support vector machine regression)CAGIB score is similar to CTP, MELD and MELD-Na for the prediction of mortality. No comparison with AIMS65, Rockall and GBS
Boros et al[8], 2025Post-GI bleed mortalityEHR registry (Hungarian GI bleed registry),
n = 1021
Retrospective, fivefold cross-validationXGBoost and CatBoost AUC 0.79 vs 0.62 of GBS in GIBXGBoost and CatBoostCatBoost reached a sensitivity of 78% and a specificity of 74%
Table 3 Key studies evaluating artificial intelligence/machine learning models in upper gastrointestinal bleeding
Ref.
Design/setting
Dataset size
Population/data type
Predictors/inputs
Algorithm/model
Comparator (if any)
Performance metrics (AUROC/sensitivity/specificity /PPV/NPV)
Validation type
Major limitations
Shung et al[6], 2020Retrospective, multicentre, external validation1958 patients (training + validation)Acute UGIB (clinical + lab data)Vital signs, Hb, BUN, comorbidities, transfusion needGradient boosting machine (XGBoost)GBS, AIMS65AUROC 0.90 vs 0.87 (GBS); sensitivity 100%, specificity 26%External (prospective cohort)Limited ethnic diversity; no imaging variables
Raghareutai et al[31], 2025Retrospective review of a prospectively collected dataset1389 casesNon-variceal UGIB; demographic, clinical + lab dataAge, vitals, Hb, BUN, SRHLinear discriminant analysis Rockall, GBSAUROC 0.74 (0.81 in validation test set)Internal + temporal validationSmall sample; limited external testing; static variables
Nazarian et al[11], 2024Multicentre retrospective cohort970 patientsAUGIB: Need for hemostatic therapyClinical + lab + endoscopic featuresRandom forest classifierGBS, RockallAUROC 0.84 vs 0.72 in GBSFive-fold cross-validationRetrospective bias; heterogeneous image quality
He et al[20], 2024Multicenter prospective; external validation3868 images from 1200 patientsPeptic ulcer bleed; Forrest classificationEndoscopic imagesDeep CNNEndoscopistAccuracy 91.2%; AUC 0.80 in validation data setExternal validationLimited to image data; no clinical integration
Yen et al[12], 2021Retrospective single-centre image analysis2738 images (2289 train, 449 test)Peptic ulcer bleedEndoscopic image featuresMobileNetV2 (CNN)Human endoscopistAUROC 0.91 vs 0.80 (human); sensitivity 94%; specificity 92% (3 class category)Internal (hold-out)Retrospective design; limited generalisability
Boros et al[8], 2025Retrospective EHR registry (Hungarian GI bleed registry)1021 recordsGI bleed (mixed aetiology)Demographics, labs, vitals, comorbiditiesXGBoost/CatBoost modelsGBS, RockallAUROC 0.79 in GIBleed, AUC 0.84 in mortality5-fold cross-validationRetrospective; national registry bias
Bai et al[14], 2025Multicenter prospective; international; internal2467 cirrhotic patients with UGIBCirrhosis with acute UGIBClinical + lab parametersLS-SVMR (least-squares SVM regression)Logistic regression modelAUROC 0.986External (prospective)Limited to the cirrhotic population
Levi et al[16], 2021Retrospective ICU database (MIMIC-III + eICU-CRD)Approximately 6000 ICU admissionsGI bleeding patients in the ICUDemographic + vital + lab seriesLSTM neural networkLogistic regressionAUROC > 0.80 in the MIMIC-III data setCross-dataset validationLimited prospective validation; EHR data noise