BPG is committed to discovery and dissemination of knowledge
Review
Copyright ©The Author(s) 2026.
World J Hepatol. Feb 27, 2026; 18(2): 114834
Published online Feb 27, 2026. doi: 10.4254/wjh.v18.i2.114834
Table 1 Summary of artificial intelligence applications for non-invasive assessment of fibrosis and cirrhosis
Input data type
AI tool
Conventional metric/tool outperformed
Key AI contribution
Evidence level
Primary limitation
Clinical/LaboratoryRF, LightGBM, ANNFIB-4 score and transient elastographyMore reliable prediction of fibrosis stage; development of novel indices (e.g., FIB-6, validated in multicenter cohorts)Moderate-high (multicenter validation available for FIB-6; large cohorts in MASLD studies) near clinical useSome models remain retrospective; limited external validation for several algorithms; interpretability constraints
Imaging (CT, MRI, US, elastography)DL (CNNs, ResNet50), RadiomicsExpert radiologists; elastography aloneEnhanced cirrhosis detection; automatic segmentation; identification of inflammation/fibrosis; distinction of etiologies; radiomics improves staging precisionModerate (several studies with external validation, but heterogeneous datasets), promisingData heterogeneity; many single-center cohorts; limited standardized imaging protocols; 'black box' interpretability
ECGDL (AI- cirrhosis-ECG score)Standard clinical evaluationLow-cost cirrhosis screening with high AUC (0.908); potential for routine, scalable screeningLow-moderate (retrospective, single-center), emergingLimited sample size; lack of external validation; implementation barriers despite low test cost
Table 2 Application of artificial intelligence in hepatocellular carcinoma diagnosis
Management stage
Data source/technology
AI application
Key benefit
Evidence level
Primary limitation
Risk stratificationClinical, viral (HCV), cirrhosis dataML models (outperform traditional regression)Superior prediction of HCC risk, even post-HCV eradication or in MASLDResearchRetrospective design; single center; limited generalizability; data heterogeneity; 'black box' nature
Imaging diagnosisRadiomics (CT, MRI)/DLClassification of benign vs malignant lesionsIdentifies indeterminate nodules and predicts MVIResearchValidation needed in diverse populations; 'black box' interpretability; generalizability across centers
PathologyH&E histological slidesDL/CNNsClassification of tumor subtypes and prediction of genetic mutations with near-perfect reliability ResearchDependence on quality of scanned slides; potential for algorithmic bias; 'black box' interpretability
Prognosis/treatmentImaging/multi-omics/clinical dataEstimation of survival and therapeutic responsePrediction of recurrence and response to locoregional therapies (e.g., TACE)ResearchNeed for broader multicenter validation; data heterogeneity; 'black box' nature; integration into clinical workflow
Table 3 Relevant applications of artificial intelligence in inflammatory bowel disease
Application area
Data source and type
AI application/technique
Key clinical benefit
Evidence level
Primary limitation
Non-invasive diagnosisFecal multi-omics ML models Accurate differentiation of healthy vs UC vs CDResearch/early clinicalData heterogeneity, limited external validation, small/retrospective datasets, lack of generalizability
Differential diagnosisEndoscopic imaging and clinical records (NLP)TextCNN/image analysisDistinguishes CD from intestinal tuberculosis and UC from CD ResearchSymptoms overlap and endoscopic similarities; limited data quality; need for external validation
Disease assessmentRadiomics and endoscopic videoDL modelsQuantifies inflammation, detects strictures, and automates endoscopic activity scoring improving standardizationResearchLimited data quality and standardization; lack of external validation; 'black box' nature
Histological predictionHistopathological slidesCNNs/DL (automating RHI, NHI, PHRI)Objective scoring and superior prediction of future flares and post-surgical recurrenceResearchHigh inter- and intra-observer variability in expert labeling; data quality; 'black box' interpretability
Therapeutic responseClinical, laboratory, multi-omics, endoscopy dataML predictive modelsPredicts response to biologic treatment, enabling timely therapy adjustmentResearch/early clinicalDisease heterogeneity; variable treatment responses; need for robust external validation; 'black box' interpretability
Risk stratificationPeripheral blood transcriptomics, histologyML/DL models (low vs high-risk groups)Predicts disease progression, need for treatment escalation, and post-surgical recurrence/complications (strictures/fistulas). Recurrence/complications (strictures/fistulas)ResearchNeed for larger, diverse datasets; limited external validation; potential algorithmic bias
Drug developmentTranscriptomic data (intestinal tissue)/Boolean networksIdentification of novel therapeutic targetsAccelerates the discovery of first-in-class therapies with novel mechanisms of actionResearchComplexity of biological systems; need for robust preclinical validation; 'black box' nature; ethical considerations
Clinical trialsPatient electronic medical recordsNLP/DL algorithmsAccelerates patient recruitment and potential use of "Digital Twins" Research/future prospectData privacy and security; regulatory frameworks; ethical concerns regarding "digital twins" and placebo groups
Table 4 Role of computer-aided detection technology in endoscopy
Technology
Function
AI action
Clinical output
Impact
Evidence level
Primary limitation
CADe (detection)Real-time lesion localizationProcesses endoscopic video stream and automatically highlights suspicious regions with visual overlaysIncreased detection ratePrevents missed lesions (e.g., sessile serrated lesions, early gastric cancer), particularly for less experienced endoscopistsClinical trial/commercialHigh rate of false positives leading to endoscopist fatigue; potential for increased procedure time; lack of generalizability across diverse populations
CADx (diagnosis)Real-Time lesion characterizationAnalyzes morphological and vascular patterns of a detected lesionClassification of lesions (adenomatous vs non-adenomatous) or activity grading in IBDFacilitates "Resect and Discard" or "Diagnose and Leave" strategies, reducing biopsy costs and timeClinical trial/commercialVariability in accuracy based on polyp location (e.g., proximal vs distal colon); 'black box' nature limiting clinician trust; need for external validation
Integrated systemInformed therapeutic decisionSimultaneous use of CADe and CADx in a single, non-interruptive workflowPrecise treatment planOptimizes workflow efficiency and standardizes quality of care across different operatorsClinical trial/commercialSeamless integration into diverse healthcare environments; high development/maintenance costs; 'deskilling' risk for endoscopists; unclear regulatory/ethical frameworks for AI-assisted decisions
Table 5 Evidence levels across major artificial intelligence applications in gastroenterology and hepatology
Domain/application
Evidence level
Key determinants
Endoscopy (CADe/CADx)HighMultiple FDA/CE-approved tools; prospective multicenter trials; real-time clinical use
Radiology (CT/MRI)Moderate-highExternal validation common; some multicenter cohorts; radiomics + DL pipelines
Non-invasive liver tests/fibrosisModerateMix of large cohorts + retrospective datasets; limited external validation except for FIB-6
HCC detection and surveillanceLow-moderateEarly-stage models; heterogeneous metrics; mostly retrospective; few external validations
IBD (imaging, histology)ModerateProspective validation for endoscopy/histology models; omics models still experimental
IBD multi-omics/transcriptomicsLowExperimental; small cohorts; no external validation
Capsule endoscopyModerateStrong DL performance; mostly single-center retrospective datasets
Motility testing/manometryLow-moderateEmerging field; small datasets; experimental DL approaches
Predictive models for complicationsLow-moderateMostly retrospective; internal validation only
Implementation/regulation-Not applicable (conceptual section)