BPG is committed to discovery and dissemination of knowledge
Systematic Reviews
Copyright ©The Author(s) 2026.
World J Gastrointest Oncol. Feb 15, 2026; 18(2): 113870
Published online Feb 15, 2026. doi: 10.4251/wjgo.v18.i2.113870
Table 1 Summary of quality assessment based on Scale for the Assessment of Narrative Review Articles criteria
SANRA criterion
Summary of findings
Justification of article importanceHigh: Most studies clearly stated the clinical problem of HCC detection and the potential of DL
Statement of concrete aimsMedium/high: Modeling aims (e.g., classification accuracy) were usually clear; efficiency aims were sometimes less explicitly stated
Description of literature searchLow: A significant weakness across almost all primary studies; search strategies were rarely reported
Scientific accuracy and rigorMedium: Methods were mostly sound technically, but clinical validation rigor (prospective/multicenter) was often low
Discussion of limitationsMedium: Common limitations like small sample size were often acknowledged; discussion of bias or generalizability was less frequent
Quality of illustrations/reportingHigh: Most studies included high-quality figures of models, results, and attention maps. Efficiency metrics (e.g., model size, inference time) were not consistently reported
Table 2 Comprehensive summary of deep learning architectures for hepatocellular carcinoma studies
Ref.
Year
DL architecture
Application area
Modality/efficiency techniques
Key findings
Challenges addressed
Aatresh et al[96]2021LiverNet (CNN-like model)HistopathologyH&E stained liver histopathology imagesAccuracy of 90.93% on the proposed KMC liver dataset for distinguishing HCC subtypesDiagnosing low sub-type liver HCC tumors correctly
Hamm et al[71]2019CNNLesion classificationMulti-phasic MRI/CNNAUC 0.992 for lesion classificationUngrouped lesions make classification harder in clinical use
Ioannou et al[75]2020DL RNNRisk predictionEHR time-series/sequential data modelingAUC 0.76 for 3-year HCC risk, RNN models performed better in virologic responders (AUC 0.806)External validation, computational cost, etc.
Ma et al[77]2024Transformer dense CenterNet (TDCenterNet)Liver tumor detectionCT images/attention mechanismImproved capturing lesions with different sizesFeature deficiency, ignoring lesions with varies sizes, etc.
Deshpande et al[83]2024Hybrid DL (like CNN, ResNet50, EfficientNetb3, etc.)Grade classificationHistopathology/slide images> 97% accuracy for HCC grade classificationInterpretable histopathological images, high parameter count of the models
Qu et al[95]2025DL modelSubtype identificationDynamic contrast-enhanced MRIIdentified proliferative HCC subtype (AUC 0.94 for external test set)Limited labeled data utilization
Li et al[97]2025HTRecNet (hybrid)Differential diagnosisHistopathological data (including images)Early and accurate differentiation of HCC and CCA from normal liver tissues (accuracy of 0.97 in external testing)Computational resource requirements, integrating AI models into clinical practice, etc.
Wang et al[79]2025Gabor attention + transformerSegmentationCT/hierarchical monitoringEnhanced tumor segmentation accuracyMultimodal data fusion
He et al[98]2025AI/MLMolecular subtype discoveryMulti-omics integration/dimensionality reductionNovel molecular subtype identificationHeterogeneity of HCC, dimensionality, integration of data
He et al[106]2024DL (like UNet, swim transformer)Microvascular invasion and HCC predictionCT/clinical parameters/attention modulesPredicted microvascular invasion/HCC (AUC > 0.94)Clinical outcome stratification
Lei et al[121]2024Multimodal DLMicrovascular invasion predictionCT/MRI/feature fusionImproved microvascular invasion prediction (AUC 0.844-0.871)Data heterogeneity
Paproki et al[111]2024GANsData augmentationSynthetic data/data augmentationReduced bias in training datasetsData diversity and fairness
Xia et al[76]2024CRNNSurvival predictionMulti modal (CT scans + clinical variables)Improved survival prediction (AUC of 0.777 and 0.704 in the validation and test sets)Temporal-spatial analysis
Mitrea et al[92]2023CNN + traditional MLHCC detectionUltrasound/data augmentation> 98% accuracy for HCC detectionNoise reduction in low-quality images
Yasaka et al[72]2018CNNLesion classificationDynamic CT/standard CNNHigh accuracy distinguishing liver masses on CT (median AUC 0.92)Generalizability
Cao et al[73]2022CNNLesion differentiationNon-contrast CTHigh accuracy distinguishing HCC vs hemangioma on non-contrast CTHandling low-contrast images
Wang et al[74]2020SCCNNDifferential diagnosisCT/multi-branch fusionImproved HCC vs cholangiocarcinoma classification (AUC > 0.95)Multi-class lesion discrimination
Table 3 Taxonomy of deep learning architectures for hepatocellular carcinoma tasks: Mapping to data types and efficiency profiles
Primary task
Data modality
Suitable DL architectures
Efficiency profile & trade-offs
Lesion detection & diagnosisStatic US/CT/MRI sliceLightweight CNNs (MobileNet, EfficientNet), standard CNNsPareto-efficient: Lightweight CNNs are optimized for high speed and low cost with minimal accuracy loss. Standard CNNs offer a balance
Volumetric CT/MRI3D CNNs, 2.5D CNNs (processing slices sequentially)Trades speed for accuracy: 3D CNNs are accurate but computationally heavy. 2.5D/pseudo-3D approaches are a more efficient compromise
Multi-phase CT/MRIMulti-input/weight-sharing CNNs, transformer-CNN hybridsPareto-efficient: Multi-input CNNs efficiently fuse phase data. Hybrids use transformers to capture long-range dependencies between phases without a full transformer's cost
SegmentationCT/MRI volumesU-Net variants, transformer-based (e.g., UNETR)Trades speed for accuracy: U-Nets are relatively efficient. Pure transformers (e.g., UNETR) offer superior accuracy for complex shapes but are computationally intensive
Longitudinal risk predictionTabular EHR time-seriesRNNs (LSTM/GRU), transformers, lightweight MLContext-dependent: RNNs/transformers model time well but can be heavy. For simpler tasks, logistic regression/gradient boosting (non-DL) are often more efficient and perform similarly
Histopathology classificationWSIMIL + CNN, ViTPareto-efficient: MIL frameworks are inherently efficient, processing bags of image patches. New efficient ViT variants are emerging for WSI analysis
Multimodal fusionFused imaging + clinicalHybrid architectures (e.g., CNN branch for images, DNN for tabular data)Pareto-efficient: Hybrid designs effectively integrate data types. The efficiency is determined by the choice of the image backbone (e.g., using a lightweight CNN)
High-precision detectionCT/MRI (small lesions)Dense detectors, ViTTrades speed for accuracy: Models like ViT and complex detectors excel at finding small lesions due to global attention but have high computational demands