BPG is committed to discovery and dissemination of knowledge
Review
Copyright ©The Author(s) 2025.
World J Gastroenterol. Sep 28, 2025; 31(36): 111137
Published online Sep 28, 2025. doi: 10.3748/wjg.v31.i36.111137
Table 1 Convolutional neural network architectures adapted for medical image analysis
Ref.
Architecture
Key strengths
Limitations
Applications
[39]AlexNetIntroduced rectified linear unit, dropout, graphics processing unit accelerationOverfitting on small datasetsHistopathology image classification
[40]VGGNetUniform structure, easy to implementLarge number of parameters, memory-intensivePolyp detection, organ segmentation
[41]GoogLeNetMulti-scale feature extraction, fewer parameters than VGGComplex architecture, harder to modifyLesion classification, colonoscopy image analysis
[42]ResNetResidual connections solve vanishing gradientCan overfit if dataset is smallDetection of GI tumors, segmentation of ulcers
[43]U-NetExcellent for biomedical segmentation, works with few imagesLimited to segmentation tasksPolyp segmentation, mucosal layer delineation
[44]DenseNetStrong gradient flow, parameter-efficientComputationally intensiveEndoscopic image classification, disease grading
[45]Attention U-NetIncorporates attention for better focus on relevant regionsIncreased complexity and longer training timeIBD severity scoring; small bowel bleeding detection
[46]EfficientNetOptimized trade-off between accuracy and speedRequires careful scaling and tuningLightweight, mobile-compatible GI image classification
[47]Swin-CNNWindow-based attention + CNN; balances local and global featuresComplex design; tuning is more demandingGI endoscopy video anomaly detection; GI tumor recognition
[48]TransUNetCombines CNN for feature extraction and Transformer for context modelingResource intensive; slower trainingComputed tomography/magnetic resonance imaging organ segmentation; GI lesion boundary detection
[49]MedT (medical transformer)Pure transformer-based; excellent at long-range dependency modelingNot optimal for small datasets; data hungryIntestinal lesion segmentation; colorectal cancer prediction
[50]ConvNeXtCombines CNN stability with Transformer-like design; efficient trainingRelatively new; limited ecosystem maturityMulti-organ classification; tumor region detection
Table 2 Application of convolutional neural networks for lesion detection and classification in gastrointestinal diseases
Data
Architecture
Application
Key findings
Ref.
Endoscopic imagesDeep CNN with high-resolution endoscopic inputAids in detecting laterally spreading tumors in the colon using deep learning to enhance endoscopic visualization accuracyThe CNN model significantly improved the detection rate of laterally spreading tumors compared to conventional endoscopy, reducing oversight and potentially improving early colorectal cancer diagnosis. Accuracy increased notably in identifying flat lesions that are otherwise difficult to detect manually[64]
GI endoscopy imagesDeep CNN trained on large annotated image datasetDesigned to detect and classify a wide range of GI tract diseases in real-time using endoscopic imageryThe model achieved high diagnostic accuracy across multiple disease categories, outperforming traditional classifiers. It demonstrated strong potential for integration into clinical decision support tools, increasing consistency and efficiency in endoscopic diagnoses[65]
Wireless capsule endoscopy imagesCNN with real-time image processing pipelineSupports non-invasive detection of GI diseases using capsule endoscopy video sequencesAchieved expert-level accuracy in detecting abnormal lesions in small bowel images. This system allows for automated and accurate screening, significantly reducing time spent by clinicians reviewing lengthy capsule footage[66]
White-light endoscopyCNN model trained on early gastric cancer casesEarly detection of gastric cancer through white-light endoscopy imagery with AI assistanceThe model enabled early cancer identification with high sensitivity and specificity, improving upon the detection rates of endoscopists. Its implementation could lead to reduced gastric cancer mortality by facilitating prompt treatment decisions[67]
Endoscopic imagesCNN with multi-center training dataDetecting Helicobacter pylori infection from endoscopy imagesThe CNN demonstrated high performance and generalizability across institutions. It offers a non-invasive, fast diagnostic method, comparable to biopsy confirmation, enabling real-time feedback during endoscopic exams[68]
Gastroscopy imagesDeep dense CNN with channel attention mechanismAutomated classification of esophageal disease types (e.g., ulcers, neoplasms)By incorporating attention modules, the model excelled in distinguishing subtle texture differences among esophageal pathologies, showing great promise in supporting less experienced endoscopists[58]
Capsule endoscopy imagesCNN for object detectionAutomatic detection of small intestinal hookwormsThe proposed system achieved high precision in detecting parasitic lesions, helping clinicians quickly identify the presence of hookworm infections, which are often missed due to subtle image features[69]
Capsule endoscopy imagesCNN trained on hemorrhagic potential lesionsIdentification and differentiation of small bowel lesions with bleeding riskAccurately categorized lesions by bleeding risk, offering clinicians critical insight for prioritizing cases. The system reduces subjectivity in evaluating lesion severity[70]
Endoscopic imagesDeep CNN classifier for esophageal lesion typesDifferentiation of protruding esophageal lesions (e.g., GI stromal tumors, leiomyomas)CNN provided more accurate differential diagnosis than generalist endoscopists. Can support pre-biopsy decision-making, potentially reducing unnecessary interventions[71]
Endoscopic imagesTransfer CNN architecture (e.g., ResNet-based)Detection of early gastric cancerTransfer learning from pre-trained CNNs enabled rapid training on limited labeled data while maintaining strong detection performance. Highlights usefulness of transfer learning in GI imaging where large datasets are rare[72]
Endoscopic imagesFusion of multiple CNNs via fuzzy Minkowski distanceMulti-class classification of GI disordersThis fuzzy fusion method significantly boosted CNN decision reliability in classifying various GI tract diseases. Particularly effective in handling noisy or borderline cases by leveraging ensemble confidence[73]
Double-balloon enteroscopyCustom deep CNN with lesion region focusingDetecting and classifying small bowel lesions (e.g., ulcers, tumors) during double-balloon enteroscopyAchieved high sensitivity for subtle and rare lesions in the small intestine. Helped reduce diagnostic time and enabled detection of hard-to-access areas, improving double-balloon enteroscopy clinical utility[74]
Endoscopic imagesCNN with transfer learning (e.g., ResNet pretrained on ImageNet)Detecting GI abnormalities in multi-center datasetsTransfer learning approach enhanced classification accuracy with limited training data. It proved robust across image variations, useful in practical endoscopy deployments with different imaging devices[75]
GI tract imagesDeep CNN with region proposal and hierarchical labelingClassification of syndromes in the GI tract (e.g., inflammation, bleeding, neoplasms)Outperformed traditional classifiers with over 90% accuracy on several syndrome categories. System demonstrated early signs of suitability for autonomous decision-making in AI-assisted endoscopy[76]
Capsule endoscopy imagesCNN with spatial attention and region localizationDetection and precise localization of GI diseases in capsule footageThe model not only identified lesions but also marked their spatial location, reducing time required for manual review. Showed high generalization across bleeding, polyps, and ulcers[53]
Endoscopic imagesMulti-fusion CNN with spatial and feature fusionDiagnosis of complex GI disease states through multiple feature pathsFusion approach combined texture, edge, and color patterns from multiple CNN streams, improving classification of ambiguous or compound lesions. Enabled more granular diagnosis[77]
Capsule endoscopy imagesDeep multi-class CNN classifierDistinguishing ulcerative colitis, polyps, and dyed-lifted polypsThe model achieved strong performance on multi-label lesion classification, aiding clinicians in identifying inflammation patterns vs neoplastic changes from wireless capsule endoscopy images[56]
Endoscopic imagesDeep CNN for histological predictionDetection of chronic atrophic gastritisThe system outperformed general practitioners in identifying chronic atrophic gastritis on white-light images. This could enable earlier intervention for patients at risk of gastric cancer[78]
Endoscopic imagesCNN-based diagnostic prediction modelClassification of multiple GI disorders including gastritis and neoplasmsThe system offered a probabilistic diagnosis output that correlated well with clinical reports. Could serve as a second reader to reduce misdiagnosis[79]
Endoscopic imagesDeep CNN trained on villous atrophy casesDetection of duodenal villous atrophy, a hallmark of celiac diseaseModel enabled accurate identification of atrophic mucosal features that are often missed by the naked eye. Can improve non-invasive screening for celiac disease[80]
Endoscopic imagesCNN with lesion localization layerEarly gastric cancer classification and localizationThe system not only detected cancer but pinpointed lesion boundaries, assisting in targeted biopsies. Shown to reduce oversight in flat lesions[81]
Endoscopic imagesDeep CNN prospectively testedDiagnosis of chronic atrophic gastritisProspective validation showed significant improvement in diagnosis rate compared to conventional methods. Demonstrated practical deployment readiness[82]
Endoscopic imagesCNN trained and validated against multiple architecturesDetection of gastric mucosal lesionsCompared multiple CNN methods and showed the proposed model performed best in both sensitivity and specificity. Suggested a robust preprocessing pipeline[83]
Endoscopic imagesDeep CNN with classification and detection modulesBroad GI disease detection and classification, including ulcers and tumorsThe CNN model integrated multiple diagnostic modules, enabling both lesion classification and visual confirmation. It significantly outperformed traditional rule-based systems and provided explainable outputs to support clinical decisions[84]
Endoscopic imagesDeep CNN trained on annotated gastric polyp datasetsDetection of gastric polyps with high sensitivityThe model effectively identified polyps, including flat and small ones. It helped reduce miss rates in routine gastroscopy and can act as a real-time assistant for novice endoscopists[54]
Endoscopic imagesEnsemble machine learning with CNN and decision tree comparisonsGI disease classification from diverse image typesThough primarily a comparative study, CNNs were shown to outperform other models in most lesion categories. Highlighted CNNs’ suitability for complex image recognition tasks in GI settings[85]
Endoscopic imagesDeep convolutional architecture trained on expert-labeled dataDiagnosis of common gastric lesions including gastritis and erosionsDemonstrated high diagnostic accuracy and reduced inter-observer variability, suggesting CNN use can help standardize endoscopic assessments across institutions[86]
Capsule endoscopyDeep CNN with binary output modeBinary lesion detection (normal vs abnormal) in capsule endoscopyThe binary CNN model achieved excellent sensitivity in identifying abnormal frames, acting as an efficient pre-filter to aid clinicians reviewing thousands of capsule images[87]
Endoscopic imagesDeep CNN tested across multiple centersGastritis classification using AI comparable to gastroenterologistsAchieved expert-level performance in classifying gastritis subtypes. Multicenter validation proved its robustness across varied endoscopic systems and clinical environments[20]
Endoscopic imagesAttention-guided CNN with feature weightingGI disease classification enhanced with attention mechanismsAttention modules enabled the model to focus on key lesion areas, improving classification accuracy for overlapping or complex cases. Boosted explainability in predictions[62]
Wireless capsule endoscopy imagesPretrained CNN fine-tuned on wireless capsule endoscopy imagesGI tract disease classification from capsule endoscopyThe transfer learning-based CNN adapted well to wireless capsule endoscopy image features. It accurately classified disease regions, even under motion blur and illumination changes[88]
Endoscopic imagesGI-Net: CNN with anomaly detection capabilityMulti-label classification of anomalies in GI tractGI-Net offered near real-time classification across numerous disease types, showing strong performance in clinical pilot tests. Especially useful in large-scale screenings[89]
Endoscopic imagesDeep CNN trained on Helicobacter pylori-positive and negative imagesEvaluation of Helicobacter pylori infectionModel achieved diagnostic accuracy comparable to experienced gastroenterologists. Could be integrated into live endoscopy for on-the-spot infection prediction[90]
Gastric endoscopic imagesDeep CNN optimized for malignancy detectionClassification of gastric malignanciesCNN showed excellent discrimination between benign and malignant lesions. Real-time feedback capability demonstrated potential for improving early gastric cancer outcomes[91]
Endoscopic imagesCNN for cancer detection trained on large datasetsAutomatic gastric cancer detection from endoscopyHigh sensitivity and specificity achieved. Could act as an AI “second observer” for difficult-to-see lesions during live endoscopy[92]
Endoscopic imagesDeep learning model with Helicobacter pylori-specific feature extractionHelicobacter pylori diagnosis using AI interpretation of mucosal patternsEnabled real-time assessment of infection risk without biopsy, saving time and cost. Also reduced patient discomfort by avoiding invasive tests[93]
GI tract imagesEnsemble of deep CNN + texture feature classifiersGI abnormality detection from diverse image typesFusion of deep and handcrafted features improved robustness in challenging image conditions. Showed enhanced adaptability across multiple GI lesion types[94]
Capsule endoscopy imagesCNN tailored for parasitic structure detectionHookworm detection in small bowelModel achieved very high detection rate and reduced review time. Critical in endemic regions where hookworm infections are often overlooked[95]
Endoscopic imagesCNN optimized for mucosal pattern recognitionDiagnosis of Helicobacter pylori infection based on endoscopic imagesEnhanced early infection detection through image-only methods. Potentially replaces biopsy in low-resource settings[96]
Endoscopic imagesCNN using hierarchical feature fusionLesion detection in GI endoscopyFeature fusion helped distinguish similar lesions, improving accuracy. Suitable for deployment in routine screening environments[97]
Endoscopic imagesDeep CNN with blind-spot awarenessEarly gastric cancer detection in real-timeThe model avoided blind spots, a common issue in manual diagnosis, reducing miss rate for flat and depressed lesions. Demonstrated strong potential for real-time applications[98]
Capsule endoscopy imagesNeural network with small-bowel angiectasia trainingDetection of GI angiectasiaAchieved accurate identification of angiectasias with high sensitivity. Provided a non-invasive method to detect obscure GI bleeding causes, potentially reducing missed diagnoses during manual video review[99]
Capsule endoscopy imagesCNN-based ulcer detection systemAutomated ulcer detection in small bowelThe CNN model effectively distinguished ulcers from normal mucosa, even in poor lighting or with capsule motion. Enhanced early diagnosis of Crohn’s disease and non-steroidal anti-inflammatory drug-induced ulcers[100]
Endoscopic imagesCNN architecture tailored for Helicobacter pylori featuresEvaluation of Helicobacter pylori infection statusThe model achieved performance comparable to histology-based diagnosis. Useful for non-invasive, in-procedure assessment, aiding in same-visit treatment planning[101]
Endoscopic imagesCNN trained on gastric tumor categoriesGastric neoplasm classification (e.g., adenomas, carcinomas)Outperformed junior endoscopists in differentiating benign vs malignant neoplasms. Accelerated workflow and supported biopsy decision-making[102]
Endoscopic imagesDeep learning classifier with ensemble optimizationGI disease recognition in endoscopic imagesHandled a wide range of disease types with strong classification performance. Showed potential for integration into automated report generation systems[103]
Capsule endoscopy imagesDeep CNN with binary detection of ulcerationDetection of erosions and ulcersDemonstrated over 90% accuracy in detecting clinically relevant ulcerative lesions. Reduced time spent reviewing wireless capsule endoscopy footage by half[104]
Biopsy imagesDeep CNN for histopathological image classificationDetection of diseases (e.g., gastritis, carcinoma) in GI biopsy samplesModel classified histopathological lesions with high fidelity. Useful in screening large volumes of slides, especially in resource-constrained pathology labs[105]
Endoscopic imagesDeep CNN architecture for visual pathology classificationGastric pathology detection (e.g., erosions, polyps, cancer)Showed robust multi-label classification capacity across common gastric pathologies. Aided endoscopists in characterizing mucosal abnormalities with high sensitivity[106]
Barium X-ray radiographyCNN trained on radiograph featuresGastritis detection from double-contrast GI seriesBrought deep learning to a classic imaging modality. Enabled detection of gastritis without the need for endoscopy, useful in areas lacking access to scopes[107]
Endoscopic imagesCNN classifier for esophageal cancerEsophageal cancer diagnosis using endoscopyCNN matched the diagnostic accuracy of experienced clinicians. Rapid lesion classification suggested potential use in screening and early detection programs[108]
Capsule endoscopy imagesDeep learning classifier trained with expert inputSmall-bowel disease and variant classificationDemonstrated gastroenterologist-level accuracy across a wide range of small-bowel findings. Significantly improved efficiency and reliability in interpreting long capsule videos[109]
Table 3 Applications of convolutional neural networks for lesion segmentation and disease severity assessment in gastrointestinal diseases
Data
Architecture
Application
Key findings
Ref.
Double-balloon endoscopy imagesDeep CNN with lesion segmentation and severity scoringAutomated detection and severity grading of Crohn’s ulcers from double-balloon endoscopyThe model accurately segmented ulcers and assessed their severity, enabling objective Crohn’s Disease monitoring. Outperformed traditional scoring methods and showed potential to reduce inter-observer variability[125]
Capsule endoscopy imagesCNN segmentation network trained on angiodysplasiasAutomated segmentation of vascular lesions in small bowel imagesProvided accurate lesion boundary identification for angiodysplasias, facilitating hemorrhage risk stratification. Supports quicker and more consistent diagnosis compared to manual review[126]
Colon capsule endoscopyCNN trained to detect blood and mucosal lesionsSimultaneous detection and segmentation of bleeding and mucosal abnormalitiesEnabled precise localization of lesions and bleeding points in colon capsule footage. Improved lesion coverage and reduced diagnostic delay[127]
Endoscopic imagesCNN trained to grade UC severityAutomated assessment of UC severity from colonoscopyModel achieved expert-level grading accuracy across multiple severity stages. Significantly reduced inter-observer bias, suggesting suitability for clinical trial endpoints[128]
Colonoscopy imagesCNN-based model for pattern recognitionDifferentiation of Crohn’s disease vs UC from colonoscopyThe system accurately distinguished UC and Crohn’s disease patterns, providing real-time decision support for disease type classification[114]
Endoscopic imagesDeep learning classifier for depth predictionPredicting submucosal invasion in gastric neoplasmsModel reliably estimated invasion depth, reducing unnecessary surgical intervention. Useful in pre-treatment risk stratification[129]
Endoscopic imagesCNN for multi-feature predictionPrediction of early gastric cancer, invasion depth, and differentiationAI model outperformed experts in cancer invasion depth and differentiation. Enabled non-invasive yet accurate diagnosis during endoscopy[130]
Capsule endoscopy imagesDeep neural network tailored to stricture detectionDetection of Crohn’s-related intestinal stricturesThe model improved the detection of strictures, which are often missed in manual review. Accelerates diagnosis and may guide therapeutic decisions[131]
Confocal laser endomicroscopy imagesCNN trained for mucosal healing assessmentConfirmation of mucosal healing in Crohn’s diseaseEnabled fine-grained assessment of healing vs inflammation. Provided high-resolution insight for assessing treatment efficacy[132]
Endoscopic imagesAI-assisted classification modelUC disease activity scoring using Mayo classificationProvided consistent and reproducible activity scores. Reduced assessment variability, improving clinical and research utility[133]
Endoscopic imagesCNN vs human graders comparisonGrading UC severity from colonoscopyCNN showed equal or superior performance to human reviewers in grading UC. Suggested for use in high-throughput settings or trials[134]
Capsule endoscopy videosCNN for quantitative feature extractionQuantitative analysis of celiac disease lesionsModel extracted and quantified villous atrophy and mucosal abnormalities. Offered an objective metric to monitor disease progression[135]
Conventional endoscopy imagesCNN trained on histological labelsPredicting invasion depth of gastric cancerCNN-assisted depth prediction provided decision support for therapy planning. Could reduce need for unnecessary biopsies[136]
Table 4 Convolutional neural network-based real-time artificial intelligence support and workflow integration in gastrointestinal endoscopy
Data
Architecture
Application
Key findings
Ref.
Gastroscopy imagesReal-time anatomical classification with CNNReal-time anatomical recognition during gastroscopy proceduresCNN accurately classified anatomical positions in real-time, reducing mislabeling and improving procedural documentation. Enabled smoother workflow integration with minimal latency[150]
Gastroscopy videosFrame-wise CNN classificationAutomated disease detection during endoscopy video reviewThe system provided real-time frame classification, improving detection of GI abnormalities in long video sequences and assisting in efficient case triage[151]
White-light endoscopyCNN for real-time Helicobacter pylori statusImmediate assessment of Helicobacter pylori infection during endoscopyReal-time feedback from the CNN allowed on-the-spot therapeutic decision-making and eliminated delays caused by biopsy processing[113]
Esophagogastroduodenoscopy videosCNN-based detection system integrated with endoscopeReal-time detection of gastroesophageal varicesThe system was validated across multiple centers and significantly reduced miss rates of varices in real-time, enhancing early intervention[152]
Colonoscopic imagesDeep CNN classifier for anatomical site recognitionAutomated real-time anatomical classification of colon imagesCNN correctly labeled anatomical sites, reducing reliance on user memory and ensuring consistent documentation. Improved novice performance[153]
Upper GI endoscopyCNN detection assistant in randomised controlled trialDetection of gastric neoplasms during routine proceduresA deep learning-based system reduced miss rate of gastric neoplasms significantly in a randomized controlled trial. Validated real-world clinical benefit[154]
Upper GI anatomy imagesMulti-task CNN modelSimultaneous detection of anatomical landmarks and structuresModel performed both classification and segmentation, improving navigation and assisting less experienced users in orientation[155]
Esophagogastroduodenoscopy imagesCNN trained on large anatomical datasetReal-time classification of esophagogastroduodenoscopy anatomyCNN classified images with expert-level accuracy, aiding report generation and training in real-time[141]
GI endoscopy videosCNN vs global feature comparisonReal-time disease detection efficiency benchmarkingCNNs proved significantly more accurate than traditional global features. Demonstrated readiness for real-time clinical deployment[156]
GI tract videosCNN with automated reporting moduleEnd-to-end disease detection and report generationAutomatically generated reports based on real-time CNN classification, reducing documentation time and enhancing standardization[157]