BPG is committed to discovery and dissemination of knowledge
Review
Copyright ©The Author(s) 2025.
World J Gastroenterol. Oct 28, 2025; 31(40): 111499
Published online Oct 28, 2025. doi: 10.3748/wjg.v31.i40.111499
Table 1 Take-home messages for practicing endoscopists based on current guidelines
Society
Current position on AI in colonoscopy
ESGE[16]Supports CADe use to reduce AMR, provided minimal false positives and no WT prolongation; allows CADx for “leave-in-situ”/“resect and discard”, if accuracy is equivalent to experts. Recommends AI integration to help standardize performance in less experienced endoscopists
BMJ[17]Recommends against routine CADe use due to concerns over overdiagnosis, minimal clinical benefit, and increased surveillance; recommendation graded as “weak”
AGA[19]Does not issue a definitive recommendation for or against CADe use, citing uncertainty in long-term outcomes
WEO[20]Acknowledges CADe/CADx potential but urges cost-effectiveness assessment; highlights the need for high-quality real-world studies across healthcare systems
Table 2 Summary of systematic reviews and meta-analyses evaluating the impact of artificial intelligence on detection rate metrics in colonoscopy[18,37,40,41,43,44,46,48,50,51,55-61,108-113]
Ref.
Year
Study type
Patient number
Number of studies
Primary outcome
Artificial intelligence vs conventional colonoscopy (95%CI)
Barua et al[37]2020Systematic review and meta-analysis43115 RCTsADR, PDR, APC, PPC, aAPCADR: 29.6% vs 19.3%; RR = 1.52 (1.31 to 1.77); PDR: 45.4% vs 30.6%; RR = 1.48 (1.37 to 1.60); No difference in aAPC; PPC: 0.93 vs 0.51; Mean difference 0.42 (0.33 to 0.50); APC: 0.41 vs 0.23; Mean difference 0.18 (0.13 to 0.22)
Aziz et al[108]2020Systematic review and meta-analysis28153 RCTsADR32.9% vs 20.8%; RR = 1.58 (1.39 to 1.80)
Mohan et al[109]2020Systematic review and meta-analysis49626 RCTsADR32.8% vs 21.1%; RR = 1.5 (1.30 to 1.72)
Ashat et al[110]2021Systematic review and meta-analysis50586 RCTsADR33.7% vs 22.9%; OR = 1.76 (1.55 to 2.00)
Hassan et al[56]2021Systematic review and meta-analysis43545 RCTsADR36.6% vs 25.2%; RR = 1.44 (1.27 to 1.62)
Deliwala et al[61]2021Systematic review and meta-analysis49966 RCTsADR, PDRADR: OR = 1.77 (1.50 to 2.08); PDR: OR = 1.91 (1.68 to 2.16)
Nazarian et al[51]2021Systematic review and meta-analysis5577 subjects for polyp detection with ADR and PDR48 studies (18 studies for polyp detection, 22 studies for polyp characterization and 8 studies for PDR)ADR, PDR, polyp characterizationPDR: OR = 1.75 (1.56 to 1.96); ADR: OR = 1.53 (1.32 to 1.77)
Li et al[59]2021Systematic review and meta-analysis43115 studiesADR, PDRPDR: OR = 1.91 (1.68 to 2.16); ADR: OR = 1.75 (1.52 to 2.01)
Zhang et al[60]2021Systematic review and meta-analysis54277 RCTsADR, PDRPDR: OR = 1.95 (1.75 to 2.19); ADR: OR = 1.72 (1.52 to 1.95)
Spadaccini et al[48]2021Systematic review and network meta-analysis3444550 RCTsADRCADe vs HD white-light endoscopy OR = 1.78 (1.44 to 2.18); CADe vs chromoendoscopy OR = 1.45 (1.14 to 1.85); CADe vs increased mucosal visualization systems OR = 1.54 (1.22 to 1.94)
Huang et al[41]2022Systematic review and meta-analysis662910 RCTsADR, PDRADR: 35.4% vs 24.9%; RR = 1.43 (1.33 to 1.53)/OR = 1.45 (1.32 to 1.59); PDR: 48.6% vs 33.8%; RR = 1.44 (1.35 to 1.53)/OR = 1.90 (1.70 to 2.11)
Shah et al[62]2023Systematic review and meta-analysis1092814 RCTsADR, PDRADR: OR = 1.52 (1.39 to 1.67); PDR: OR = 1.48 (1.37 to 1.61)
Hassan et al[57]2023Systematic review and meta-analysis1823221 RCTsADR, APC, aAPC, number of serrated lesions/colonoscopy, number of polypectomies for nonneoplastic lesions, WTADR: 44.0% vs 35.9%; RR = 1.24 (1.16 to 1.33)
Lou et al[58]2023Systematic review and meta-analysis2740433 RCTsADR, APCADR: RR = 1.242 (1.159 to 1.332); APC: IRR = 1.390 (1.277 to 1.513)
Adiwinata et al[50]2023Systematic review and meta-analysisNA13 RCTsADR, PDRPDR: OR = 1.46 (1.13 to 1.89); ADR: OR = 1.58 (1.37 to 1.82)
Shiha et al[43]2023Systematic review and meta-analysis1134012 RCTsADR41.4% vs 33%; RR = 1.26 (1.18 to 1.35)
Wei et al[111]2024Systematic review and meta-analysis1166012 studiesADRADR was statistically significantly higher with AI vs CC (36.3% vs 35.8%, RR = 1.13, 95%CI: 1.01 to 1.28)
Aziz et al[112]2024Systematic review and network meta-analysis6117294 RCTsADRAutofluorescence imaging: RR = 1.33 (1.06 to 1.66); Dye-based chromoendoscopy: RR = 1.32 (1.17 to 1.50); Endo cuff: RR = 1.19 (1.04 to 1.35); Endo cuff vision: RR = 1.26 (1.13 to 1.41); Endoring: RR = 1.30 (1.10 to 1.52); Flexible spectral imaging color enhancement: RR = 1.26 (1.09 to 1.46); Full-spectrum endoscopy: RR = 1.40 (1.19 to 1.65); High definition: RR = 1.41 (1.28 to 1.54); Linked color imaging: RR = 1.21 (1.08 to 1.36); Narrow band imaging: RR = 1.33 (1.18 to 1.48); Water exchange: RR = 1.22 (1.06 to 1.42); Water immersion: RR = 1.47 (1.19 to 1.82)
Soleymanjahi et al[18]2024Systematic review and meta-analysis3620144 RCTsAPC, ACNAPC: 0.98 vs 0.78; IRD = 0.22 (0.16 to 0.28); ACN: 0.16 vs 0.15; IRD = 0.01 (-0.01 to 0.02)
Patel et al[46]2024Systematic review and meta-analysis97828 non-randomized studiesADR44% vs 38%; RR = 1.11 (0.97 to 1.28)
Gangwani et al[113]2024Network meta-analysis2256026 studies (20 RCTs, 3 retrospective, 3 prospective studies)ADRAI vs single operator: RR = 1.1 (0.9 to 1.2)
Lee et al[44]2024Systematic review and meta-analysis1741324 RCTsADRTandem studies: RR = 1.18 (1.08 to 1.30); Parallel studies: RR = 1.26 (1.17 to 1.35); Overall ADR: RR = 1.24 (1.17 to 1.31)
Makar et al[40]2025Systematic review and meta-analysis2386128 RCTsADRADR: RR = 1.20 (1.14 to 1.27)
Spadaccini et al[55]2025Systematic review and meta-analysis542110 RCTsADR0.62 vs 0.52; RR = 1.19 (1.08 to 1.31)
Table 3 Summary of systematic reviews and meta-analyses evaluating the impact of artificial intelligence on miss rate metrics in colonoscopy[40,52,57,58]
Ref.
Year
Study type
Patient number
Number of studies
Primary outcome
Artificial intelligence vs conventional colonoscopy (95%CI)
Hassan et al[57]2023Systematic review and meta-analysis1823221 RCTsAMRAMR: 16% vs 35%; RR = 0.45 (0.35 to 0.58)
Lou et al[58]2023Systematic review and meta-analysis2740433 RCTsAMRAMR: RR = 0.495 (0.390 to 0.627)
Maida et al[52]2024Systematic review and meta-analysis17186 RCTsAMR, PMRPMR: 16.3% vs 38.1%; RR = 0.44 (0.33 to 0.60); AMR: 15.3% vs 34.1%; RR = 0.46 (0.38 to 0.55)
Makar et al[40]2025Systematic review and meta-analysis2386128 RCTsAMRAMR: RR = 0.45 (0.37 to 0.54)
Table 4 Summary of study characteristics evaluating artificial intelligence performance for invasion depth prediction and polyp characterization in colonoscopy
Ref.
Year
Study type
Image type
AI algorithm
Patient number training set
Patient number validation set
Patient number testing set
Primary outcome
Sensitivity (%)
Specificity (%)
Accuracy (%)
Luo et al[69]2021Single center retrospectiveWLIDeep learning556137Invasion depth Tis/T1a vs T1b/> T291.29191.1
Minami et al[70]2022Single center retrospectiveWLI, NBI, CCEDeep learning914956Submucosal invasion depth SM1 vs SM2/387.235.774.4
Lu et al[68]2022Multicenter retrospectiveWLI, NBI, BLIDeep learning305140Invasion depth LGD/HGD/IM/SM1 vs SM2/advanced CRC90.094.293.8
Nemoto et al[72]2023Multicenter retrospectiveWLIDeep learning1084400Invasion depth Tis/T1a vs T1b59.894.487.3
Tokunaga et al[73]12021Single center retrospectiveWLIDeep learning824211Invasion depth LGD/HGD/SM1 vs SM2/advanced CRC96.775.090.3
Nakajima et al[71]2022Multicenter retrospectiveWLIDeep learning31344Invasion depth Tis/T1a vs T1b81.087.084.0
Song et al[75]2020Single center retrospectiveNBIDeep learning624545Invasion depth SSP/BA/SM1 vs SM2/358.893.381.3
Lui et al[74]2019Single center retrospectiveWLI, NBIDeep learning165276Invasion depth polyps ≥ 2 cm adenoma/SM1 vs SM294.692.394.3
Yao et al[76]2023Multicenter retrospectiveWLI, IEEDeep learning339198Invasion depth large SSPs ≥ 10 mm78.896.290.4
Racz et al[66]2022Single center retrospectiveNBIMachine learning279Polyp characterization non-neoplastic vs neoplastic292.277.686.6
Ham et al[67]2025Single center retrospectiveWLIDeep learning2696476Polyp characterization low vs high-risk adenomas ≤ 10 mm375.695.793.8
Table 5 Practical tips for incorporating artificial intelligence into endoscopy training
Practical tips for trainees using AI in colonoscopy
Use AI as a complement, not a replacement for clinical judgment
Review false positive alerts to learn distinguishing features
Combine AI with feedback from supervisors
Practice with and without AI assistance
Incorporate AI into video-based self-review
Be aware of potential deskilling with overuse
Participate in structured training programs including AI modules