Copyright
        ©The Author(s) 2021.
    
    
        World J Gastroenterol. Jun 7, 2021; 27(21): 2818-2833
Published online Jun 7, 2021. doi: 10.3748/wjg.v27.i21.2818
Published online Jun 7, 2021. doi: 10.3748/wjg.v27.i21.2818
            Table 1 Artificial intelligence applications in gastric cancer pathology
        
    | Ref. | Task | No. of cases/data set | Machine learning method | Performance | 
| Bollschweiler et al[79] | Prognosis prediction | 135 cases | ANN | Accuracy (93%) | 
| Duraipandian et al[80] | Tumor classification | 700 slides | GastricNet | Accuracy (100%) | 
| Cosatto et al[65] | Tumor classification | > 12000 WSIs | MIL | AUC (0.96) | 
| Sharma et al[21] | Tumor classification | 454 cases | CNN | Accuracy (69% for cancer classification), accuracy (81% for necrosis detection) | 
| Jiang et al[81] | Prognosis prediction | 786 cases | SVM classifier | AUCs (up to 0.83) | 
| Qu et al[82] | Tumor classification | 9720 images | DL | AUCs (up to 0.97) | 
| Yoshida et al[23] | Tumor classification | 3062 gastric biopsy specimens | ML | Overall concordance rate (55.6%) | 
| Kather et al[34] | Prediction of microsatellite instability | 1147 cases (gastric and colorectal cancer) | Deep residual learning | AUC (0.81 for gastric cancer; 0.84 for colorectal cancer) | 
| Garcia et al[30] | Tumor classification | 3257 images | CNN | Accuracy (96.9%) | 
| León et al[83] | Tumor classification | 40 images | CNN | Accuracy (up to 89.7%) | 
| Fu et al[32] | Prediction of genomic alterations, gene expression profiling, and immune infiltration | > 1000 cases (gastric, colorectal, esophageal, and liver cancers) | Neural networks. | AUC (0.9) for BRAF mutations prediction in thyroid cancers | 
| Liang et al[84] | Tumor classification | 1900 images | DL | Accuracy (91.1%) | 
| Sun et al[85] | Tumor classification | 500 images | DL | Accuracy (91.6%) | 
| Tomita et al[24] | Tumor classification | 502 cases (esophageal adenocarcinoma and Barret esophagus) | Attention-based deep learning | Accuracy (83%) | 
| Wang et al[86] | Tumor classification | 608 images | Recalibrated multi-instance deep learning | Accuracy (86.5%) | 
| Iizuka et al[22] | Tumor classification | 1746 biopsy WSIs | CNN, RNN | AUCs (up to 0.98), accuracy (95.6%) | 
| Kather et al[33] | Prediction of genetic alterations and gene expression signatures | > 1000 cases (gastric, colorectal, and pancreatic cancer) | Neural networks | AUC (up to 0.8) | 
            Table 2 Artificial intelligence applications in colorectal cancer pathology
        
    | Ref. | Task | No. of cases/data set | Machine learning method | Performance | 
| Xu et al[38] | Tumor classification: 6 classes (NL/ADC/MC/SC/PC/CCTA) | 717 patches | AlexNet | Accuracy (97.5%) | 
| Awan et al[87] | Tumor classification: Normal/Low-grade cancer/High-grade cancer | 454 cases | Neural networks | Accuracy (97%, for 2-class; 91%, for 3-class) | 
| Haj-Hassan et al[37] | Tumor classification: 3 classes (NL/AD/ADC) | 30 multispectral image patches | CNN | Accuracy (99.2%) | 
| Kainz et al[88] | Tumor classification: Benign/Malignant | 165 images | CNN (LeNet-5) | Accuracy (95%-98%) | 
| Korbar et al[36] | Tumor classification: 6 classes (NL/HP/SSP/TSA/TA/TVA-VA) | 697 cases | ResNet | Accuracy (93.0%) | 
| Yoshida et al[35] | Tumor classification | 1328 colorectal biopsy WSIs | ML | Accuracy (90.1%, adenoma) | 
| Alom et al[89] | Tumor microenvironment analysis: Classification, Segmentation and Detection | 21135 patches | DCRN/R2U-Net | Accuracy (91.1%, classification) | 
| Bychkov et al[42] | Prediction of colorectal cancer outcome (5-yr disease-specific survival). | 420 cases | Recurrent neural networks | HR of 2.3, AUC (0.69) | 
| Weis et al[90] | Evaluation of tumor budding | 401 cases | CNN | Correlation R (0.86) | 
| Ponzio et al[91] | Tumor classification: 3 classes (NL/AD/ADC) | 27 WSIs (13500 patches) | VGG16 | Accuracy (96 %) | 
| Kather et al[34] | Tumor classification: 2 classes (NL/Tumor) | 94 WSIs | ResNet18 | AUC (> 0.99) | 
| Kather et al[34] | Prediction of microsatellite instability | 360 TCGA- DX (93408 patches), 378 TCGA- KR (60894 patches) | ResNet18 | AUC: TCGA-DX—(0.77, TCGA-DX; 0.84, TCGA-KR) | 
| Kather et al[26] | Tumor microenvironment analysis: classification of 9 cell types | 86 WSIs (100000) | VGG19 | Accuracy (94%-99%) | 
| Kather et al[26] | Prognosis predictions | 1296 WSIs | VGG19 | Accuracy (94%-99%) | 
| Kather et al[26] | Prognosis prediction | 934 cases | Deep learning (comparison of 5 networks) | HR for overall survival of 1.99 (training set) and 1.63 (test set) | 
| Geessink et al[29] | Prognosis prediction, quantification of intratumoral stroma | 129 cases | Neural networks | HRs of 2.04 for disease-free survival | 
| Sena et al[40] | Tumor classification: 4 classes (NL/HP/AD/ADC) | 393 WSIs (12,565 patches) | CNN | Accuracy (80%) | 
| Shapcott et al[92] | Tumor microenvironment analysis: detection and classification | 853 patches and 142 TCGA images | CNN with a grid-based attention network | Accuracy (84%, training set; 65%, test set) | 
| Sirinukunwattana et al[31] | Prediction of consensus molecular subtypes of colorectal cancer | 1206 cases | Neural networks with domain-adversarial learning | AUC (0.84 and 0.95 in the two validation sets) | 
| Swiderska-Chadaj et al[93] | Tumor Microenvironment Analysis: Detection of immune cell, CD3+, CD8+ | 28 WSIs | FCN/LSM/U-Net | Sensitivity (74.0%) | 
| Yoon et al[39] | Tumor classification: 2 classes (NL/Tumor) | 57 WSIs (10280 patches) | VGG | Accuracy (93.5%) | 
| Echle et al[46] | Prediction of microsatellite instability | 8836 cases | ShuffleNet Deep learning | AUC (0.92 in development cohort; 0.96 in validation cohort) | 
| Iizuka et al[22] | Tumor classification: 3 classes (NL/AD/ADC) | 4036 WSIs | CNN/RNN | AUCs (0.96, ADC; 0.99, AD) | 
| Skrede et al[28] | Prognosis predictions | 2022 cases | Neural networks with multiple instance learning | HR (3.04 after adjusting for established prognostic markers) | 
            Table 3 Advantages and disadvantages of representative machine-learning methods in the development of artificial intelligence-models for gastrointestinal pathology
        
    | AI model | Advantages | Disadvantages | 
| Conventional ML (supervised) | User can reflect domain knowledge to features | Requires hand-crafted features; Accuracy depends heavily on the quality of feature extraction | 
| Conventional ML (unsupervised) | Executable without labels | Results are often unstable; Interpretability of the results | 
| Deep neural networks (CNN) | Automatic feature extraction; High accuracy | Requires a large dataset; Low explainability (Black box) | 
| Multi-instance learning | Executable without detailed labels | Requires a large dataset; High computational cost | 
| Semantic segmentation (FCN, U-Net) | Pixel-level detection gives the position, size, and shape of the target | High labeling cost | 
| Recurrent neural networks | Learn sequential data | High computational cost | 
| Generative adversarial networks | Learn to synthesize new realistic data | Complexity and instability in training | 
- Citation: Yoshida H, Kiyuna T. Requirements for implementation of artificial intelligence in the practice of gastrointestinal pathology. World J Gastroenterol 2021; 27(21): 2818-2833
- URL: https://www.wjgnet.com/1007-9327/full/v27/i21/2818.htm
- DOI: https://dx.doi.org/10.3748/wjg.v27.i21.2818

 
         
                         
                 
                 
                 
                 
                 
                         
                         
                        