BPG is committed to discovery and dissemination of knowledge
Retrospective Study
Copyright ©The Author(s) 2026. Published by Baishideng Publishing Group Inc. All rights reserved.
World J Gastroenterol. Jan 7, 2026; 32(1): 112090
Published online Jan 7, 2026. doi: 10.3748/wjg.v32.i1.112090
Predicting lymph node metastasis in colorectal cancer using case-level multiple instance learning
Ling-Feng Zou, Xuan-Bing Wang, Jing-Wen Li, Xin Ouyang, Yi-Ying Luo, Yan Luo, Cheng-Long Wang
Ling-Feng Zou, Jing-Wen Li, Xin Ouyang, Yi-Ying Luo, Cheng-Long Wang, Department of Pathology, Chongqing Traditional Chinese Medicine Hospital, Chongqing 400021, China
Xuan-Bing Wang, Department of Pathology, Chongqing Jiulongpo People's Hospital, Chongqing 400050, China
Xuan-Bing Wang, Laboratory of Tissue and Cell Biology, Lab Teaching & Management Center, Chongqing Medical University, Chongqing 400016, China
Yan Luo, Department of Stomatology, The People's Hospital of Dadukou District, Chongqing 400084, China
Co-first authors: Ling-Feng Zou and Xuan-Bing Wang.
Co-corresponding authors: Yan Luo and Cheng-Long Wang.
Author contributions: Wang CL contributed to conceptualization, methodology, validation, resources, writing review and editing, supervision, funding acquisition; Luo Y contributed to conceptualization, methodology, resources, data curation, writing review and editing, supervision, project administration; Zou LF contributed to methodology, validation, formal analysis, investigation, data curation, writing original draft; Wang XB contributed to methodology, software, formal analysis, investigation, resources, data curation, writing original draft, visualization; Li JW contributed to methodology, resources, data curation, project administration; Ouyang X contributed to resources, data curation, project administration; Luo YY contributed to software, resources, visualization, project administration.
Supported by Chongqing Medical Scientific Research Project (Joint Project of Chongqing Health Commission and Science and Technology Bureau), No. 2023MSXM060.
Institutional review board statement: This study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of Chongqing Traditional Chinese Medicine Hospital, No. 2025-IIT-KS-7.
Informed consent statement: The requirement for informed consent was waived by the institutional review board due to the retrospective nature of the study.
Conflict-of-interest statement: The authors declare no conflicts of interest.
Data sharing statement: The data that support the findings of this study are available on request from the corresponding author. The computer code developed and utilized for this research is accessible in the following public repository: https://github.com/Patho-Lab/Colorectal_cancer_DL.
Open Access: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: https://creativecommons.org/Licenses/by-nc/4.0/
Corresponding author: Cheng-Long Wang, MD, PhD, Department of Pathology, Chongqing Traditional Chinese Medicine Hospital, No. 6 Panxi 7 Branch Road, Jiangbei District, Chongqing 400021, China. qq171909771@gmail.com
Received: July 17, 2025
Revised: July 30, 2025
Accepted: November 27, 2025
Published online: January 7, 2026
Processing time: 172 Days and 10.8 Hours
Abstract
BACKGROUND

The accurate prediction of lymph node metastasis (LNM) is crucial for managing locally advanced (T3/T4) colorectal cancer (CRC). However, both traditional histopathology and standard slide-level deep learning often fail to capture the sparse and diagnostically critical features of metastatic potential.

AIM

To develop and validate a case-level multiple-instance learning (MIL) framework mimicking a pathologist's comprehensive review and improve T3/T4 CRC LNM prediction.

METHODS

The whole-slide images of 130 patients with T3/T4 CRC were retrospectively collected. A case-level MIL framework utilising the CONCH v1.5 and UNI2-h deep learning models was trained on features from all haematoxylin and eosin-stained primary tumour slides for each patient. These pathological features were subsequently integrated with clinical data, and model performance was evaluated using the area under the curve (AUC).

RESULTS

The case-level framework demonstrated superior LNM prediction over slide-level training, with the CONCH v1.5 model achieving a mean AUC (± SD) of 0.899 ± 0.033 vs 0.814 ± 0.083, respectively. Integrating pathology features with clinical data further enhanced performance, yielding a top model with a mean AUC of 0.904 ± 0.047, in sharp contrast to a clinical-only model (mean AUC 0.584 ± 0.084). Crucially, a pathologist’s review confirmed that the model-identified high-attention regions correspond to known high-risk histopathological features.

CONCLUSION

A case-level MIL framework provides a superior approach for predicting LNM in advanced CRC. This method shows promise for risk stratification and therapy decisions, requiring further validation.

Keywords: Colorectal cancer; Lymph node metastasis; Deep learning; Multiple instance learning; Histopathology

Core Tip: To better predict lymph node metastasis (LNM) in advanced colorectal cancer, this pilot study developed a case-level deep learning framework. By analysing the pathology slides of all patients and emulating a pathologist's workflow, the model achieved a high area under the curve of 0.899, outperforming traditional methods. Integrating the clinical data further increased the accuracy to 0.904. This interpretable approach is a promising tool for refining LNM risk assessments and guiding adjuvant therapy decisions.