BPG is committed to discovery and dissemination of knowledge
Review
Copyright ©The Author(s) 2025. Published by Baishideng Publishing Group Inc. All rights reserved.
World J Gastroenterol. Dec 21, 2025; 31(47): 112921
Published online Dec 21, 2025. doi: 10.3748/wjg.v31.i47.112921
Foundation models: Insights and implications for gastrointestinal cancer
Lei Shi, Rui Huang, Li-Ling Zhao, An-Jie Guo
Lei Shi, Rui Huang, An-Jie Guo, School of Life Sciences, Chongqing University, Chongqing 400044, China
Li-Ling Zhao, Department of Stomatology, The First Affiliated Hospital of Chongqing Medical University, Chongqing 400042, China
Author contributions: Shi L and Huang R designed the study and collected the data; Shi L, Huang R, and Zhao LL analyzed and interpreted the data; Shi L and Huang R wrote the manuscript; Zhao LL and Guo AJ revised the manuscript; all authors approved the final version of the manuscript.
Supported by the Open Project Program of Panxi Crops Research and Utilization Key Laboratory of Sichuan Province, No. SZKF202302; and the Fundamental Research Funds for the Central Universities No. 2019CDYGYB024.
Conflict-of-interest statement: The authors deny any conflict of interest.
Open Access: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: https://creativecommons.org/Licenses/by-nc/4.0/
Corresponding author: Lei Shi, Associate Professor, School of Life Sciences, Chongqing University, No. 55 University City South Road, Shapingba District, Chongqing 400044, China. shil@cqu.edu.cn
Received: August 11, 2025
Revised: September 10, 2025
Accepted: November 3, 2025
Published online: December 21, 2025
Processing time: 132 Days and 10 Hours
Abstract

Gastrointestinal (GI) cancers represent a major global health concern due to their high incidence and mortality rates. Foundation models (FMs), also referred to as large models, represent a novel class of artificial intelligence technologies that have demonstrated considerable potential in addressing these challenges. These models encompass large language models (LLMs), vision FMs (VFMs), and multimodal LLMs (MLLMs), all of which utilize transformer architectures and self-supervised pre-training on extensive unlabeled datasets to achieve robust cross-domain generalization. This review delineates the principal applications of these models: LLMs facilitate the structuring of clinical narratives, extraction of insights from medical records, and enhancement of physician-patient communication; VFMs are employed in the analysis of endoscopic, radiological, and pathological images for lesion detection and staging; MLLMs integrate heterogeneous data modalities, including imaging, textual information, and genomic data, to support diagnostic processes, treatment prediction, and prognostic evaluation. Despite these promising developments, several challenges remain, such as the need for data standardization, limited diversity within training datasets, substantial computational resource requirements, and ethical-legal concerns. In conclusion, FMs exhibit significant potential to advance research and clinical management of GI cancers. Future research efforts should prioritize the refinement of these models, promote international collaborations, and adopt interdisciplinary approaches. Such a comprehensive strategy is essential to fully harness the capabilities of FMs, driving substantial progress in the fight against GI malignancies.

Keywords: Foundation models; Gastrointestinal cancers; Large language models; Vision foundation models; Multimodal large language models

Core Tip: This review synthesizes applications of foundation models in gastrointestinal cancer, from clinical text structuring and image analysis to multimodal data integration. Despite current knowledge gaps and challenges like data standardization, it highlights foundation models’ transformative potential, urging refined models and collaborations to advance gastrointestinal cancer research.