Yang RH, Fan WX, Zhong Y, Lin ZP, Chen JP, Jiang GH, Dai HY. Predicting esophageal cancer response to neoadjuvant therapy with magnetic resonance imaging radiomics. World J Gastrointest Oncol 2025; 17(10): 110671 [PMID: 41114118 DOI: 10.4251/wjgo.v17.i10.110671]
Corresponding Author of This Article
Hai-Yang Dai, MD, Department of Radiology, Huizhou Central People's Hospital, No. 41 North Eling Road, Huizhou 516001, Guangdong Province, China. d.ocean@163.com
Research Domain of This Article
Radiology, Nuclear Medicine & Medical Imaging
Article-Type of This Article
Retrospective Study
Open-Access Policy of This Article
This article is an open-access article which was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Oct 15, 2025 (publication date) through Oct 25, 2025
Times Cited of This Article
Times Cited (0)
Journal Information of This Article
Publication Name
World Journal of Gastrointestinal Oncology
ISSN
1948-5204
Publisher of This Article
Baishideng Publishing Group Inc, 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA
Share the Article
Yang RH, Fan WX, Zhong Y, Lin ZP, Chen JP, Jiang GH, Dai HY. Predicting esophageal cancer response to neoadjuvant therapy with magnetic resonance imaging radiomics. World J Gastrointest Oncol 2025; 17(10): 110671 [PMID: 41114118 DOI: 10.4251/wjgo.v17.i10.110671]
Co-corresponding authors: Gui-Hua Jiang and Hai-Yang Dai.
Author contributions: Yang RH and Fan WX contribute equally to this study as co-first authors and they participated in the conception and design of the study; Zhong Y involved in the acquisition, analysis, or interpretation of data; Lin ZP and Chen JP prepared the tables and figures; Yang RH wrote the first draft and subsequent versions; Jiang GH and Dai HY was responsible for project administration and supervision, and contributed equally to this work as co-corresponding authors; all authors critically reviewed and approved the final manuscript to be published.
Supported by Guangdong Medical Research Foundation, No. B2023272.
Institutional review board statement: This study was approved by the Ethics Committee on Clinical Researches and Novel Technologies of Meizhou People’s Hospital (grant No. 2023-C-45).
Informed consent statement: Patient informed consent was waived for this retrospective study.
Conflict-of-interest statement: All the authors report no relevant conflicts of interest for this article.
Data sharing statement:sharing statement: The data that support the findings of this study are available from the corresponding author upon reasonable request at d.ocean@163.com.
Open Access: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: https://creativecommons.org/Licenses/by-nc/4.0/
Corresponding author: Hai-Yang Dai, MD, Department of Radiology, Huizhou Central People's Hospital, No. 41 North Eling Road, Huizhou 516001, Guangdong Province, China. d.ocean@163.com
Received: June 12, 2025 Revised: August 12, 2025 Accepted: September 19, 2025 Published online: October 15, 2025 Processing time: 124 Days and 18.8 Hours
Abstract
BACKGROUND
Predicting the pathological response of esophageal cancer (EC) to neoadjuvant therapy (NAT) is of significant clinical importance.
AIM
To evaluate the pathological response of NAT in EC patients using multiple machine learning algorithms based on magnetic resonance imaging (MRI) radiomics.
METHODS
This retrospective study included 132 patients with pathologically confirmed EC, were randomly divided into a training cohort (n = 92) and a validation cohort (n = 40) in a 7:3 ratio. All patients underwent a preoperative MRI scan from the neck to the abdomen. High-throughput and quantitative radiomics features were extracted from T2-weighted imaging (T2WI). Radiomics signatures were selected using minimal redundancy maximal relevance and the least absolute shrinkage and selection operator. Nine classification algorithms were used to build the models, and the diagnostic performance of each model was evaluated using the area under the curve (AUC), sensitivity (SEN), and specificity (SPE).
RESULTS
A total of 1834 features were extracted. Following feature dimension reduction, ten radiomics features were selected to construct radiomics signatures. Among the nine classification algorithms, the ExtraTrees algorithm demonstrated the best diagnostic performance in both the training (AUC: 0.932; SEN: 0.906; SPE: 0.817) and validation cohorts (AUC: 0.900; SEN: 0.667; SPE: 0.700). The Delong test proved no significance in the diagnostic efficiency within these models (P > 0.05).
CONCLUSION
T2WI radiomics may aid in determining the pathological response to NAT in EC patients, serving as a noninvasive and quantitative tool to assist personalized treatment planning.
Core Tip: Few studies have utilized multiple radiomic algorithms to predict the pathological response to neoadjuvant therapy (NAT) in esophageal cancer (EC). In this study we found ten radiomics features related to pathological therapeutic response. The ExtraTrees algorithm performed good efficiency in the predictive and validation sets. Our study showed that the radiomics model derived from magnetic resonance imaging T2-weighted imaging images demonstrates good performance in determining the pathological response of NAT in EC, which would help individualized plans in EC.
Citation: Yang RH, Fan WX, Zhong Y, Lin ZP, Chen JP, Jiang GH, Dai HY. Predicting esophageal cancer response to neoadjuvant therapy with magnetic resonance imaging radiomics. World J Gastrointest Oncol 2025; 17(10): 110671
Esophageal cancer (EC) is a highly malignant digestive neoplasm, and radical resection combined with lymph node dissection is the standard treatment method[1,2]. However, a significant proportion of patients presents with local progression at diagnosis and are not candidates for surgery. Neoadjuvant therapy (NAT) prior to radical resection has been proved to reduce tumor size, stage, and surgical complexity, making it the first-line treatment option for patients with locally advanced EC[3]. However, some patients are resistant to NAT, thus receiving unnecessary treatment and experiencing toxic side effects, which negatively impact their prognosis[4]. Therefore, predicting the pathological response of EC to NAT at an early stage could help identify responders and reduce unnecessary treatment.
Currently, the assessment of treatment response to NAT in EC primarily relies on medical imaging techniques. Magnetic resonance imaging (MRI) could provide excellent anatomical details and soft tissue contrast, making it particularly useful for predicting tumor response after treatment[5-7]. According to the 2022 Chinese Society of Clinical Oncology guidelines, MRI has been recommended as a level 3 staging system for EC. The predictive value of using MRI to evaluate NAT response has achieved good results in various malignant tumors[8]. Recently, the development of artificial intelligence (AI) has further helped overcome the limitation of traditional imaging methods. The integration of AI with medical imaging has proven to be a powerful approach for enhancing the prediction of therapeutic response and prognosis, thereby aiding in the optimization of clinical treatment plans[9,10].
Radiomics, an advanced imaging analysis technique that extracts quantitative features from medical images, has gained widespread acceptance in clinical oncology for its ability to aid in predicting preoperative tumor staging and evaluating prognosis[11-13]. In particular, recent studies have demonstrated that radiomics features derived from imaging modalities can help predicting treatment response and prognosis in EC patients[14,15]. Despite the growing body of evidence supporting the usefulness of radiomics in EC, few studies have specifically addressed the use of MRI-based radiomic features in predicting NAT response. Among the MRI sequences, T2-weighted imaging (T2WI) has been less frequently explored for this purpose. T2WI, which provides high-resolution anatomical details, may offer distinct advantages in assessing tumor response to NAT, particularly with respect to changes in tumor microstructure and tissue composition that occur during treatment.
This study aimed to fill this gap by utilizing T2WI MRI-based radiomic features in conjunction with advanced machine learning algorithms to predict the pathological response to NAT in EC patients. The ultimate goal is to enhance the ability to stratify patients based on their likelihood of responding to NAT, thereby guiding more personalized treatment approaches and improving clinical outcomes.
MATERIALS AND METHODS
Patients
This retrospective study included 132 patients with pathologically confirmed EC who underwent NAT followed by radical surgery between January 2016 and October 2022. The inclusion criteria were: (1) Postoperative pathology confirmed EC; (2) Preoperative MRI scans performed within two weeks prior to NAT; (3) Clinical staging of II to IV; and (4) No history of other malignance. The exclusion criteria were as follows: (1) Any therapies administered prior to MRI examination; (2) Interruption of NAT for more than 2 weeks; (3) No surgical resection following NAT; (4) Poor quality of MRI images; and (5) Incomplete clinical data. All patients underwent radical EC resection surgery 4-6 weeks following completion of NAT. Based on the postoperative pathological remission status, patients were classified into a response group [tumor regression grades (TRGs) I to II] with better remission response and a non-response group (TRGs III to V) with poorer remission. This study was approved by the Ethics Committee on Clinical Researches and Novel Technologies of Meizhou People’s Hospital (grant No. 2021-CY-32). Written informed consent was waived by the ethics committee due to the retrospective nature of the study. All methods were performed in accordance with the relevant ethical guidelines and regulations.
Clinical and pathological data
Relevant clinical information was extracted from the hospital’s electronic medical record system, including patients’ age, gender, carcinoembryonic antigen (CEA), squamous cell carcinoma antigen (SCCA). The pathological results include tumor location, tumor size, degree of differentiation, TNM staging and tumor pathological response level.
Image acquisition
All examinations were conducted using a 3.0 T MRI system (Magnetom Skyra, Siemens Healthineers, Germany) equipped with an 18-channel body array coil. To minimize motion artifacts, patients were positioned supine and instructed to maintain quiet, regular breathing throughout the scan. The scanning range was from the level of the supraclavicular fossa to the gastric cardia. The scanning sequence and parameters were: (1) T1-weighted imaging (T1WI) Dixon sequence: Positive phase TR: 4.0 ms, TE: 1.3 ms, thickness: 5 mm, spacing: 6.5 mm; reverse phase TR: 4.0 ms, TE: 2.5 ms, thickness: 5 mm, spacing: 6.5 mm; (2) T2WI TSE BLADE sequence: TR: 7408 ms, TE: 84 ms, thickness: 5 mm, spacing: 6.5 mm; and (3) Diffusion-weighted imaging (DWI) sequence: TR: 6900 ms, TE: 59 ms, thickness: 5 mm, spacing: 6.5 mm, B values including 50 s/mm2, 800 s/mm2, and 1000 s/mm2, respectively. The axial T2WI images were used to build the radiomics models. All image matrix was 210 × 210.
Pathological evaluation
The specimens excised after NAT were evaluated by a senior gastrointestinal pathologist. According to the standards of Mandard et al[16], tumor tissue was classified into TRG1 to TRG5: (1) TRG1: Complete remission, and tumor cells completely disappeared; (2) TRG2: A small amount of residual tumor cells with fibrotic proliferation; (3) TRG3: There are more residual tumor cells, but fibrosis and proliferation are still the main forms; (4) TRG4: The proportion of residual tumor cells is greater than that of fibrotic proliferation; and (5) TRG5: No fibrotic or necrotic reactions.
Image segmentation
DICOM raw data were imported into the 3Dslicer software. Two doctors with more than 10 years’ experience of abdominal imaging manually delineated and segmented the tumors along tumor margin on the axial T2WI images with unaware of the pathological results, and tumor volumes were generated on the software platform automatically. During the drawing and segmenting process, tumor necrosis, adjacent fat tissue and uninvolved esophagus wall should be excluded. Then another senior diagnostic doctor review and check all the region of interests (ROIs), with any discrepancies were resolved by discussion.
Image preprocessing
All segmented images were imported into the AK software (Artificial Intelligence Kit V3.3.0, GE Healthcare, China). The following steps were followed for image preprocessing according to previous reports[17,18]: (1) Step 1: Resampling of the images using B-spline interpolation to standardize the voxel size to 1 mm3; and (2) Step 2: Grayscale normalization, according to the Image Biomarker Standardization Initiative guidelines[19].
Radiomic feature extraction
Patients were randomly divided into a training cohort (n = 92) and a validation cohort (n = 40) at a 7:3 ratio, as according to previous report[20]. High-throughput extraction of radiomic features from the training cohort was executed with the Pyradiomics toolkit. The extracted feature set encompassed a comprehensive suite of 1834 metrics, including first-order statistics, morphological (shape-based) features, and second-order textural features derived from gray-level co-occurrence, run-length, size zone, dependence, and neighboring difference matrices. Subsequent to extraction, feature normalization was performed using the z-score technique (mean = 0, standard deviation = 1) to mitigate the influence of data scale variations across different feature types, consistent with conventional radiomics preprocessing practices[21]. Feature data extracted from the training cohort were used for feature selection, dimensionality reduction and prediction model constructions, while data from the validation cohort were used for validate the performance and effectiveness of the constructed models.
Feature selection and dimensionality reduction
Feature dimensionality reduction was performed to ensure the robustness of these models. Firstly, the intra-class correlation coefficients (ICCs) are calculated. Only features demonstrating excellent reproducibility (ICC > 0.75) were retained for subsequent analysis. t-test or Mann-Whitney U tests are then used to screen features with P < 0.05. Spearman’s rank coefficient was applied to determine the correlation between features with high repetitiveness. During this process, correlation coefficient between features was calculated as in reference to previous reports[22,23], and feature was retained with a correlation coefficient exceed 0.9. Following feature selection using the least absolute shrinkage and selection operator (LASSO) algorithm with 10-fold cross-validation, features with non-zero coefficients were retained to construct the radiomics signature. Nine classification algorithms, including logistic regression, support vector machines, K-nearest neighbor (KNN), random forest, ExtraTrees, extreme gradient boosting, light gradient boosting machine, multilayer perceptron and adaptive boosting, were applied to calculate a radiomics score for each patient. The area under curve (AUC) was calculated to assess model performance. The radiomics workflow was shown in Figure 1.
Figure 1 The workflow of radiomics analysis.
T2WI: T2-weighted imaging.
Statistical analysis
We used Python (version 3.5.6) and SPSS 19.0 software to perform statistical analysis. Continuous variables were analyzed using independent sample t-tests (normal distribution) or Mann Whitney U tests (non-normal distribution), while categorical variables were analyzed using χ2 tests or Fisher's exact probability test. ROC and AUC were used to evaluate the predictive performance of each model. Clinical application was evaluated by decision curve analysis (DCA), and the AUC of each model was compared by Delong test. A two-sided P value < 0.05 was considered statistically significant.
RESULTS
Patient characteristics
The clinical characteristics of all patients are summarized in Table 1. No statistically significant differences were observed in age, gender, or tumor location, degree of differentiation, length of tumor, or levels of CEA and SCCA between the response group and non-response groups (P > 0.05).
Table 1 Clinical characteristics of all esophageal cancer patients.
A total of 1834 radiomics features were extracted. After feature selection and dimensionality reduction, ten radiomics features were retained for NAT response in EC. The names and descriptions of the selected features are provided in Figure 2.
Figure 2 The names and descriptions of the selected features.
Radiomics signature discrimination performance
The radiomics signature was evaluated using nine different classification algorithms. The AUC values in the training cohort ranged from 0.735 to 1.000, and in the validation cohort from 0.733 to 0.900 (Table 2). Among these, the ExtraTrees algorithm demonstrated the best diagnostic performance in both the training [AUC: 0.932; sensitivity (SEN): 0.906; specificity (SPE): 0.817] and validation cohort (AUC: 0.900; SEN: 0.667; SPE: 0.700). The Delong test showed no significant difference in diagnostic efficacy among these algorithms (P > 0.05; Table 3). The receiver operating characteristic curves for each model in both cohorts are shown in Figure 3. DCA further indicated that the ExtraTrees model provided the highest net benefit across a range of probability thresholds (Figure 4).
Figure 3 Performance evaluation of radiomics models.
A and B: Receiver operating characteristic curve analysis comparing model performance in the training (A) and validation cohort (B). AUC: Area under the curve; KNN: K-nearest neighbor; GBM: Light gradient boosting machine; LR: Logistic regression; MLP: Multilayer perceptron; SVM: Support vector machines; XGBoost: Extreme gradient boosting.
Figure 4 Decision curve analysis demonstrating clinical utility across probability thresholds in the validation cohort.
The Y-axis indicates net benefit standardized by patient population. The blue curve corresponds to the radiomics model, the black solid curve represents the all-patients-treated scenario, and the black dashed line indicates the no-patients-treated scenario. DCA: Decision curve analysis.
Table 2 Performance of different classification algorithms in predicting neoadjuvant therapy efficacy in esophageal cancer.
This study introduced a radiomics framework designed to predict personalized responses to NAT in individuals with EC. A key distinction from prior investigations is the utilization of the T2WI sequence from MRI to derive radiomics features. The findings suggested that MRI-based approach had potential as a valuable instrument for clinicians to detect non-responsive patients early in the course of NAT, thereby playing an assisting role in refining the comprehensive treatment strategy for appropriate EC cases.
EC prognosis varies significantly due to tumor heterogeneity. While radiomics methods using enhanced computed tomography (CT) or 18F-FDG positron emission tomography (PET) images have been employed to assess NAT response[24-26], few studies have explored magnetic resonance radiomics for this purpose. Hou et al[27] extracted 138 radiomics features from T2WI images of 68 EC patients, demonstrating strong predictive performance for radio-chemotherapy response. Compared with conventional CT or PET-CT, T2WI offers superior lesion detail visualization, avoids radiation exposure, and eliminates concerns related to renal impairment or contrast agent allergies[28]. Therefore, we leveraged T2WI images and a machine learning approach to develop an optimal radiomics model for predicting NAT response in EC patients. Ten radiomics features were selected from the T2WI sequences. These features primarily reflect differences in pixel grayscale intensity and morphological patterns. Radiomics features characterize the spatial distribution patterns of pixels (voxels) within the tumor. Greater tumor heterogeneity typically manifests as more irregular voxel distributions, which may correlate with treatment response and patient prognosis[29]. Results from our study demonstrate that MRI T2WI radiomics demonstrated good performance in predicting the pathological response to NAT in EC patients. Our findings may assist in guiding individualized treatment and care for EC patients, potentially improving prognosis.
Various methods can be used to build radiomics signatures, which can result in different prediction accuracy. In the present study, we established nine classification algorithms to build radiomics models based on T2WI images. By calculating the key features in these algorithms, the ExtraTrees algorithm exhibited the best diagnostic accuracy and stability in the primary cohort and also good performance in the validation set. This may be contributed to the advantage of ExtraTrees algorithm in avoiding over-fitting ability for high-dimensional data, and its robustness, stability and good generalization ability[30]. Our results indicate that the radiomics algorithms based on T2WI images could predict pathological response of EC with high accuracy, and is consistent with previous reports[27]. Results from DCA indicate the ExtraTrees model exhibit good diagnostic efficiency, which is consistence with previous results of Hou et al[27] and Lu et al[31]. These models identify tumor characteristics beyond visual observation, and utilize machine learning algorithms revealing comprehensive situation of tumor heterogeneity.
Radiomics features that extracted from MRI images could provide more detailed variations in tissue composition and tumor heterogeneity[32,33]. Hu et al[34] predicted an AUC of 0.852 for the efficacy of radiotherapy and chemotherapy in EC based on the CT radiomics characteristics of tumors. Yang et al[35] predicted the efficacy of neoadjuvant radiotherapy and chemotherapy for EC based on the radiomics features selected by LASSO. The results showed that its AUC in the training cohort was 0.84-0.86, and in the testing cohort it was 0.71-0.79. Kawahara et al[36] developed a multi-institution prediction model based on PET/CT radiomics and dosimetry features. The mixed model achieved accuracy rates of 84.4%, 86.0% and 79.0% for CT radiomics based on KNN classifier, PET radiomics based on PET classifier, and dosomics based on KNN classifier, respectively. Our MRI-based radiomics model demonstrated superior performance compared to the aforementioned CT-based models, suggesting the potential added value of MRI in characterizing EC tumor heterogeneity.
No clinical factors were found to be significantly associated with pathological response to NAT in EC in this study, which is consistent with most previous reports[37,38]. However, Chao et al[39] identified age as a risk factor. This discrepancy may be attributed to differences in sample size or the characteristics of the cohort populations included in these studies.
This research has several limitations. First, its single-center origin and retrospective nature, coupled with a limited sample size, mean that our findings must be validated in future multicenter studies with larger, prospective cohorts. Second, manual delineation of the ROI may introduce inter-observer variability, potentially affecting the consistency of the results. Last, we incorporated only a limited set of clinical factors. Future studies should integrate a broader range of clinical features with radiomics characteristics which including DWI and contrast-enhanced T1WI to better elucidate the relationship between these factors and the pathological response to NAT in EC.
CONCLUSION
This study demonstrates that MRI T2WI-based radiomics can effectively predict pathological response to NAT in EC. The T2WI-derived models may serve as a valuable clinical tool to guide personalized treatment planning for EC patients.
Footnotes
Provenance and peer review: Unsolicited article; Externally peer reviewed.
Peer-review model: Single blind
Specialty type: Oncology
Country of origin: China
Peer-review report’s classification
Scientific Quality: Grade C
Novelty: Grade C
Creativity or Innovation: Grade C
Scientific Significance: Grade C
P-Reviewer: Xia L, MD, China S-Editor: Lin C L-Editor: A P-Editor: Yu HG
Gabrielson S, Tsai JA, Celebioglu F, Nilsson M, Rouvelas I, Lindblad M, Bjäreback A, Tomson A, Axelsson R. Preoperative detection of sentinel lymph nodes with hybrid SPECT/computed tomography imaging may improve the accuracy of sentinel lymph node biopsies in patients with early stages of cancer of the oesophagus or gastro-oesophageal junction.Nucl Med Commun. 2020;41:1153-1160.
[RCA] [PubMed] [DOI] [Full Text] [Full Text (PDF)][Cited by in RCA: 2][Reference Citation Analysis (0)]
Luo T, Wan J, Liu S, Wang X, Zhou P, Xue Q, Hou J, Wang P. Establishment of a scoring model for predicting clinical outcomes in patients with unilateral primary aldosteronism after superselective adrenal artery embolization.Ir J Med Sci. 2024;193:2269-2279.
[RCA] [PubMed] [DOI] [Full Text][Cited by in RCA: 3][Reference Citation Analysis (0)]
Shen J, Du H, Wang Y, Du L, Yang D, Wang L, Zhu R, Zhang X, Wu J. A novel nomogram model combining CT texture features and urine energy metabolism to differentiate single benign from malignant pulmonary nodule.Front Oncol. 2022;12:1035307.
[RCA] [PubMed] [DOI] [Full Text][Cited by in RCA: 7][Reference Citation Analysis (0)]
Mo X, Chen W, Chen S, Chen Z, Guo Y, Chen Y, Wu X, Zhang L, Chen Q, Jin Z, Li M, Chen L, You J, Xiong Z, Zhang B, Zhang S. MRI texture-based machine learning models for the evaluation of renal function on different segmentations: a proof-of-concept study.Insights Imaging. 2023;14:28.
[RCA] [PubMed] [DOI] [Full Text][Cited by in RCA: 7][Reference Citation Analysis (0)]
Kawahara D, Murakami Y, Awane S, Emoto Y, Iwashita K, Kubota H, Sasaki R, Nagata Y. Radiomics and dosiomics for predicting complete response to definitive chemoradiotherapy patients with oesophageal squamous cell cancer using the hybrid institution model.Eur Radiol. 2024;34:1200-1209.
[RCA] [PubMed] [DOI] [Full Text][Cited by in Crossref: 2][Cited by in RCA: 4][Article Influence: 4.0][Reference Citation Analysis (0)]