Prediction of graft outcomes after kidney transplantation: When standard statistics compare to machine learning techniques

doi:10.5527/wjn.v15.i1.116879

Advanced Search

BPG is committed to discovery and dissemination of knowledge

Home / Archive / Volume 15, Issue 1

This Article

(0) (0) (0) (862)

Peer-Review Report of This Article

CrossCheck and Google Search of This Article

Academic Rules and Norms of This Article

Citation of this article

Corresponding Author of This Article

Research Domain of This Article

Article-Type of This Article

Open-Access Policy of This Article

Times Cited Counts in Google of This Article

Journal Information of This Article

Publication Name

World Journal of Nephrology

ISSN

2220-6124

Publisher of This Article

Baishideng Publishing Group Inc, 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA

Retrospective Study

Copyright: ©Author(s) 2026. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution-NonCommercial (CC BY-NC 4.0) license. No commercial re-use. See permissions. Published by Baishideng Publishing Group Inc.

World J Nephrol. Mar 25, 2026; 15(1): 116879
Published online Mar 25, 2026. doi: 10.5527/wjn.v15.i1.116879

Prediction of graft outcomes after kidney transplantation: When standard statistics compare to machine learning techniques

Carolina Salgado, Francisca Gonzalez Cohens, Felipe A Vera, Rocío Ruiz, Juan D Velasquez, Fernando M Gonzalez

Carolina Salgado, Francisca Gonzalez Cohens, Felipe A Vera, Rocío Ruiz, Juan D Velasquez, Web Intelligence Centre, Faculty of Physics and Mathematical Sciences, Universidad de Chile, Santiago 7500922, Chile

Fernando M Gonzalez, Department of Nephrology, Faculty of Medicine, Universidad de Chile, Santiago 7500922, Chile

Author contributions: Salgado C built and ran all the models; Gonzalez Cohens F and Gonzalez FM revised the methodology, model performance, and interpretability; Salgado C, Gonzalez Cohens F, and Gonzalez FM wrote the manuscript; Vera FA, Ruiz R, and Velasquez JD reviewed, edited, and approved the manuscript.

Supported by a public grant from Agencia Nacional De Investigación Y Desarrollo, No. ID23I10232.

Institutional review board statement: The Comité Ético Científico del Servicio de Salud Metropolitano Oriente reviewed and approved the study (No. 29092020).

Informed consent statement: Written informed consent was obtained from all kidney transplant recipients upon their inclusion on the transplant centers’ waiting lists, authorizing the use of their anonymized clinical data.

Conflict-of-interest statement: All the authors report no relevant conflicts of interest for this article.

Data sharing statement: Data can be shared upon request.

Corresponding author: Fernando M Gonzalez, MD, Department of Nephrology, Faculty of Medicine, Universidad de Chile, Avenida Salvador 486, Providencia, Santiago 7500922, Chile. fgonzalf@uc.cl

Received: November 24, 2025
Revised: January 12, 2026
Accepted: February 9, 2026
Published online: March 25, 2026
Processing time: 111 Days and 7.3 Hours

Abstract

BACKGROUND

Over the last decade, the use of machine learning (ML) techniques in problem modeling and solving has increased significantly, including in kidney transplantation. Numerous studies have used ML to predict outcomes such as delayed graft function (DGF). This study compares various ML models with logistic regression (LR) in predicting DGF, focusing on donor characteristics.

AIM

To compare various ML models with LR in predicting DGF, focusing on donor characteristics.

METHODS

We analyzed 523 deceased donor kidney transplants performed between 2010 and 2020 across three transplant centers. The dataset included 14 donors, 3 transplants, and 64 recipient features. Four problem types were defined based on variable combinations: Donor-only, donor + transplant, donor + recipient, and donor + transplant + recipient. The dataset comprised 43.5% DGF-positive and 56.5% DGF-negative patients, split into 80% for training and 20% for validation/testing. Six ML models - support vector machine, decision trees, random forest (RF), gradient boost (GB), extreme gradient boost (XGB), and multilayer perceptron - were compared with LR. Hyperparameters were optimized using random search and 10-fold cross-validation. Accuracy was the primary performance metric.

RESULTS

The best-performing model for each problem type achieved accuracies of 70% (RF), 70% (RF), 58% (RF), and 61% (XGB) for donor-only, donor + transplant, donor + recipient, and donor + transplant + recipient, respectively. LR achieved accuracies of 57%, 66%, 52% and 66%; however, these models generally showed low sensitivity and high specificity. Across most of them, significant predictors included donor creatinine, age, and mean blood pressure, cold ischemia time (transplant variable), and recipient smoking condition.

CONCLUSION

While most ML models outperformed LR, the differences were not substantial. This may be attributed to the small dataset size, which likely contributed to the overall poor performance. We recommend using these complex models with high-quality datasets that include a sufficient number of variables and observations to fully leverage their potential. The key question for future research is determining the dataset size required for ML to become the primary analytic tool for predicting kidney transplant outcomes.

Keywords: Delayed graft function; Prediction; Logistic regression; Machine learning; Artificial intelligence

Core Tip: Machine learning (ML) is increasingly used in kidney transplantation research, including predicting delayed graft function. This study compares six ML models with logit across four donor, transplant, and recipient variable combinations. The dataset comprises 44.7% delayed graft function-positive cases. All methods have similar performances, with accuracies between 58%-70%. Important predictors included donor creatinine, age, and mean blood pressure, cold-ischemia time, and recipient smoking condition. Although ML approaches slightly outperformed logit, overall performance remained modest, likely due to limited sample-size. Further research should define dataset scale and quality for ML to become a primary analytic tool for predicting kidney transplant outcomes.