1
|
Godau P, Kalinowski P, Christodoulou E, Reinke A, Tizabi M, Ferrer L, Jäger P, Maier-Hein L. Navigating prevalence shifts in image analysis algorithm deployment. Med Image Anal 2025; 102:103504. [PMID: 40020420 DOI: 10.1016/j.media.2025.103504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Revised: 01/16/2025] [Accepted: 02/11/2025] [Indexed: 03/03/2025]
Abstract
Domain gaps are significant obstacles to the clinical implementation of machine learning (ML) solutions for medical image analysis. Although current research emphasizes new training methods and network architectures, the specific impact of prevalence shifts on algorithms in real-world applications is often overlooked. Differences in class frequencies between development and deployment data are crucial, particularly for the widespread adoption of artificial intelligence (AI), as disease prevalence can vary greatly across different times and locations. Our contribution is threefold. Based on a diverse set of 30 medical classification tasks (1) we demonstrate that lack of prevalence shift handling can have severe consequences on the quality of calibration, decision threshold, and performance assessment. Furthermore, (2) we show that prevalences can be accurately and reliably estimated in a data-driven manner. Finally, (3) we propose a new workflow for prevalence-aware image classification that uses estimated deployment prevalences to adjust a trained classifier to a new environment, without requiring additional annotated deployment data. Comprehensive experiments indicate that our proposed approach could contribute to generating better classifier decisions and more reliable performance estimates compared to current practice.
Collapse
Affiliation(s)
- Patrick Godau
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany; National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Hospital Heidelberg, Germany; Faculty of Mathematics and Computer Science, Heidelberg University, Germany; HIDSS4Health - Helmholtz Information and Data Science School for Health, Karlsruhe/Heidelberg, Germany.
| | - Piotr Kalinowski
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany; Faculty of Mathematics and Computer Science, Heidelberg University, Germany; HIDSS4Health - Helmholtz Information and Data Science School for Health, Karlsruhe/Heidelberg, Germany.
| | - Evangelia Christodoulou
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany; AI Health Innovation Cluster, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Annika Reinke
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Germany
| | - Minu Tizabi
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Germany
| | - Luciana Ferrer
- Instituto de Ciencias de la Computación, UBA-CONICET, Argentina
| | - Paul Jäger
- Helmholtz Imaging, German Cancer Research Center (DKFZ), Germany; Interactive Machine Learning Group, German Cancer Research Center (DKFZ), Germany
| | - Lena Maier-Hein
- German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany; National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Hospital Heidelberg, Germany; Faculty of Mathematics and Computer Science, Heidelberg University, Germany; Helmholtz Imaging, German Cancer Research Center (DKFZ), Germany; Medical Faculty, Heidelberg University, Germany
| |
Collapse
|
2
|
Schindele A, Krebold A, Heiß U, Nimptsch K, Pfaehler E, Berr C, Bundschuh RA, Wendler T, Kertels O, Tran-Gia J, Pfob CH, Lapa C. Interpretable machine learning for thyroid cancer recurrence predicton: Leveraging XGBoost and SHAP analysis. Eur J Radiol 2025; 186:112049. [PMID: 40096773 DOI: 10.1016/j.ejrad.2025.112049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2024] [Revised: 02/28/2025] [Accepted: 03/11/2025] [Indexed: 03/19/2025]
Abstract
PURPOSE For patients suffering from differentiated thyroid cancer (DTC), several clinical, laboratory, and pathological features (including patient age, tumor size, extrathyroidal extension, or serum thyroglobulin levels) are currently used to identify recurrence risk. Validation and potential adjustment of their individual and combined prognostic values using a large patient cohort with several years of follow-up might improve the correct identification of patients at risk. METHODS In this retrospective study, we developed an XGBoost model using clinical and biomarker features for accurate DTC recurrence prediction using a cohort of 1228 consecutive patients (965 papillary, and 263 follicular) that were treated at the Department of Nuclear Medicine at University Hospital Augsburg between 1976 and 2010. The dataset was split into 982 patients for model training, and 246 for independent testing. From the 982 patients, 200 different random combinations of 785 training and 197 validation patients were conducted. To identify critical risk factors and understand the model's decision-making process, we conducted Shapely Additive exPlanations (SHAP) analysis. RESULTS The XGBoost model achieved an AUROC of 0.84 (95 % CI: 0.84-0.86; SD: 0.08), sensitivity of 0.79 (95 % CI: 0.77-0.81; SD: 0.17), and specificity of 0.78 (95 % CI: 0.77-0.79; SD: 0.04) on the validation datasets, and an AUROC of 0.88 (sensitivity 0.83, specificity 0.80) on the independent test set. Tumor size, maximal thyroglobulin values within six months after thyroidectomy, and maximal thyroglobulin antibody levels within 12 to 24 months after thyroidectomy were the most important factors. SHAP dependence plots suggested new recurrence risk thresholds for a tumor size of 25 mm, maximal serum thyroglobulin levels of 3 and 10 ng/mL, respectively, and maximal thyroglobulin antibody levels of 120 IU/mL. CONCLUSION Our XGBoost model, supported by SHAP analysis empowers clinicians with interpretable insights and defined risk thresholds and could facilitate informed decision-making and patient-centric care.
Collapse
Affiliation(s)
- Andreas Schindele
- Nuclear Medicine, Faculty of Medicine, University of Augsburg, Augsburg, Germany
| | - Anne Krebold
- Nuclear Medicine, Faculty of Medicine, University of Augsburg, Augsburg, Germany
| | - Ursula Heiß
- Nuclear Medicine, Faculty of Medicine, University of Augsburg, Augsburg, Germany
| | - Kerstin Nimptsch
- Nuclear Medicine, Faculty of Medicine, University of Augsburg, Augsburg, Germany
| | - Elisabeth Pfaehler
- Nuclear Medicine, Faculty of Medicine, University of Augsburg, Augsburg, Germany
| | - Christina Berr
- Internal Medicine I, Faculty of Medicine, University of Augsburg, Augsburg, Germany
| | - Ralph A Bundschuh
- Nuclear Medicine, Faculty of Medicine, University of Augsburg, Augsburg, Germany
| | - Thomas Wendler
- Diagnostic and Interventional Radiology, Faculty of Medicine, University of Augsburg, Augsburg, Germany
| | - Olivia Kertels
- Department of Diagnostic and Interventional Neuroradiology, School of Medicine, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
| | - Johannes Tran-Gia
- Department of Nuclear Medicine, University Hospital Würzburg, Oberdürrbacherstr. 6, Würzburg 97080, Germany
| | - Christian H Pfob
- Nuclear Medicine, Faculty of Medicine, University of Augsburg, Augsburg, Germany
| | - Constantin Lapa
- Nuclear Medicine, Faculty of Medicine, University of Augsburg, Augsburg, Germany.
| |
Collapse
|
3
|
Godwin RC, Tung A, Berkowitz DE, Melvin RL. Transforming Physiology and Healthcare through Foundation Models. Physiology (Bethesda) 2025; 40:0. [PMID: 39832521 DOI: 10.1152/physiol.00048.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2024] [Revised: 11/30/2024] [Accepted: 01/08/2025] [Indexed: 01/22/2025] Open
Abstract
Recent developments in artificial intelligence (AI) may significantly alter physiological research and healthcare delivery. Whereas AI applications in medicine have historically been trained for specific tasks, recent technological advances have produced models trained on more diverse datasets with much higher parameter counts. These new, "foundation" models raise the possibility that more flexible AI tools can be applied to a wider set of healthcare tasks than in the past. This review describes how these newer models differ from conventional task-specific AI, which relies heavily on focused datasets and narrow, specific applications. By examining the integration of AI into diagnostic tools, personalized treatment strategies, biomedical research, and healthcare administration, we highlight how these newer models are revolutionizing predictive healthcare analytics and operational workflows. In addition, we address ethical and practical considerations associated with the use of foundation models by highlighting emerging trends, calling for changes to existing guidelines, and emphasizing the importance of aligning AI with clinical goals to ensure its responsible and effective use.
Collapse
Affiliation(s)
- Ryan C Godwin
- Department of Anesthesiology and Perioperative Medicine, University of Alabama at Birmingham, Birmingham, Alabama, United States
| | - Avery Tung
- Department of Anesthesia and Critical Care, University of Chicago, Chicago, Illinois, United States
| | - Dan E Berkowitz
- Department of Anesthesiology and Perioperative Medicine, University of Alabama at Birmingham, Birmingham, Alabama, United States
| | - Ryan L Melvin
- Department of Anesthesiology and Perioperative Medicine, University of Alabama at Birmingham, Birmingham, Alabama, United States
| |
Collapse
|
4
|
Yuan JH, Jin YM, Xiang JY, Li SS, Zhong YX, Zhang SL, Zhao B. Machine learning-based prediction of postoperative mortality risk after abdominal surgery. World J Gastrointest Surg 2025; 17:103696. [DOI: 10.4240/wjgs.v17.i4.103696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/27/2024] [Revised: 01/25/2025] [Accepted: 02/18/2025] [Indexed: 03/29/2025] Open
Abstract
BACKGROUND Preoperative risk assessments are vital for identifying patients at high risk of postoperative mortality. However, traditional scoring systems can be time consuming. We hypothesized that the use of machine learning models would enable rapid and accurate risk assessments to be performed.
AIM To assess the potential of machine learning algorithms to develop predictive models of mortality risk after abdominal surgery.
METHODS This retrospective study included 230 individuals who underwent abdominal surgery at the Seventh People’s Hospital of Shanghai University of Traditional Chinese Medicine between January 2023 and December 2023. Demographic and surgery-related data were collected and used to develop nomogram, decision-tree, random-forest, gradient-boosting, support vector machine, and naïve Bayesian models to predict 30-day mortality risk after abdominal surgery. Models were assessed using receiver operating characteristic curves and compared using the DeLong test.
RESULTS Of the 230 included patients, 52 died and 178 survived. Models were developed using the training cohort (n = 161) and assessed using the validation cohort (n = 68). The areas under the receiver operating characteristic curves for the nomogram, decision-tree, random-forest, gradient-boosting tree, support vector machine, and naïve Bayesian models were 0.908 [95% confidence interval (CI): 0.824-0.992], 0.874 (95%CI: 0.785-0.963), 0.928 (95%CI: 0.869-0.987), 0.907 (95%CI: 0.837-0.976), 0.983 (95%CI: 0.959-1.000), and 0.807 (95%CI: 0.702-0.911), respectively.
CONCLUSION Nomogram, random-forest, gradient-boosting tree, and support vector machine models all demonstrate strong performances for the prediction of postoperative mortality and can be selected based on the clinical circumstances.
Collapse
Affiliation(s)
- Ji-Hong Yuan
- Department of General Surgery, Seventh People’s Hospital of Shanghai University of Traditional Chinese Medicine, Shanghai 201317, China
| | - Yong-Mei Jin
- Department of General Surgery, Seventh People’s Hospital of Shanghai University of Traditional Chinese Medicine, Shanghai 201317, China
| | - Jing-Ye Xiang
- Department of Health Management, Zhenru Community Health Service Center of Putuo District, Shanghai 200333, China
| | - Shuang-Shuang Li
- Department of Oncology, Seventh People’s Hospital of Shanghai University of Traditional Chinese Medicine, Shanghai 201317, China
| | - Ying-Xi Zhong
- Department of Rehabilitation, Seventh People’s Hospital of Shanghai University of Traditional Chinese Medicine, Shanghai 201317, China
| | - Shu-Liu Zhang
- Department of Critical Care Medicine, The 960th Hospital of the PLA Joint Logistics Support Force, Jinan 250000, Shandong Province, China
| | - Bin Zhao
- Department of General Surgery, Seventh People’s Hospital of Shanghai University of Traditional Chinese Medicine, Shanghai 201317, China
| |
Collapse
|
5
|
Jones WS, Farrow DJ. One-class support vector machines for detecting population drift in deployed machine learning medical diagnostics. Sci Rep 2025; 15:12157. [PMID: 40204747 PMCID: PMC11982198 DOI: 10.1038/s41598-025-94427-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2024] [Accepted: 03/13/2025] [Indexed: 04/11/2025] Open
Abstract
Machine learning (ML) models are increasingly being applied to diagnose and predict disease, but face technical challenges such as population drift, where the training and real-world deployed data distributions differ. This phenomenon can degrade model performance, risking incorrect diagnoses. Current detection methods are limited: not directly measuring population drift and often requiring ground truth labels for new patient data. Here, we propose using a one-class support vector machine (OCSVM) to detect population drift. We trained a OCSVM on the Wisconsin Breast Cancer dataset and tested its ability to detect population drift on simulated data. Simulated data was offset at 0.4 standard deviations of the minimum and maximum values of the radius_mean variable, at three noise levels: 5%, 10% and 30% of the standard deviation; 10,000 records per noise level. We hypothesised that increased noise would correlate with more OCSVM-detected inliers, indicating a sensitivity to population drift. As noise increased, more inliers were detected: 5% (27 inliers), 10% (486), and 30% (851). Therefore, this approach could effectively alert to population drift, supporting safe ML diagnostics adoption. Future research should explore OCSVM monitoring on real-world data, enhance model transparency, investigate complementary statistical and ML methods, and extend applications to other data types.
Collapse
Affiliation(s)
- William S Jones
- Centre of Excellence for Data Science, Artificial Intelligence and Modelling (DAIM), Faculty of Science and Engineering, University of Hull, Hull, UK.
| | - Daniel J Farrow
- Centre of Excellence for Data Science, Artificial Intelligence and Modelling (DAIM), Faculty of Science and Engineering, University of Hull, Hull, UK
| |
Collapse
|
6
|
Brodsky V, Ullah E, Bychkov A, Song AH, Walk EE, Louis P, Rasool G, Singh RS, Mahmood F, Bui MM, Parwani AV. Generative Artificial Intelligence in Anatomic Pathology. Arch Pathol Lab Med 2025; 149:298-318. [PMID: 39836377 DOI: 10.5858/arpa.2024-0215-ra] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/20/2024] [Indexed: 01/22/2025]
Abstract
CONTEXT.— Generative artificial intelligence (AI) has emerged as a transformative force in various fields, including anatomic pathology, where it offers the potential to significantly enhance diagnostic accuracy, workflow efficiency, and research capabilities. OBJECTIVE.— To explore the applications, benefits, and challenges of generative AI in anatomic pathology, with a focus on its impact on diagnostic processes, workflow efficiency, education, and research. DATA SOURCES.— A comprehensive review of current literature and recent advancements in the application of generative AI within anatomic pathology, categorized into unimodal and multimodal applications, and evaluated for clinical utility, ethical considerations, and future potential. CONCLUSIONS.— Generative AI demonstrates significant promise in various domains of anatomic pathology, including diagnostic accuracy enhanced through AI-driven image analysis, virtual staining, and synthetic data generation; workflow efficiency, with potential for improvement by automating routine tasks, quality control, and reflex testing; education and research, facilitated by AI-generated educational content, synthetic histology images, and advanced data analysis methods; and clinical integration, with preliminary surveys indicating cautious optimism for nondiagnostic AI tasks and growing engagement in academic settings. Ethical and practical challenges require rigorous validation, prompt engineering, federated learning, and synthetic data generation to help ensure trustworthy, reliable, and unbiased AI applications. Generative AI can potentially revolutionize anatomic pathology, enhancing diagnostic accuracy, improving workflow efficiency, and advancing education and research. Successful integration into clinical practice will require continued interdisciplinary collaboration, careful validation, and adherence to ethical standards to ensure the benefits of AI are realized while maintaining the highest standards of patient care.
Collapse
Affiliation(s)
- Victor Brodsky
- From the Department of Pathology and Immunology, Washington University School of Medicine in St Louis, St Louis, Missouri (Brodsky)
| | - Ehsan Ullah
- the Department of Surgery, Health New Zealand, Counties Manukau, New Zealand (Ullah)
| | - Andrey Bychkov
- the Department of Pathology, Kameda Medical Center, Kamogawa City, Chiba Prefecture, Japan (Bychkov)
- the Department of Pathology, Nagasaki University, Nagasaki, Japan (Bychkov)
| | - Andrew H Song
- the Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts (Song, Mahmood)
| | - Eric E Walk
- Office of the Chief Medical Officer, PathAI, Boston, Massachusetts (Walk)
| | - Peter Louis
- the Department of Pathology and Laboratory Medicine, Rutgers Robert Wood Johnson Medical School, New Brunswick, New Jersey (Louis)
| | - Ghulam Rasool
- the Department of Oncologic Sciences, Morsani College of Medicine and Department of Electrical Engineering, University of South Florida, Tampa (Rasool)
- the Department of Machine Learning, Moffitt Cancer Center and Research Institute, Tampa, Florida (Rasool)
- Department of Machine Learning, Neuro-Oncology, Moffitt Cancer Center and Research Institute, Tampa, Florida (Rasool)
| | - Rajendra S Singh
- Dermatopathology and Digital Pathology, Summit Health, Berkley Heights, New Jersey (Singh)
| | - Faisal Mahmood
- the Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts (Song, Mahmood)
| | - Marilyn M Bui
- Department of Machine Learning, Pathology, Moffitt Cancer Center and Research Institute, Tampa, Florida (Bui)
| | - Anil V Parwani
- the Department of Pathology, The Ohio State University, Columbus (Parwani)
| |
Collapse
|
7
|
Rowan NJ. Embracing a Penta helix hub framework for co-creating sustaining and potentially disruptive sterilization innovation that enables artificial intelligence and sustainability: A scoping review. THE SCIENCE OF THE TOTAL ENVIRONMENT 2025; 972:179018. [PMID: 40088793 DOI: 10.1016/j.scitotenv.2025.179018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2024] [Revised: 02/26/2025] [Accepted: 02/27/2025] [Indexed: 03/17/2025]
Abstract
The supply of safe pipeline medical devices is of paramount importance. Opportunities exist to transform reusable medical devices for improved processing that meets diverse patient needs. There is increased interest in multi-actor hub frameworks to meet innovation challenges globally. The purpose of this scoping paper was to identify critical decontamination and sterilization needs for the medtech and pharmaceutical sectors with a focus on understanding how to effectively use the Penta helix hub framework that combines academia, industry, healthcare, policy-makers/regulators and patients/society. A PRISMA scoping review of PubMed publications was conducted over the period 2010 to January 2025. Thirty of the 124 'helix hub' papers addressed innovation where only 3 of 16 healthcare-focused helices used or mentioned the need for key performance indicators (KPIs). Early-phase helix innovation ecosystems are mainly supported by qualitative or non-empirical data. This review explores multi-actor needs along with describing quantifiable KPIs at micro (end-user), meso (innovation hub) and macro (regional, national and international) levels. This integrated Penta hub approach will help to effectively plan, co-create, manage, analyse and utilize voluminous data, for example there are ca. 60,000 and 56,000 publications per year on artificial intelligence (AI) and medical devices respectively along, with some 35,000 adverse reports on devices submitted to the US FDA. This review addresses sustaining and potentially disruptive opportunities for decontamination and sterilization that includes the use of AI-enabled devices, bespoke training and sustainability.
Collapse
Affiliation(s)
- Neil J Rowan
- Faculty of Science and Health, Midlands Campus, Technological University of the Shannon, Ireland; Centre for Sustainable Disinfection and Sterilization, Technological University of the Shannon, Ireland; CURAM Research Centre for Medical Devices, University of Galway, Ireland.
| |
Collapse
|
8
|
Zhang Z, Sanders HS, Dragun V, Cole S, Smith BD. Fluorescent Molecular Probe for Imaging Hypoxia in 2D Cell Culture Monolayers and 3D Tumor Spheroids: The Cell Membrane Partition Model for Predicting Probe Distribution in a Spheroid. ACS APPLIED MATERIALS & INTERFACES 2025; 17:18046-18058. [PMID: 40079788 DOI: 10.1021/acsami.4c22228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/15/2025]
Abstract
Compared to cultured 2D cell monolayers, 3D multicellular spheroids are more realistic tumor models. Nonetheless, spheroids remain under-utilized in preclinical research, in part, because there is a lack of fluorescence sensors that can noninvasively interrogate all the individual cells within a spheroid. This present study describes a deep-red fluorogenic molecular probe for microscopic imaging of cells that contain a high level of nitroreductase enzyme activity as a biomarker of cell hypoxia. A first-generation version of the probe produced "turn-on" fluorescence in a 2D cell monolayer under hypoxic conditions; however, it was not useful in a 3D multicellular tumor spheroid because it only accumulated in the peripheral cells. To guide the probe structural optimization process, an intuitive theoretical membrane partition model was conceived to predict how a dosed probe will distribute within a 3D spheroid. The model identifies three limiting molecular diffusion pathways that are determined by a probe's membrane partition properties. A lipophilic probe with high membrane affinity rapidly becomes trapped in the membranes of the peripheral cells. In contrast, a very hydrophilic probe molecule with negligible membrane affinity diffuses rapidly through the spheroid intercellular space and rarely enters the cells. However, a probe molecule with intermediate membrane affinity undergoes sequential diffusion in and out of cells and distributes to all the cells within a spheroid. Using the model as a predictive tool, a second-generation fluorescent probe was prepared with a smaller and more hydrophilic molecular structure, and optical sectioning using structured illumination or light sheet microscopy revealed roughly even probe diffusion throughout a tumor spheroid. The membrane permeation model is likely to be broadly applicable for the structural optimization of various classes of molecules and nanoparticles to enable even distribution within a tumor spheroid.
Collapse
Affiliation(s)
- Zhumin Zhang
- Department of Chemistry and Biochemistry, University of Notre Dame, 251 Nieuwland Science Hall, Notre Dame, Indiana 46556, United States
| | - Hailey S Sanders
- Department of Chemistry and Biochemistry, University of Notre Dame, 251 Nieuwland Science Hall, Notre Dame, Indiana 46556, United States
| | - Vivienne Dragun
- Department of Chemistry and Biochemistry, University of Notre Dame, 251 Nieuwland Science Hall, Notre Dame, Indiana 46556, United States
| | - Sara Cole
- Notre Dame Integrated Imaging Facility, University of Notre Dame, Notre Dame, Indiana 46556, United States
| | - Bradley D Smith
- Department of Chemistry and Biochemistry, University of Notre Dame, 251 Nieuwland Science Hall, Notre Dame, Indiana 46556, United States
| |
Collapse
|
9
|
Duque DDA, Meira DD, Altoé LSC, Casotti MC, Lopes TJDS, Louro ID, Varejão FM. Using machine learning to predict Hemophilia A severity. Curr Res Transl Med 2025; 73:103508. [PMID: 40121975 DOI: 10.1016/j.retram.2025.103508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2024] [Revised: 03/09/2025] [Accepted: 03/16/2025] [Indexed: 03/25/2025]
Abstract
Hemophilia A is a rare genetic condition that predominantly affects men and is characterized by a deficiency in Factor VIII clotting (FVIII). This research focuses on the development of a classification model to predict the severity of Hemophilia A, using data from point mutations in the FVIII protein. The study employs a variety of classification models, including RandomForest, XGBoost, and LightGBM, and performs a robust analysis of the data to select the most relevant features. The final model achieved an accuracy of 65.5 %, demonstrating significant performance against a simple gaussian naive bayes model that achieves 51.1 % of accuracy. Although the model cannot yet replace the FVIII measurement test in the blood for diagnostic purposes, the results represent a significant advance in Hemophilia A research. This work provides data analysis that deepens the understanding of the characteristics of the FVIII protein and contributes to the development of models capable of classifying the severity of this condition into its three possible classes: mild, moderate, or severe.
Collapse
Affiliation(s)
- Daniel de Almeida Duque
- Departamento de Ciências Biológicas, Núcleo de Genética Humana e Molecular, Centro de Ciências Humanas e Naturais, Universidade Federal do Espírito Santo (UFES), Av. Fernando Ferrari, N. 514, Prédio Ciências Biológicas, Bloco A, Sala 106, Vitória, Espírito Santo, Brazil
| | - Débora Dummer Meira
- Departamento de Ciências Biológicas, Núcleo de Genética Humana e Molecular, Centro de Ciências Humanas e Naturais, Universidade Federal do Espírito Santo (UFES), Av. Fernando Ferrari, N. 514, Prédio Ciências Biológicas, Bloco A, Sala 106, Vitória, Espírito Santo, Brazil.
| | - Lorena Souza Castro Altoé
- Departamento de Ciências Biológicas, Núcleo de Genética Humana e Molecular, Centro de Ciências Humanas e Naturais, Universidade Federal do Espírito Santo (UFES), Av. Fernando Ferrari, N. 514, Prédio Ciências Biológicas, Bloco A, Sala 106, Vitória, Espírito Santo, Brazil
| | - Matheus Correia Casotti
- Departamento de Ciências Biológicas, Núcleo de Genética Humana e Molecular, Centro de Ciências Humanas e Naturais, Universidade Federal do Espírito Santo (UFES), Av. Fernando Ferrari, N. 514, Prédio Ciências Biológicas, Bloco A, Sala 106, Vitória, Espírito Santo, Brazil
| | | | - Iuri Drumond Louro
- Departamento de Ciências Biológicas, Núcleo de Genética Humana e Molecular, Centro de Ciências Humanas e Naturais, Universidade Federal do Espírito Santo (UFES), Av. Fernando Ferrari, N. 514, Prédio Ciências Biológicas, Bloco A, Sala 106, Vitória, Espírito Santo, Brazil
| | - Flávio Miguel Varejão
- Departamento de Informática, Universidade Federal do Espírito Santo (UFES), Espírito Santo, Brazil
| |
Collapse
|
10
|
Rama NJG, Sousa I. Bridging the gap: The role of technological advances in shaping gastrointestinal oncological outcomes. World J Gastrointest Oncol 2025; 17:101752. [PMID: 40092923 PMCID: PMC11866242 DOI: 10.4251/wjgo.v17.i3.101752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/25/2024] [Revised: 11/13/2024] [Accepted: 12/16/2024] [Indexed: 02/14/2025] Open
Abstract
Gastrointestinal (GI) cancers are highly prevalent and considered a major global health challenge. Their approach has undergone a remarkable transformation over the past years due to the development of new technologies that enabled better outcomes regarding their diagnosis and management. These include artificial intelligence, robotics, next-generation sequencing and personalized medicine. Nonetheless, the integration of these advances into everyday clinical practice remains complex and challenging as we are still trying to figure out if these innovations tangibly improve oncological outcomes or if the current state of art should remain as the gold standard for the treatment of these patients. Additionally, there are also some issues regarding ethical subjects, data privacy, finances and governance. Precision surgery concept has evolved considerably over the past decades, especially for oncological patients. It aims to customize medical treatments and to operate on those patients who most likely will benefit from a specific surgical procedure. In the future, to improve GI oncological outcomes, a delicate balance between technological advances adoption and evidence-based care should be chased. As we move forward, the question will be to harness the power of innovation while keeping up the highest standards of patient care.
Collapse
Affiliation(s)
- Nuno J G Rama
- Division of Colorectal Surgical, Leiria Hospital Centre, Leiria 2410-021, Portugal
| | - Inês Sousa
- Department of Surgical, Leiria Hospital Centre, Leiria 2410-021, Portugal
| |
Collapse
|
11
|
Fogarty B, García-Martínez A, Chawla NV, Serván-Mori E. Social and economic predictors of under-five stunting in Mexico: a comprehensive approach through the XGB model. J Glob Health 2025; 15:04065. [PMID: 40084528 PMCID: PMC11907376 DOI: 10.7189/jogh.15.04065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2025] Open
Abstract
Background The multifaceted issue of childhood stunting in low- and middle-income countries has a profound and enduring impact on children's well-being, cognitive development, and future earning potential. Childhood stunting arises from a complex interplay of genetic, environmental, and socio-cultural factors. It requires a comprehensive approach across nutrition, education, healthcare, and poverty reduction sectors to mitigate its prevalence and short- and long-term effects. The Mexican case presents a distinct challenge, as the country has experienced the recent dissolution of social health security programmes, rising poverty rates, and reduced government expenditures for childhood well-being. Methods We propose a machine learning approach to understand the contribution of social and economic determinants to childhood stunting risk in Mexico. Using data from the 2006-2018 population-based Mexican National Health and Nutrition Surveys, six different machine learning classification algorithms were used to model and identify the most important predictors of childhood stunting. Findings Among the six classification algorithms tested, Extreme Gradient Boosting (XGB) obtained the highest Youden Index value, effectively balancing the correct classification of children with and without stunting. In the XGB model, the most important predictor for classifying childhood stunting is the household's socioeconomic status, followed by the state of residence, the child's age, indigenous population status, the household's portion of children under five years old, and the local area's deprivation level. Conclusions This paper contributes to understanding the structural determinants of stunting in children, emphasising the importance of implementing tailored interventions and policies, especially given our findings that highlight indigenous status and local deprivation as key predictors. In the context of diminishing health initiatives, this underscores the urgent need for specific, targeted, and sustainable actions to prevent and address a potential rise in stunting in similar settings. Keywords social and economic deprivation, stunting, children, machine learning, XGB model, Mexico.
Collapse
Affiliation(s)
- Brian Fogarty
- Lucy Family Institute for Data & Society, University of Notre Dame, South Bend, Indiana, USA
| | | | - Nitesh V Chawla
- Lucy Family Institute for Data & Society, University of Notre Dame, South Bend, Indiana, USA
| | - Edson Serván-Mori
- Centre for Health Systems Research, the National Institute of Public Health, Cuernavaca, Morelos, Mexico
| |
Collapse
|
12
|
Taye EA, Woubet EY, Hailie GY, Arage FG, Zerihun TE, Zegeye AT, Zeleke TC, Kassaw AT. Application of the random forest algorithm to predict skilled birth attendance and identify determinants among reproductive-age women in 27 Sub-Saharan African countries; machine learning analysis. BMC Public Health 2025; 25:901. [PMID: 40050868 PMCID: PMC11887244 DOI: 10.1186/s12889-025-22007-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2025] [Accepted: 02/19/2025] [Indexed: 03/09/2025] Open
Abstract
INTRODUCTION Maternal mortality refers to a mother's death owing to complications arising from childbirth or pregnancy. This issue is a forefront public health challenge around the globe which is pronounced in low- and middle-income countries, particularly in the sub-Saharan African regions where the burdens remain significantly high. Moreover, this problem is further complicated in developing countries due to limited access to antenatal care and the shortage of skilled birth attendants. So far, considerable improvements in the health status of many populations have been reported in developing countries. Nonetheless, the MDGs to reduce maternal and newborn mortality unmet in many SSA nations. Leveraging machine learning approaches allows us to better understand these constraints and predict skilled birth attendance among reproductive age women, providing actionable insights for policy and intervention. OBJECTIVE This study aimed to predict skill birth attendance and identify its determinants among reproductive age women in 27 SSA countries using machine learning algorithm. METHODS Using data from the Demographic and Health Surveys (2016-2024) across 27 SSA countries, we analyzed responses from 198,707 reproductive age women. The Random Forest classifier, complemented by SHAP for feature interpretability, was employed for prediction and analysis. Data preprocessing included K-nearest neighbor imputation for missing values, SMOTE for handling class imbalance, and Recursive Feature Elimination for feature selection. Model performance was evaluated using metrics such as accuracy, recall, F1 score, and AUC-ROC. RESULTS The Random Forest model demonstrated robust performance, achieving an AUC-ROC of 92%, recall of 96%, accuracy of 92%, precision of 93 and F1 score of 93%. The SHAP analysis identifies key predictors of skilled birth attendance, including facility delivery, maternal education, higher wealth index, urban residence, reduced distance to healthcare facilities, media exposure, and internet use. CONCLUSION AND RECOMMENDATIONS The findings highlight the potential of machine learning to identify critical predictors of skilled birth attendance to inform targeted interventions. Addressing socioeconomic and educational disparities, enhancing healthcare access, and implementing tailored cessation programs are crucial to enhance skilled birth attendance in this vulnerable population.
Collapse
Affiliation(s)
- Eliyas Addisu Taye
- Department of Health Informatics, Institute of Public Health, University of Gondar, Gondar, Ethiopia.
| | - Eden Yitbarek Woubet
- Department of Reproductive Health, Institute of Public Health, University of Gondar, Gondar, Ethiopia
| | - Gabrela Yimer Hailie
- Department of Environmental and Occupational Health and Safety, Institute of Public Health, University of Gondar, Gondar, Ethiopia
| | - Fetlework Gubena Arage
- Department of Epidemiology and Biostatistics, Institute of Public Health, College of Medicine and Health Sciences, University of Gondar, Gondar, Ethiopia
| | - Tigabu Eskeziya Zerihun
- Department of Clinical Pharmacy, Pharmacy Education and Clinical Services Directorate, College of Health Sciences, Debre Tabor University, Debre Tabor, Ethiopia
| | - Adem Tsegaw Zegeye
- Department of Health Informatics, Institute of Public Health, University of Gondar, Gondar, Ethiopia
| | - Tarekegn Cheklie Zeleke
- Department of Optometry, School of Medicine, Tertiary Eye Care and Training Center, University of Gondar, Gondar, Ethiopia
| | - Abel Temeche Kassaw
- Department of Clinical Pharmacy, Pharmacy Education and Clinical Services Directorate, College of Health Sciences, Debre Tabor University, Debre Tabor, Ethiopia
| |
Collapse
|
13
|
Sun Y, Tan W, Gu Z, He R, Chen S, Pang M, Yan B. A data-efficient strategy for building high-performing medical foundation models. Nat Biomed Eng 2025:10.1038/s41551-025-01365-0. [PMID: 40044818 DOI: 10.1038/s41551-025-01365-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 02/04/2025] [Indexed: 04/04/2025]
Abstract
Foundation models are pretrained on massive datasets. However, collecting medical datasets is expensive and time-consuming, and raises privacy concerns. Here we show that synthetic data generated via conditioning with disease labels can be leveraged for building high-performing medical foundation models. We pretrained a retinal foundation model, first with approximately one million synthetic retinal images with physiological structures and feature distribution consistent with real counterparts, and then with only 16.7% of the 904,170 real-world colour fundus photography images required in a recently reported retinal foundation model (RETFound). The data-efficient model performed as well or better than RETFound across nine public datasets and four diagnostic tasks; and for diabetic-retinopathy grading, it used only 40% of the expert-annotated training data used by RETFound. We also support the generalizability of the data-efficient strategy by building a classifier for the detection of tuberculosis on chest X-ray images. The text-conditioned generation of synthetic data may enhance the performance and generalization of medical foundation models.
Collapse
Affiliation(s)
- Yuqi Sun
- Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
| | - Weimin Tan
- Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
| | - Zhuoyao Gu
- Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
| | - Ruian He
- Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
| | - Siyuan Chen
- Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
| | - Miao Pang
- Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China
| | - Bo Yan
- Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, China.
| |
Collapse
|
14
|
Zhang Z, Zhou X, Fang Y, Xiong Z, Zhang T. AI-driven 3D bioprinting for regenerative medicine: From bench to bedside. Bioact Mater 2025; 45:201-230. [PMID: 39651398 PMCID: PMC11625302 DOI: 10.1016/j.bioactmat.2024.11.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2024] [Revised: 11/01/2024] [Accepted: 11/16/2024] [Indexed: 12/11/2024] Open
Abstract
In recent decades, 3D bioprinting has garnered significant research attention due to its ability to manipulate biomaterials and cells to create complex structures precisely. However, due to technological and cost constraints, the clinical translation of 3D bioprinted products (BPPs) from bench to bedside has been hindered by challenges in terms of personalization of design and scaling up of production. Recently, the emerging applications of artificial intelligence (AI) technologies have significantly improved the performance of 3D bioprinting. However, the existing literature remains deficient in a methodological exploration of AI technologies' potential to overcome these challenges in advancing 3D bioprinting toward clinical application. This paper aims to present a systematic methodology for AI-driven 3D bioprinting, structured within the theoretical framework of Quality by Design (QbD). This paper commences by introducing the QbD theory into 3D bioprinting, followed by summarizing the technology roadmap of AI integration in 3D bioprinting, including multi-scale and multi-modal sensing, data-driven design, and in-line process control. This paper further describes specific AI applications in 3D bioprinting's key elements, including bioink formulation, model structure, printing process, and function regulation. Finally, the paper discusses current prospects and challenges associated with AI technologies to further advance the clinical translation of 3D bioprinting.
Collapse
Affiliation(s)
- Zhenrui Zhang
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, PR China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, PR China
- “Biomanufacturing and Engineering Living Systems” Innovation International Talents Base (111 Base), Beijing, 100084, PR China
| | - Xianhao Zhou
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, PR China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, PR China
- “Biomanufacturing and Engineering Living Systems” Innovation International Talents Base (111 Base), Beijing, 100084, PR China
| | - Yongcong Fang
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, PR China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, PR China
- “Biomanufacturing and Engineering Living Systems” Innovation International Talents Base (111 Base), Beijing, 100084, PR China
- State Key Laboratory of Tribology in Advanced Equipment, Tsinghua University, Beijing, 100084, PR China
| | - Zhuo Xiong
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, PR China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, PR China
- “Biomanufacturing and Engineering Living Systems” Innovation International Talents Base (111 Base), Beijing, 100084, PR China
| | - Ting Zhang
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, PR China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, PR China
- “Biomanufacturing and Engineering Living Systems” Innovation International Talents Base (111 Base), Beijing, 100084, PR China
- State Key Laboratory of Tribology in Advanced Equipment, Tsinghua University, Beijing, 100084, PR China
| |
Collapse
|
15
|
Qian X, Pei J, Han C, Liang Z, Zhang G, Chen N, Zheng W, Meng F, Yu D, Chen Y, Sun Y, Zhang H, Qian W, Wang X, Er Z, Hu C, Zheng H, Shen D. A multimodal machine learning model for the stratification of breast cancer risk. Nat Biomed Eng 2025; 9:356-370. [PMID: 39633027 DOI: 10.1038/s41551-024-01302-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 10/31/2024] [Indexed: 12/07/2024]
Abstract
Machine learning models for the diagnosis of breast cancer can facilitate the prediction of cancer risk and subsequent patient management among other clinical tasks. For the models to impact clinical practice, they ought to follow standard workflows, help interpret mammography and ultrasound data, evaluate clinical contextual information, handle incomplete data and be validated in prospective settings. Here we report the development and testing of a multimodal model leveraging mammography and ultrasound modules for the stratification of breast cancer risk based on clinical metadata, mammography and trimodal ultrasound (19,360 images of 5,216 breasts) from 5,025 patients with surgically confirmed pathology across medical centres and scanner manufacturers. Compared with the performance of experienced radiologists, the model performed similarly at classifying tumours as benign or malignant and was superior at pathology-level differential diagnosis. With a prospectively collected dataset of 191 breasts from 187 patients, the overall accuracies of the multimodal model and of preliminary pathologist-level assessments of biopsied breast specimens were similar (90.1% vs 92.7%, respectively). Multimodal models may assist diagnosis in oncology.
Collapse
Affiliation(s)
- Xuejun Qian
- School of Biomedical Engineering, ShanghaiTech University, Shanghai, China.
- State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai, China.
- Shanghai United Imaging Intelligence Co. Ltd., Shanghai, China.
| | - Jing Pei
- Department of Breast Surgery, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Department of General Surgery, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Chunguang Han
- Department of General Surgery, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Zhiying Liang
- School of Biomedical Engineering, ShanghaiTech University, Shanghai, China
| | - Gaosong Zhang
- Department of Ultrasound, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Na Chen
- Department of Ultrasound, Nanjing Hospital Affiliated to Nanjing Medical University, Nanjing, China
| | - Weiwei Zheng
- Department of Ultrasound, Xuancheng People's Hospital, Xuancheng, China
| | - Fanlun Meng
- Department of Breast Surgery, Fuyang Cancer Hospital, Fuyang, China
| | - Dongsheng Yu
- School of Biomedical Engineering, ShanghaiTech University, Shanghai, China
| | - Yixuan Chen
- School of Biomedical Engineering, ShanghaiTech University, Shanghai, China
| | - Yiqun Sun
- Department of Radiology, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Hanqi Zhang
- Department of Ultrasound, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Wei Qian
- Department of Radiology, The Second Affiliated Hospital of Zhejiang University, Hangzhou, China
| | - Xia Wang
- Department of Radiology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Zhuoran Er
- School of Biomedical Engineering, ShanghaiTech University, Shanghai, China
| | - Chenglu Hu
- School of Biomedical Engineering, ShanghaiTech University, Shanghai, China
| | - Hui Zheng
- Department of Ultrasound, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Dinggang Shen
- School of Biomedical Engineering, ShanghaiTech University, Shanghai, China
- State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai, China
- Shanghai United Imaging Intelligence Co. Ltd., Shanghai, China
- Shanghai Clinical Research and Trial Center, Shanghai, China
| |
Collapse
|
16
|
Woo JJ, Yang AJ, Huang RY, Ramkumar PN. Editorial Commentary: Thoughtful Application of Artificial Intelligence Technique Improves Diagnostic Accuracy and Supportive Clinical Decision-Making. Arthroscopy 2025; 41:585-587. [PMID: 39675394 DOI: 10.1016/j.arthro.2024.12.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/27/2024] [Revised: 12/07/2024] [Accepted: 12/08/2024] [Indexed: 12/17/2024]
Abstract
Medical research within areas of deep learning, particularly in computer vision for medical imaging, has shown promise over the past decade with an increasing volume of technical papers published in orthopaedics related to imaging artificial intelligence (AI). However, as more tools and models are developed and deployed, it is easy for clinicians to get overwhelmed with the different types of models, leaving "artificial intelligence" as an empty buzzword where true value can be unclear. As with surgery, the techniques of deep learning require thoughtful application and cannot follow a one-size-fits-all approach as different problems require differential levels of technical complexity with model application. Moreover, the application of AI-based clinical tools should be both adjunctive and transparent in their stepwise integration within clinical medicine to provide additive insight. As a medical profession, we must together decide how, when, and where we want AI-based applications to offer insight.
Collapse
Affiliation(s)
- Joshua J Woo
- Commons Clinic (J.J.W.); Warren Alpert Brown School of Medicine (J.J.W., A.J.Y., R.Y.H.)
| | - Andrew J Yang
- Commons Clinic (J.J.W.); Warren Alpert Brown School of Medicine (J.J.W., A.J.Y., R.Y.H.)
| | - Ryan Y Huang
- Commons Clinic (J.J.W.); Warren Alpert Brown School of Medicine (J.J.W., A.J.Y., R.Y.H.)
| | - Prem N Ramkumar
- Commons Clinic (J.J.W.); Warren Alpert Brown School of Medicine (J.J.W., A.J.Y., R.Y.H.)
| |
Collapse
|
17
|
Asadi F, Rahimi M, Ramezanghorbani N, Almasi S. Comparing the Effectiveness of Artificial Intelligence Models in Predicting Ovarian Cancer Survival: A Systematic Review. Cancer Rep (Hoboken) 2025; 8:e70138. [PMID: 40103563 PMCID: PMC11920737 DOI: 10.1002/cnr2.70138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Revised: 12/23/2024] [Accepted: 01/27/2025] [Indexed: 03/20/2025] Open
Abstract
BACKGROUND This systematic review investigates the use of machine learning (ML) algorithms in predicting survival outcomes for ovarian cancer (OC) patients. Key prognostic endpoints, including overall survival (OS), recurrence-free survival (RFS), progression-free survival (PFS), and treatment response prediction (TRP), are examined to evaluate the effectiveness of these algorithms and identify significant features that influence predictive accuracy. RECENT FINDINGS A thorough search of four major databases-PubMed, Scopus, Web of Science, and Cochrane-resulted in 2400 articles published within the last decade, with 32 studies meeting the inclusion criteria. Notably, most publications emerged after 2021. Commonly used algorithms for survival prediction included random forest, support vector machines, logistic regression, XGBoost, and various deep learning models. Evaluation metrics such as area under the curve (AUC) (18 studies), concordance index (C-index) (11 studies), and accuracy (11 studies) were frequently employed. Age at diagnosis, tumor stage, CA-125 levels, and treatment-related factors were consistently highlighted as significant predictors, emphasizing their relevance in OC prognosis. CONCLUSION ML models demonstrate considerable potential for predicting OC survival outcomes; however, challenges persist regarding model accuracy and interpretability. Incorporating diverse data types-such as clinical, imaging, and molecular datasets-holds promise for enhancing predictive capabilities. Future advancements will depend on integrating heterogeneous data sources with multimodal ML approaches, which are crucial for improving prognostic precision in OC.
Collapse
Affiliation(s)
- Farkhondeh Asadi
- Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Milad Rahimi
- Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Nahid Ramezanghorbani
- Department of Development & Coordination Scientific Information and Publications, Deputy of Research & Technology, Ministry of Health & Medical Education, Tehran, Iran
| | - Sohrab Almasi
- Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
18
|
Santos CS, Amorim-Lopes M. Externally validated and clinically useful machine learning algorithms to support patient-related decision-making in oncology: a scoping review. BMC Med Res Methodol 2025; 25:45. [PMID: 39984835 PMCID: PMC11843972 DOI: 10.1186/s12874-025-02463-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 01/03/2025] [Indexed: 02/23/2025] Open
Abstract
BACKGROUND This scoping review systematically maps externally validated machine learning (ML)-based models in cancer patient care, quantifying their performance, and clinical utility, and examining relationships between models, cancer types, and clinical decisions. By synthesizing evidence, this study identifies, strengths, limitations, and areas requiring further research. METHODS The review followed the Joanna Briggs Institute's methodology, Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews guidelines, and the Population, Concept, and Context mnemonic. Searches were conducted across Embase, IEEE Xplore, PubMed, Scopus, and Web of Science (January 2014-September 2022), targeting English-language quantitative studies in Q1 journals (SciMago Journal and Country Ranking > 1) that used ML to evaluate clinical outcomes for human cancer patients with commonly available data. Eligible models required external validation, clinical utility assessment, and performance metric reporting. Studies involving genetics, synthetic patients, plants, or animals were excluded. Results were presented in tabular, graphical, and descriptive form. RESULTS From 4023 deduplicated abstracts and 636 full-text reviews, 56 studies (2018-2022) met the inclusion criteria, covering diverse cancer types and applications. Convolutional neural networks were most prevalent, demonstrating high performance, followed by gradient- and decision tree-based algorithms. Other algorithms, though underrepresented, showed promise. Lung and digestive system cancers were most frequently studied, focusing on diagnosis and outcome predictions. Most studies were retrospective and multi-institutional, primarily using image-based data, followed by text-based and hybrid approaches. Clinical utility assessments involved 499 clinicians and 12 tools, indicating improved clinician performance with AI assistance and superior performance to standard clinical systems. DISCUSSION Interest in ML-based clinical decision-making has grown in recent years alongside increased multi-institutional collaboration. However, small sample sizes likely impacted data quality and generalizability. Persistent challenges include limited international validation across ethnicities, inconsistent data sharing, disparities in validation metrics, and insufficient calibration reporting, hindering model comparison reliability. CONCLUSION Successful integration of ML in oncology decision-making requires standardized data and methodologies, larger sample sizes, greater transparency, and robust validation and clinical utility assessments. OTHER Financed by FCT-Fundação para a Ciência e a Tecnologia (Portugal, project LA/P/0063/2020, grant 2021.09040.BD) as part of CSS's Ph.D. This work was not registered.
Collapse
Affiliation(s)
- Catarina Sousa Santos
- Institute for Systems and Computer Engineering, Technology and Science (INESC TEC), Porto, Portugal.
| | - Mário Amorim-Lopes
- Institute for Systems and Computer Engineering, Technology and Science (INESC TEC), Porto, Portugal
| |
Collapse
|
19
|
Gensheimer MF, Lu J, Ramchandran K. Comparison of 1-year mortality predictions from vendor-supplied versus academic model for cancer patients. PeerJ 2025; 13:e18958. [PMID: 39959833 PMCID: PMC11827575 DOI: 10.7717/peerj.18958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2024] [Accepted: 01/17/2025] [Indexed: 02/18/2025] Open
Abstract
Purpose The Epic End of Life Care Index (EOLCI) predicts 1-year mortality for a general adult population using medical record data. It is deployed at various medical centers, but we are not aware of an independent validation. We evaluated its performance for predicting 1-year mortality in patients with metastatic cancer, comparing it against an academic machine learning model designed for cancer patients. We focused on this patient population because of their high short-term mortality risk and because we had access to the comparator model predictions. Materials and Methods This retrospective analysis included adult outpatients with metastatic cancer from four outpatient sites. Performance metrics included AUC for 1-year mortality and positive predictive value of high-risk score. Results There were 1,399 patients included. Median age at first EOLCI prediction was 67 and 55% were female. A total of 1,283 patients were evaluable for 1-year mortality; of these, 297 (23%) died within 1 year. AUC for 1-year mortality for EOLCI and academic model was 0.73 (95% CI [0.70-0.76]) and 0.82 (95% CI [0.80-0.85]), respectively. Positive predictive value was 0.38 and 0.65, respectively. Conclusion The EOLCI's discrimination performance was lower than the vendor-stated value (AUC of 0.86) and the academic model's performance. Vendor-supplied machine learning models should be independently validated, particularly in specialized patient populations, to ensure accuracy and reliability.
Collapse
Affiliation(s)
- Michael F. Gensheimer
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, CA, United States
| | - Jonathan Lu
- Department of Medicine, Stanford University School of Medicine, Stanford, CA, United States
| | - Kavitha Ramchandran
- Department of Medicine, Stanford University School of Medicine, Stanford, CA, United States
| |
Collapse
|
20
|
Wong E, Bermudez-Cañete A, Campbell MJ, Rhew DC. Bridging the Digital Divide: A Practical Roadmap for Deploying Medical Artificial Intelligence Technologies in Low-Resource Settings. Popul Health Manag 2025. [PMID: 39899377 DOI: 10.1089/pop.2024.0222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2025] Open
Abstract
In recent decades, the integration of artificial intelligence (AI) into health care has revolutionized diagnostics, treatment customization, and delivery. In low-resource settings, AI offers significant potential to address health care disparities exacerbated by shortages of medical professionals and other resources. However, implementing AI effectively and responsibly in these settings requires careful consideration of context-specific needs and barriers to equitable care. This article explores the practical deployment of AI in low-resource environments through a review of existing literature and interviews with experts, ranging from health care providers and administrators to AI tool developers and government consultants. The authors highlight 4 critical areas for effective AI deployment: infrastructure requirements, deployment and data management, education and training, and responsible AI practices. By addressing these aspects, the proposed framework aims to guide sustainable AI integration, minimizing risk, and enhancing health care access in underserved regions.
Collapse
Affiliation(s)
- Evelyn Wong
- School of Medicine, Stanford University, Stanford, California, USA
| | | | | | - David C Rhew
- Microsoft Corporation, Redmond, Washington, USA
- Division of Primary Care and Population Health, Department of Medicine, Stanford University, Stanford, California, USA
| |
Collapse
|
21
|
Hresko DJ, Drotar P, Ngo QC, Kumar DK. Enhanced Domain Adaptation for Foot Ulcer Segmentation Through Mixing Self-Trained Weak Labels. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2025; 38:455-466. [PMID: 39020158 PMCID: PMC11810871 DOI: 10.1007/s10278-024-01193-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 05/04/2024] [Accepted: 05/22/2024] [Indexed: 07/19/2024]
Abstract
Wound management requires the measurement of the wound parameters such as its shape and area. However, computerized analysis of the wound suffers the challenge of inexact segmentation of the wound images due to limited or inaccurate labels. It is a common scenario that the source domain provides an abundance of labeled data, while the target domain provides only limited labels. To overcome this, we propose a novel approach that combines self-training learning and mixup augmentation. The neural network is trained on the source domain to generate weak labels on the target domain via the self-training process. In the second stage, generated labels are mixed up with labels from the source domain to retrain the neural network and enhance generalization across diverse datasets. The efficacy of our approach was evaluated using the DFUC 2022, FUSeg, and RMIT datasets, demonstrating substantial improvements in segmentation accuracy and robustness across different data distributions. Specifically, in single-domain experiments, segmentation on the DFUC 2022 dataset scored a dice score of 0.711, while the score on the FUSeg dataset achieved 0.859. For domain adaptation, when these datasets were used as target datasets, the dice scores were 0.714 for DFUC 2022 and 0.561 for FUSeg.
Collapse
Affiliation(s)
- David Jozef Hresko
- IISLab, Technical University of Kosice, Letna 1/9, Kosice, 04200, Kosicky Kraj, Slovakia
| | - Peter Drotar
- IISLab, Technical University of Kosice, Letna 1/9, Kosice, 04200, Kosicky Kraj, Slovakia.
| | - Quoc Cuong Ngo
- School of Engineering, RMIT University, 80/445 Swanston St, Melbourne, 3000, VIC, Australia
| | - Dinesh Kant Kumar
- School of Engineering, RMIT University, 80/445 Swanston St, Melbourne, 3000, VIC, Australia
| |
Collapse
|
22
|
Ryu S, Kim JH, Choi YJ, Lee JS. Generating synthetic CT images from unpaired head and neck CBCT images and validating the importance of detailed nasal cavity acquisition through simulations. Comput Biol Med 2025; 185:109568. [PMID: 39700859 DOI: 10.1016/j.compbiomed.2024.109568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 11/25/2024] [Accepted: 12/10/2024] [Indexed: 12/21/2024]
Abstract
BACKGROUND AND OBJECTIVE Computed tomography (CT) of the head and neck is crucial for diagnosing internal structures. The demand for substituting traditional CT with cone beam CT (CBCT) exists because of its cost-effectiveness and reduced radiation exposure. However, CBCT cannot accurately depict airway shapes owing to image noise. This study proposes a strategy utilizing a cycle-consistent generative adversarial network (cycleGAN) for denoising CBCT images with various loss functions and augmentation strategies, resulting in the generation of denoised synthetic CT (sCT) images. Furthermore, through a rule-based approach, we were able to automatically segment the upper airway in sCT images with high accuracy. Additionally, we conducted an analysis of the impact of finely segmented nasal cavities on airflow using computational fluid dynamics (CFD). METHODS We trained the cycleGAN model using various loss functions and compared the quality of the sCT images generated by each model. We improved the artifact removal performance by incorporating CT images with added Gaussian noise augmentation into the training dataset. We developed a rule-based automatic segmentation methodology using threshold and watershed algorithms to compare the accuracy of airway segmentation for noise-reduced sCT and original CBCT. Furthermore, we validated the significance of the nasal cavity by conducting CFD based on automatically segmented shapes obtained from sCT. RESULT The generated sCT images exhibited improved quality, with the mean absolute error decreasing from 161.60 to 100.54, peak signal-to-noise ratio increasing from 22.33 to 28.65, and structural similarity index map increasing from 0.617 to 0.865. Furthermore, by comparing the airway segmentation performances of CBCT and sCT using our proposed automatic rule-based algorithm, the Dice score improved from 0.849 to 0.960. Airway segmentation performance is closely associated with the accuracy of fluid dynamics simulations. Detailed airway segmentation is crucial for altering flow dynamics and contributes significantly to diagnostics. CONCLUSION Our deep learning methodology enhances the image quality of CBCT to provide anatomical information to medical professionals and enables precise and accurate biomechanical analysis. This allows clinicians to obtain precise quantitative metrics and facilitates accurate assessment.
Collapse
Affiliation(s)
- Susie Ryu
- Division of Obstructive Sleep Apnea Syndrome Diagnosis, School of Mechanical Engineering, College of Engineering, Yonsei University, Seoul, Republic of Korea
| | - Jun Hong Kim
- Division of Obstructive Sleep Apnea Syndrome Diagnosis, School of Mechanical Engineering, College of Engineering, Yonsei University, Seoul, Republic of Korea
| | - Yoon Jeong Choi
- Department of Orthodontics, The Institute of Craniofacial Deformity, Yonsei University College of Dentistry, Seoul, Republic of Korea; Department of Surgery, Division of Plastic and Reconstructive Surgery, Pediatric Craniofacial and Airway Orthodontics and Dental Sleep Medicine, Stanford University School of Medicine, Palo Alto, CA, USA
| | - Joon Sang Lee
- Division of Obstructive Sleep Apnea Syndrome Diagnosis, School of Mechanical Engineering, College of Engineering, Yonsei University, Seoul, Republic of Korea; The Center for Hemodynamic Precision Medical Platform, Seoul, Republic of Korea.
| |
Collapse
|
23
|
Zhang Y, Long K, Gong Z, Dai R, Zhang S. Postoperative fever following surgery for oral cancer: Incidence, risk factors, and the formulation of a machine learning-based predictive model. BMC Oral Health 2025; 25:165. [PMID: 39885528 PMCID: PMC11783824 DOI: 10.1186/s12903-025-05555-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Accepted: 01/24/2025] [Indexed: 02/01/2025] Open
Abstract
BACKGROUND Postoperative fever (POF) is a common occurrence in patients undergoing major surgery, presenting challenges and burdens for both patients and surgeons yet. This study endeavors to examine the incidence, identify risk factors, and establish a machine learning-based predictive model for POF following surgery of oral cancer. METHODS A total of seven hundred and twenty-seven consecutive patients undergoing radical resection of oral cancer were retrospectively investigated. The analysis encompassed 34 parameters, incorporating demographic and clinical characteristics, biochemical and hematological assay results, surgical-related data, hospitalization costs and stay in hospital. Six machine learning models were compared by the area under the receiver operating characteristic curve (AUC). The best-performing models were selected for further analyze, including feature importance evaluation and nomogram analysis, identifying key POF risk factors, and establish a comprehensive prediction model. RESULTS A total of 466 patients with surgery for oral cancer met the criteria, with an average age of (54.2 ± 11.1) years, including an POF group (n = 197) and a non-POF group (n = 269). The fever group has greater hospitalization costs, longer lengths of stay, and higher infection biochemical indicators (leucocyte ratio and erythrocyte sedimentation rate). Furthermore, Among the 6 machine learning models, logistic regression models performed best, with the higher AUC and accuracy. In univariate and multivariate logistic analysis showed that age, sex, reoperation, Charlson Comorbidity Index score (CCI), leukocyte, bleeding and blood transfusion were independent risk factors for POF of patients following surgery in oral cancer. Then seven variables were selected to establish the nomogram for predict the probability of POF by nomogram algorithm. CONCLUSIONS Postoperative fever patients following radical resection of oral cancer have greater burden. Machine learning algorithms can be effectively used to identify potential risk factors of POF, which may enhance individualized treatment plans in oral cancer patient during perioperative period.
Collapse
Affiliation(s)
- Yanling Zhang
- Department of Anaesthesiology, The Second Xiangya Hospital, Central South University, Changsha, Hunan, 410008, China
| | - Kun Long
- Department of Anaesthesiology, The Second Xiangya Hospital, Central South University, Changsha, Hunan, 410008, China
| | - Zhaojian Gong
- Department of Oral and Maxillofacial Surgery, The Second Xiangya Hospital, Central South University, Changsha, Hunan, China
| | - Ruping Dai
- Department of Anaesthesiology, The Second Xiangya Hospital, Central South University, Changsha, Hunan, 410008, China
| | - Shuiting Zhang
- Department of Anaesthesiology, The Second Xiangya Hospital, Central South University, Changsha, Hunan, 410008, China.
| |
Collapse
|
24
|
Seringa J, Hirata A, Pedro AR, Santana R, Magalhães T. Health Care Professionals and Data Scientists' Perspectives on a Machine Learning System to Anticipate and Manage the Risk of Decompensation From Patients With Heart Failure: Qualitative Interview Study. J Med Internet Res 2025; 27:e54990. [PMID: 39832170 PMCID: PMC11791461 DOI: 10.2196/54990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 07/30/2024] [Accepted: 10/26/2024] [Indexed: 01/22/2025] Open
Abstract
BACKGROUND Heart failure (HF) is a significant global health problem, affecting approximately 64.34 million people worldwide. The worsening of HF, also known as HF decompensation, is a major factor behind hospitalizations, contributing to substantial health care costs related to this condition. OBJECTIVE This study aimed to explore the perspectives of health care professionals and data scientists regarding the relevance, challenges, and potential benefits of using machine learning (ML) models to predict decompensation from patients with HF. METHODS A total of 13 individual, semistructured, qualitative interviews were conducted in Portugal between October 31, 2022, and June 23, 2023. Participants represented different health care specialties and were selected from different contexts and regions of the country to ensure a comprehensive understanding of the topic. Data saturation was determined as the point at which no new themes emerged from participants' perspectives, ensuring a sufficient sample size for analysis. The interviews were audio recorded, transcribed, and analyzed using MAXQDA (VERBI Software GmbH) through a reflexive thematic analysis. Two researchers (JS and AH) coded the interviews to ensure the consistency of the codes. Ethical approval was granted by the NOVA National School of Public Health ethics committee (CEENSP 14/2022), and informed consent was obtained from all participants. RESULTS The participants recognized the potential benefits of ML models for early detection, risk stratification, and personalized care of patients with HF. The importance of selecting appropriate variables for model development, such as rapid weight gain and symptoms, was emphasized. The use of wearables for recording vital signs was considered necessary, although challenges related to adoption among older patients were identified. Risk stratification emerged as a crucial aspect, with the model needing to identify patients at high-, medium-, and low-risk levels. Participants emphasized the need for a response model involving health care professionals to validate ML-generated alerts and determine appropriate interventions. CONCLUSIONS The study's findings highlight ML models' potential benefits and challenges for predicting HF decompensation. The relevance of ML models for improving patient outcomes, reducing health care costs, and promoting patient engagement in disease management is highlighted. Adequate variable selection, risk stratification, and response models were identified as essential components for the effective implementation of ML models in health care. In addition, the study identified technical, regulatory and ethical, and adoption and acceptance challenges that need to be overcome for the successful integration of ML models into clinical workflows. Interpretation of the findings suggests that future research should focus on more extensive and diverse samples, incorporate the patient perspective, and explore the impact of ML models on patient outcomes and personalized care in HF management. Incorporation of this study's findings into practice is expected to contribute to developing and implementing ML-based predictive models that positively impact HF management.
Collapse
Affiliation(s)
- Joana Seringa
- NOVA National School of Public Health, Public Health Research Centre, Comprehensive Health Research Center, NOVA University Lisbon, Lisbon, Portugal
| | - Anna Hirata
- NOVA National School of Public Health, NOVA University Lisbon, Lisbon, Portugal
| | - Ana Rita Pedro
- NOVA National School of Public Health, Public Health Research Centre, Comprehensive Health Research Center, NOVA University Lisbon, Lisbon, Portugal
| | - Rui Santana
- NOVA National School of Public Health, Public Health Research Centre, Comprehensive Health Research Center, NOVA University Lisbon, Lisbon, Portugal
| | - Teresa Magalhães
- NOVA National School of Public Health, Public Health Research Centre, Comprehensive Health Research Center, NOVA University Lisbon, Lisbon, Portugal
| |
Collapse
|
25
|
Chen CH, Hsieh KY, Huang KE, Cheng ET. Using the Regression Slope of Training Loss to Optimize Chest X-ray Generation in Deep Convolutional Generative Adversarial Networks. Cureus 2025; 17:e77391. [PMID: 39811723 PMCID: PMC11730489 DOI: 10.7759/cureus.77391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/13/2025] [Indexed: 01/16/2025] Open
Abstract
Diffusion models, variational autoencoders, and generative adversarial networks (GANs) are three common types of generative artificial intelligence models for image generation. Among these, GANs are the most frequently used for medical image generation and are often employed for data augmentation in various studies. However, due to the adversarial nature of GANs, where the generator and discriminator compete against each other, the training process can sometimes end with the model unable to generate meaningful images or even producing noise. This phenomenon is rarely discussed in the literature, and no studies have proposed solutions to address this issue. Such outcomes can introduce significant bias when GANs are used for data augmentation in medical image training. Moreover, GANs often require substantial computational power and storage, adding to the challenges. In this study, we used deep convolutional GANs for chest X-ray generation, and three typical training outcomes were found. Two scenarios generated meaningful medical images and one failed to produce usable images. By analyzing the loss history during training, we observed that the regression line of the overall losses tends to diverge slowly. After excluding outlier losses, we found that the slope of the regression line within the stable loss segment indicates the optimal point to terminate training, ensuring the generation of meaningful medical images.
Collapse
Affiliation(s)
- Chih-Hsiung Chen
- Department of Critical Care Medicine, Mennonite Christian Hospital, Hualien, TWN
| | - Kuang-Yu Hsieh
- Department of Critical Care Medicine, Mennonite Christian Hospital, Hualien, TWN
| | - Kuo-En Huang
- Department of Critical Care Medicine, Mennonite Christian Hospital, Hualien, TWN
| | - En-Tsung Cheng
- Department of Critical Care Medicine, Jen Ho Hospital, Show Chwan Health Care System, Changhua, TWN
| |
Collapse
|
26
|
Zhu L, Chen Y, Liu L, Xing L, Yu L. Multi-Sensor Learning Enables Information Transfer Across Different Sensory Data and Augments Multi-Modality Imaging. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2025; 47:288-304. [PMID: 39302777 PMCID: PMC11875987 DOI: 10.1109/tpami.2024.3465649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/22/2024]
Abstract
Multi-modality imaging is widely used in clinical practice and biomedical research to gain a comprehensive understanding of an imaging subject. Currently, multi-modality imaging is accomplished by post hoc fusion of independently reconstructed images under the guidance of mutual information or spatially registered hardware, which limits the accuracy and utility of multi-modality imaging. Here, we investigate a data-driven multi-modality imaging (DMI) strategy for synergetic imaging of CT and MRI. We reveal two distinct types of features in multi-modality imaging, namely intra- and inter-modality features, and present a multi-sensor learning (MSL) framework to utilize the crossover inter-modality features for augmented multi-modality imaging. The MSL imaging approach breaks down the boundaries of traditional imaging modalities and allows for optimal hybridization of CT and MRI, which maximizes the use of sensory data. We showcase the effectiveness of our DMI strategy through synergetic CT-MRI brain imaging. The principle of DMI is quite general and holds enormous potential for various DMI applications across disciplines.
Collapse
|
27
|
Ding S, Ye J, Hu X, Zou N. Distilling the knowledge from large-language model for health event prediction. Sci Rep 2024; 14:30675. [PMID: 39730390 DOI: 10.1038/s41598-024-75331-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 10/04/2024] [Indexed: 12/29/2024] Open
Abstract
Health event prediction is empowered by the rapid and wide application of electronic health records (EHR). In the Intensive Care Unit (ICU), precisely predicting the health related events in advance is essential for providing treatment and intervention to improve the patients outcomes. EHR is a kind of multi-modal data containing clinical text, time series, structured data, etc. Most health event prediction works focus on a single modality, e.g., text or tabular EHR. How to effectively learn from the multi-modal EHR for health event prediction remains a challenge. Inspired by the strong capability in text processing of large language model (LLM), we propose the framework CKLE for health event prediction by distilling the knowledge from LLM and learning from multi-modal EHR. There are two challenges of applying LLM in the health event prediction, the first one is most LLM can only handle text data rather than other modalities, e.g., structured data. The second challenge is the privacy issue of health applications requires the LLM to be locally deployed, which may be limited by the computational resource. CKLE solves the challenges of LLM scalability and portability in the healthcare domain by distilling the cross-modality knowledge from LLM into the health event predictive model. To fully take advantage of the strong power of LLM, the raw clinical text is refined and augmented with prompt learning. The embedding of clinical text are generated by LLM. To effectively distill the knowledge of LLM into the predictive model, we design a cross-modality knowledge distillation (KD) method. A specially designed training objective will be used for the KD process with the consideration of multiple modality and patient similarity. The KD loss function consists of two parts. The first one is cross-modality contrastive loss function, which models the correlation of different modalities from the same patient. The second one is patient similarity learning loss function to model the correlations between similar patients. The cross-modality knowledge distillation can distill the rich information in clinical text and the knowledge of LLM into the predictive model on structured EHR data. To demonstrate the effectiveness of CKLE, we evaluate CKLE on two health event prediction tasks in the field of cardiology, heart failure prediction and hypertension prediction. We select the 7125 patients from MIMIC-III dataset and split them into train/validation/test sets. We can achieve a maximum 4.48% improvement in accuracy compared to state-of-the-art predictive model designed for health event prediction. The results demonstrate CKLE can surpass the baseline prediction models significantly on both normal and limited label settings. We also conduct the case study on cardiology disease analysis in the heart failure and hypertension prediction. Through the feature importance calculation, we analyse the salient features related to the cardiology disease which corresponds to the medical domain knowledge. The superior performance and interpretability of CKLE pave a promising way to leverage the power and knowledge of LLM in the health event prediction in real-world clinical settings.
Collapse
Affiliation(s)
- Sirui Ding
- Department of Computer Science and Engineering, Texas A&M University, College Station, TX, USA
| | | | - Xia Hu
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Na Zou
- Department of Industrial Engineering, University of Houston, Houston, TX, USA.
| |
Collapse
|
28
|
Gathright R, Mejia I, Gonzalez JM, Hernandez Torres SI, Berard D, Snider EJ. Overview of Wearable Healthcare Devices for Clinical Decision Support in the Prehospital Setting. SENSORS (BASEL, SWITZERLAND) 2024; 24:8204. [PMID: 39771939 PMCID: PMC11679471 DOI: 10.3390/s24248204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2024] [Revised: 12/16/2024] [Accepted: 12/17/2024] [Indexed: 01/11/2025]
Abstract
Prehospital medical care is a major challenge for both civilian and military situations as resources are limited, yet critical triage and treatment decisions must be rapidly made. Prehospital medicine is further complicated during mass casualty situations or remote applications that require more extensive medical treatments to be monitored. It is anticipated on the future battlefield where air superiority will be contested that prolonged field care will extend to as much 72 h in a prehospital environment. Traditional medical monitoring is not practical in these situations and, as such, wearable sensor technology may help support prehospital medicine. However, sensors alone are not sufficient in the prehospital setting where limited personnel without specialized medical training must make critical decisions based on physiological signals. Machine learning-based clinical decision support systems can instead be utilized to interpret these signals for diagnosing injuries, making triage decisions, or driving treatments. Here, we summarize the challenges of the prehospital medical setting and review wearable sensor technology suitability for this environment, including their use with medical decision support triage or treatment guidance options. Further, we discuss recommendations for wearable healthcare device development and medical decision support technology to better support the prehospital medical setting. With further design improvement and integration with decision support tools, wearable healthcare devices have the potential to simplify and improve medical care in the challenging prehospital environment.
Collapse
Affiliation(s)
| | | | | | | | | | - Eric J. Snider
- Organ Support and Automation Technologies Group, U.S. Army Institute of Surgical Research, JBSA Fort Sam Houston, San Antonio, TX 78234, USA
| |
Collapse
|
29
|
Li Z, Wu W, Kang H. Machine Learning-Driven Metabolic Syndrome Prediction: An International Cohort Validation Study. Healthcare (Basel) 2024; 12:2527. [PMID: 39765954 PMCID: PMC11675332 DOI: 10.3390/healthcare12242527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2024] [Revised: 12/11/2024] [Accepted: 12/12/2024] [Indexed: 01/11/2025] Open
Abstract
Background/Objectives: This study aimed to develop and validate a machine learning (ML)-based metabolic syndrome (MetS) risk prediction model. Methods: We examined data from 6155 participants of the China Health and Retirement Longitudinal Study (CHARLS) in 2011. The LASSO regression feature selection identified the best MetS predictors. Nine ML-based algorithms were adopted to build predictive models. The model performance was validated using cohort data from the Korea National Health and Nutrition Examination Survey (KNHANES) (n = 5297), the United Kingdom (UK) Biobank (n = 218,781), and the National Health and Nutrition Examination Survey (NHANES) (n = 2549). Results: The multilayer perceptron (MLP)-based model performed best in the CHARLS cohort (AUC = 0.8908; PRAUC = 0.8073), the logistic model in the KNHANES cohort (AUC = 0.9101, PRAUC = 0.8116), the xgboost model in the UK Biobank cohort (AUC = 0.8556, PRAUC = 0.6246), and the MLP model in the NHANES cohort (AUC = 0.9055, PRAUC = 0.8264). Conclusions: Our MLP-based model has the potential to serve as a clinical application for detecting MetS in different populations.
Collapse
Affiliation(s)
| | | | - Hyunsik Kang
- College of Sport Science, Sungkyunkwan University, Suwon 16419, Republic of Korea; (Z.L.); (W.W.)
| |
Collapse
|
30
|
Huang K, Chandak P, Wang Q, Havaldar S, Vaid A, Leskovec J, Nadkarni GN, Glicksberg BS, Gehlenborg N, Zitnik M. A foundation model for clinician-centered drug repurposing. Nat Med 2024; 30:3601-3613. [PMID: 39322717 PMCID: PMC11645266 DOI: 10.1038/s41591-024-03233-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2023] [Accepted: 08/05/2024] [Indexed: 09/27/2024]
Abstract
Drug repurposing-identifying new therapeutic uses for approved drugs-is often a serendipitous and opportunistic endeavour to expand the use of drugs for new diseases. The clinical utility of drug-repurposing artificial intelligence (AI) models remains limited because these models focus narrowly on diseases for which some drugs already exist. Here we introduce TxGNN, a graph foundation model for zero-shot drug repurposing, identifying therapeutic candidates even for diseases with limited treatment options or no existing drugs. Trained on a medical knowledge graph, TxGNN uses a graph neural network and metric learning module to rank drugs as potential indications and contraindications for 17,080 diseases. When benchmarked against 8 methods, TxGNN improves prediction accuracy for indications by 49.2% and contraindications by 35.1% under stringent zero-shot evaluation. To facilitate model interpretation, TxGNN's Explainer module offers transparent insights into multi-hop medical knowledge paths that form TxGNN's predictive rationales. Human evaluation of TxGNN's Explainer showed that TxGNN's predictions and explanations perform encouragingly on multiple axes of performance beyond accuracy. Many of TxGNN's new predictions align well with off-label prescriptions that clinicians previously made in a large healthcare system. TxGNN's drug-repurposing predictions are accurate, consistent with off-label drug use, and can be investigated by human experts through multi-hop interpretable rationales.
Collapse
Affiliation(s)
- Kexin Huang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Payal Chandak
- Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA, USA
| | - Qianwen Wang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Shreyas Havaldar
- Hasso Plattner Institute for Digital Health, Icahn School of Medicine at Mount Sinai, Mount Sinai, NY, USA
| | - Akhil Vaid
- Hasso Plattner Institute for Digital Health, Icahn School of Medicine at Mount Sinai, Mount Sinai, NY, USA
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, Mount Sinai, NY, USA
| | - Jure Leskovec
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Girish N Nadkarni
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, Mount Sinai, NY, USA
| | - Benjamin S Glicksberg
- Hasso Plattner Institute for Digital Health, Icahn School of Medicine at Mount Sinai, Mount Sinai, NY, USA
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, Mount Sinai, NY, USA
| | - Nils Gehlenborg
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Marinka Zitnik
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Harvard Data Science Initiative, Cambridge, MA, USA.
- Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
31
|
Lam JY, Lu X, Shashikumar SP, Lee YS, Miller M, Pour H, Boussina AE, Pearce AK, Malhotra A, Nemati S. Development, deployment, and continuous monitoring of a machine learning model to predict respiratory failure in critically ill patients. JAMIA Open 2024; 7:ooae141. [PMID: 39664647 PMCID: PMC11633942 DOI: 10.1093/jamiaopen/ooae141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2024] [Revised: 11/18/2024] [Accepted: 11/25/2024] [Indexed: 12/13/2024] Open
Abstract
Objectives This study describes the development and deployment of a machine learning (ML) model called Vent.io to predict mechanical ventilation (MV). Materials and Methods We trained Vent.io using electronic health record data of adult patients admitted to the intensive care units (ICUs) of the University of California San Diego (UCSD) Health System. We prospectively deployed Vent.io using a real-time platform at UCSD and evaluated the performance of Vent.io for a 1-month period in silent mode and on the MIMIC-IV dataset. As part of deployment, we included a Predetermined Changed Control Plan (PCCP) for continuous model monitoring that triggers model fine-tuning if performance drops below a specified area under the receiver operating curve (AUC) threshold of 0.85. Results The Vent.io model had a median AUC of 0.897 (IQR: 0.892-0.904) with specificity of 0.81 (IQR: 0.812-0.841) and positive predictive value (PPV) of 0.174 (IQR: 0.148-0.176) at a fixed sensitivity of 0.6 during 10-fold cross validation and an AUC of 0.908, sensitivity of 0.632, specificity of 0.849, and PPV of 0.235 during prospective deployment. Vent.io had an AUC of 0.73 on the MIMIC-IV dataset, triggering model fine-tuning per the PCCP as the AUC was below the minimum of 0.85. The fine-tuned Vent.io model achieved an AUC of 0.873. Discussion Deterioration of model performance is a significant challenge when deploying ML models prospectively or at different sites. Implementation of a PCCP can help models adapt to new patterns in data and maintain generalizability. Conclusion Vent.io is a generalizable ML model that has the potential to improve patient care and resource allocation for ICU patients with need for MV.
Collapse
Affiliation(s)
- Jonathan Y Lam
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, United States
| | - Xiaolei Lu
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, United States
| | - Supreeth P Shashikumar
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, United States
| | - Ye Sel Lee
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, United States
| | - Michael Miller
- Division of Pulmonary, Critical Care, and Sleep Medicine, University of California San Diego, La Jolla, CA 92093, United States
| | - Hayden Pour
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, United States
| | - Aaron E Boussina
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, United States
| | - Alex K Pearce
- Division of Pulmonary, Critical Care, and Sleep Medicine, University of California San Diego, La Jolla, CA 92093, United States
| | - Atul Malhotra
- Division of Pulmonary, Critical Care, and Sleep Medicine, University of California San Diego, La Jolla, CA 92093, United States
| | - Shamim Nemati
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, United States
| |
Collapse
|
32
|
Niarakis A, Laubenbacher R, An G, Ilan Y, Fisher J, Flobak Å, Reiche K, Rodríguez Martínez M, Geris L, Ladeira L, Veschini L, Blinov ML, Messina F, Fonseca LL, Ferreira S, Montagud A, Noël V, Marku M, Tsirvouli E, Torres MM, Harris LA, Sego TJ, Cockrell C, Shick AE, Balci H, Salazar A, Rian K, Hemedan AA, Esteban-Medina M, Staumont B, Hernandez-Vargas E, Martis B S, Madrid-Valiente A, Karampelesis P, Sordo Vieira L, Harlapur P, Kulesza A, Nikaein N, Garira W, Malik Sheriff RS, Thakar J, Tran VDT, Carbonell-Caballero J, Safaei S, Valencia A, Zinovyev A, Glazier JA. Immune digital twins for complex human pathologies: applications, limitations, and challenges. NPJ Syst Biol Appl 2024; 10:141. [PMID: 39616158 PMCID: PMC11608242 DOI: 10.1038/s41540-024-00450-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Accepted: 09/27/2024] [Indexed: 12/06/2024] Open
Abstract
Digital twins represent a key technology for precision health. Medical digital twins consist of computational models that represent the health state of individual patients over time, enabling optimal therapeutics and forecasting patient prognosis. Many health conditions involve the immune system, so it is crucial to include its key features when designing medical digital twins. The immune response is complex and varies across diseases and patients, and its modelling requires the collective expertise of the clinical, immunology, and computational modelling communities. This review outlines the initial progress on immune digital twins and the various initiatives to facilitate communication between interdisciplinary communities. We also outline the crucial aspects of an immune digital twin design and the prerequisites for its implementation in the clinic. We propose some initial use cases that could serve as "proof of concept" regarding the utility of immune digital technology, focusing on diseases with a very different immune response across spatial and temporal scales (minutes, days, months, years). Lastly, we discuss the use of digital twins in drug discovery and point out emerging challenges that the scientific community needs to collectively overcome to make immune digital twins a reality.
Collapse
Affiliation(s)
- Anna Niarakis
- Molecular, Cellular and Developmental Biology Unit (MCD), Centre de Biologie Integrative (CBI), University of Toulouse, UPS, CNRS, Toulouse, France.
- Lifeware Group, Inria, Saclay-île de France, Palaiseau, France.
| | | | - Gary An
- Department of Surgery, University of Vermont Larner College of Medicine, Vermont, USA
| | - Yaron Ilan
- Faculty of Medicine Hebrew University, Hadassah Medical Center, Jerusalem, Israel
| | - Jasmin Fisher
- UCL Cancer Institute, University College London, Paul O'Gorman Building, 72 Huntley Street, London, WC1E 6BT, UK
| | - Åsmund Flobak
- Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway
- The Cancer Clinic, St Olav's University Hospital, Trondheim, Norway
- Department of Biotechnology and Nanomedicine, SINTEF Industry, Trondheim, Norway
| | - Kristin Reiche
- Department of Diagnostics, Fraunhofer Institute for Cell Therapy and Immunology, Leipzig, Germany
- Institute of Clinical Immunology, Medical Faculty, University Hospital, University of Leipzig, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Dresden/Leipzig, Germany
| | - María Rodríguez Martínez
- Department of Biomedical Informatics & Data Science, Yale School of Medicine, New Haven, CT, USA
| | - Liesbet Geris
- Prometheus Division of Skeletal Tissue Engineering, KU Leuven, Leuven, Belgium
- Skeletal Biology and Engineering Research Center, Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Biomechanics Research Unit, GIGA Molecular and Computational Biology, University of Liège, Liège, Belgium
| | - Luiz Ladeira
- Biomechanics Research Unit, GIGA Molecular and Computational Biology, University of Liège, Liège, Belgium
| | - Lorenzo Veschini
- Faculty of Dentistry Oral & Craniofacial Sciences, King's College London, London, UK
- Biocomplexity Institute and Department of Intelligent Systems Engineering, Indiana University, Bloomington, Indiana, 47408, USA
| | - Michael L Blinov
- Center for Cell Analysis and Modeling, UConn Health, Farmington, CT, 06030, USA
| | - Francesco Messina
- Department of Epidemiology, Preclinical Research and Advanced Diagnostic, National Institute for Infectious Diseases 'Lazzaro Spallanzani' - I.R.C.C.S., Rome, Italy
| | - Luis L Fonseca
- Department of Medicine, University of Florida, Gainesville, FL, USA
| | - Sandra Ferreira
- Mathematics Department and Center of Mathematics, University of Beira Interior, Covilhã, Portugal
| | - Arnau Montagud
- Barcelona Supercomputing Center (BSC), Barcelone, Spain
- Institute for Integrative Systems Biology (I2SysBio), CSIC-UV, Valencia, Spain
| | - Vincent Noël
- Institut Curie, Université PSL, F-75005, Paris, France
- INSERM, U900, F-75005, Paris, France
- Mines ParisTech, Université PSL, F-75005, Paris, France
| | - Malvina Marku
- Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
| | - Eirini Tsirvouli
- Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway
- Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway
| | - Marcella M Torres
- Department of Mathematics and Statistics, University of Richmond, Richmond, VA, USA
| | - Leonard A Harris
- Department of Biomedical Engineering, University of Arkansas, Fayetteville, AR, USA
- Interdisciplinary Graduate Program in Cell and Molecular Biology, University of Arkansas, Fayetteville, AR, USA
- Cancer Biology Program, Winthrop P. Rockefeller Cancer Institute, University of Arkansas for Medical Sciences, Little Rock, AR, USA
| | - T J Sego
- Department of Medicine, University of Florida, Gainesville, FL, USA
| | - Chase Cockrell
- Department of Surgery, University of Vermont Larner College of Medicine, Vermont, USA
| | - Amanda E Shick
- Department of Mechanical and Aerospace Engineering, University of Florida, Gainesville, FL, USA
| | - Hasan Balci
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, The Netherlands
| | - Albin Salazar
- INRIA Paris/CNRS/École Normale Supérieure/PSL Research University, Paris, France
| | - Kinza Rian
- Andalusian Platform for Computational Medicine, Andalusian Public Foundation Progress and Health-FPS, Seville, Spain
| | - Ahmed Abdelmonem Hemedan
- Bioinformatics Core Unit, Luxembourg Centre of Systems Biomedicine LCSB, Luxembourg University, Esch-sur-Alzette, Luxembourg
| | - Marina Esteban-Medina
- Andalusian Platform for Computational Medicine, Andalusian Public Foundation Progress and Health-FPS, Seville, Spain
| | - Bernard Staumont
- Biomechanics Research Unit, GIGA Molecular and Computational Biology, University of Liège, Liège, Belgium
| | - Esteban Hernandez-Vargas
- Department of Mathematics and Statistical Science, University of Idaho, Moscow, ID, 83844-1103, USA
| | | | | | | | | | - Pradyumna Harlapur
- Department of Bioengineering, Indian Institute of Science, Bengaluru, India
| | | | - Niloofar Nikaein
- School of Medical Sciences, Faculty of Medicine and Health, Örebro University, SE-70182, Örebro, Sweden
- X-HiDE - Exploring Inflammation in Health and Disease Consortium, Örebro University, Örebro, Sweden
| | - Winston Garira
- Multiscale Mathematical Modelling of Living Systems program (M3-LSP), Kimberley, South Africa
- Department of Mathematical Sciences, Sol Plaatje University, Kimberley, South Africa
- Private Bag X5008, Kimberley, 8300, South Africa
| | - Rahuman S Malik Sheriff
- European Bioinformatics Institute, European Molecular Biology Laboratory (EMBL-EBI), Hinxton, Cambridge, UK
- Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, London, UK
| | - Juilee Thakar
- Department of Microbiology & Immunology and Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY, 14642, USA
| | - Van Du T Tran
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | - Soroush Safaei
- Institute of Biomedical Engineering and Technology, Ghent University, Gent, Belgium
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Alfonso Valencia
- Barcelona Supercomputing Center (BSC), Barcelone, Spain
- ICREA, 23 Passeig Lluís Companys, 08010, Barcelona, Spain
| | | | - James A Glazier
- Biocomplexity Institute and Department of Intelligent Systems Engineering, Indiana University, Bloomington, Indiana, 47408, USA
| |
Collapse
|
33
|
Lian S, Luo Z. Cutting-Edge Machine Learning in Biomedical Image Analysis: Editorial for Bioengineering Special Issue: "Recent Advance of Machine Learning in Biomedical Image Analysis". Bioengineering (Basel) 2024; 11:1106. [PMID: 39593766 PMCID: PMC11591837 DOI: 10.3390/bioengineering11111106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2024] [Accepted: 10/22/2024] [Indexed: 11/28/2024] Open
Abstract
Biomedical image analysis plays a critical role in the healthcare system [...].
Collapse
Affiliation(s)
- Sheng Lian
- College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China
| | - Zhiming Luo
- Department of Artificial Intelligence, Xiamen University, Xiamen 361005, China;
| |
Collapse
|
34
|
Watson M, Chambers P, Steventon L, Harmsworth King J, Ercia A, Shaw H, Al Moubayed N. From prediction to practice: mitigating bias and data shift in machine-learning models for chemotherapy-induced organ dysfunction across unseen cancers. BMJ ONCOLOGY 2024; 3:e000430. [PMID: 39886186 PMCID: PMC11557724 DOI: 10.1136/bmjonc-2024-000430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 10/07/2024] [Indexed: 02/01/2025]
Abstract
Objectives Routine monitoring of renal and hepatic function during chemotherapy ensures that treatment-related organ damage has not occurred and clearance of subsequent treatment is not hindered; however, frequency and timing are not optimal. Model bias and data heterogeneity concerns have hampered the ability of machine learning (ML) to be deployed into clinical practice. This study aims to develop models that could support individualised decisions on the timing of renal and hepatic monitoring while exploring the effect of data shift on model performance. Methods and analysis We used retrospective data from three UK hospitals to develop and validate ML models predicting unacceptable rises in creatinine/bilirubin post cycle 3 for patients undergoing treatment for the following cancers: breast, colorectal, lung, ovarian and diffuse large B-cell lymphoma. Results We extracted 3614 patients with no missing blood test data across cycles 1-6 of chemotherapy treatment. We improved on previous work by including predictions post cycle 3. Optimised for sensitivity, we achieve F2 scores of 0.7773 (bilirubin) and 0.6893 (creatinine) on unseen data. Performance is consistent on tumour types unseen during training (F2 bilirubin: 0.7423, F2 creatinine: 0.6820). Conclusion Our technique highlights the effectiveness of ML in clinical settings, demonstrating the potential to improve the delivery of care. Notably, our ML models can generalise to unseen tumour types. We propose gold-standard bias mitigation steps for ML models: evaluation on multisite data, thorough patient population analysis, and both formalised bias measures and model performance comparisons on patient subgroups. We demonstrate that data aggregation techniques have unintended consequences on model bias.
Collapse
Affiliation(s)
- Matthew Watson
- Department of Computer Science, Durham University, Durham, UK
- Cancer Division, University College London Hospitals NHS Foundation Trust, London, UK
| | - Pinkie Chambers
- Cancer Division, University College London Hospitals NHS Foundation Trust, London, UK
- School of Pharmacy, University College London, London, UK
| | - Luke Steventon
- Cancer Division, University College London Hospitals NHS Foundation Trust, London, UK
- School of Pharmacy, University College London, London, UK
| | | | | | - Heather Shaw
- Cancer Division, University College London Hospitals NHS Foundation Trust, London, UK
- Mount Vernon Cancer Centre, Northwood, UK
| | - Noura Al Moubayed
- Department of Computer Science, Durham University, Durham, UK
- Evergreen Life Ltd, Manchester, UK
| |
Collapse
|
35
|
Araujo ARC, Okey OD, Saadi M, Adasme P, Rosa RL, Rodríguez DZ. Quantum-assisted federated intelligent diagnosis algorithm with variational training supported by 5G networks. Sci Rep 2024; 14:26333. [PMID: 39487124 PMCID: PMC11530559 DOI: 10.1038/s41598-024-71826-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Accepted: 08/28/2024] [Indexed: 11/04/2024] Open
Abstract
In the realm of intelligent healthcare, there is a growing ambition to reshape medical services through the integration of artificial intelligence (AI). However, conventional machine learning faces inherent challenges such as privacy issues, delayed updates, and protracted training times, particularly due to the hesitance of medical institutions to directly share sensitive data, with possible noises. In response to these concerns, a Quantum-Assisted Federated Intelligent Diagnosis Algorithm ( β -QuAFIDA) is proposed, applied into real medical data. Leveraging the capabilities of the 5G mobile network, this approach works the connection between Internet of Medical Things (IoMT) devices through the 5G, synchronizing training and updating the server model without disrupting their real-world applications. In our quest to safeguard patient data and enhance training efficiency, our study employs an innovative heuristic approach marked by a nested loop structure. Specifically, the inner loop is dedicated to training the beta-variational quantum eigensolver ( β -VQE) to approximate the expectation values of the proposed algorithm; the outer loop trains the β -QuAFIDA to reduce the relative entropy towards the target. This approach involves a balance between privacy considerations and the urgency of training. Results demonstrate that representations with low-rank attained through β -QuAFIDA offer an effective approach for acquiring low-rank states. This research signifies a step forward in the synergy between AI and 5G technologies, presenting a novel avenue for the advancement of intelligent healthcare.
Collapse
Affiliation(s)
| | - Ogobuchi Daniel Okey
- Center for Engineering, Modelling and Applied Sciences, CECS, Federal University of ABC, São Paulo, Brazil
| | - Muhammad Saadi
- School of Science and Technology, Department of Computer Science, Nottingham Trent University, Nottingham, England
| | - Pablo Adasme
- Department of Electrical Engineering, Faculty of Engineering, University of Santiago, Santiago, Chile
| | - Renata Lopes Rosa
- Department of Computer Science, Federal University of Lavras, Lavras, MG, Brazil
| | - Demóstenes Zegarra Rodríguez
- Department of Computer Science, Federal University of Lavras, Lavras, MG, Brazil.
- Center for Engineering, Modelling and Applied Sciences, CECS, Federal University of ABC, São Paulo, Brazil.
| |
Collapse
|
36
|
Nielsen C, Souza R, Wilms M, Forkert ND. Foundation model-driven distributed learning for enhanced retinal age prediction. J Am Med Inform Assoc 2024; 31:2550-2559. [PMID: 39225790 PMCID: PMC11491655 DOI: 10.1093/jamia/ocae220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 07/24/2024] [Accepted: 08/21/2024] [Indexed: 09/04/2024] Open
Abstract
OBJECTIVES The retinal age gap (RAG) is emerging as a potential biomarker for various diseases of the human body, yet its utility depends on machine learning models capable of accurately predicting biological retinal age from fundus images. However, training generalizable models is hindered by potential shortages of diverse training data. To overcome these obstacles, this work develops a novel and computationally efficient distributed learning framework for retinal age prediction. MATERIALS AND METHODS The proposed framework employs a memory-efficient 8-bit quantized version of RETFound, a cutting-edge foundation model for retinal image analysis, to extract features from fundus images. These features are then used to train an efficient linear regression head model for predicting retinal age. The framework explores federated learning (FL) as well as traveling model (TM) approaches for distributed training of the linear regression head. To evaluate this framework, we simulate a client network using fundus image data from the UK Biobank. Additionally, data from patients with type 1 diabetes from the UK Biobank and the Brazilian Multilabel Ophthalmological Dataset (BRSET) were utilized to explore the clinical utility of the developed methods. RESULTS Our findings reveal that the developed distributed learning framework achieves retinal age prediction performance on par with centralized methods, with FL and TM providing similar performance (mean absolute error of 3.57 ± 0.18 years for centralized learning, 3.60 ± 0.16 years for TM, and 3.63 ± 0.19 years for FL). Notably, the TM was found to converge with fewer local updates than FL. Moreover, patients with type 1 diabetes exhibited significantly higher RAG values than healthy controls in all models, for both the UK Biobank and BRSET datasets (P < .001). DISCUSSION The high computational and memory efficiency of the developed distributed learning framework makes it well suited for resource-constrained environments. CONCLUSION The capacity of this framework to integrate data from underrepresented populations for training of retinal age prediction models could significantly enhance the accessibility of the RAG as an important disease biomarker.
Collapse
Affiliation(s)
- Christopher Nielsen
- Department of Radiology, University of Calgary, Calgary, AB T2N 4N1, Canada
- Biomedical Engineering Graduate Program, University of Calgary, Calgary, AB T2N 4N1, Canada
| | - Raissa Souza
- Department of Radiology, University of Calgary, Calgary, AB T2N 4N1, Canada
- Biomedical Engineering Graduate Program, University of Calgary, Calgary, AB T2N 4N1, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB T2N 4N1, Canada
| | - Matthias Wilms
- Department of Radiology, University of Calgary, Calgary, AB T2N 4N1, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB T2N 4N1, Canada
- Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada
- Department of Pediatrics, University of Calgary, Calgary, AB T2N 4N1, Canada
- Department of Community Health Sciences, University of Calgary, Calgary, AB T2N 4N1, Canada
| | - Nils D Forkert
- Department of Radiology, University of Calgary, Calgary, AB T2N 4N1, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB T2N 4N1, Canada
- Alberta Children’s Hospital Research Institute, University of Calgary, Calgary, AB T2N 4N1, Canada
- Department of Clinical Neurosciences, University of Calgary, Calgary, AB T2N 4N1, Canada
| |
Collapse
|
37
|
Gonzalez JM, Ortiz R, Holland L, Ruiz A, Ross E, Snider EJ. Machine Learning Models for Tracking Blood Loss and Resuscitation in a Hemorrhagic Shock Swine Injury Model. Bioengineering (Basel) 2024; 11:1075. [PMID: 39593735 PMCID: PMC11591271 DOI: 10.3390/bioengineering11111075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2024] [Revised: 10/21/2024] [Accepted: 10/23/2024] [Indexed: 11/28/2024] Open
Abstract
Hemorrhage leading to life-threatening shock is a common and critical problem in both civilian and military medicine. Due to complex physiological compensatory mechanisms, traditional vital signs may fail to detect patients' impending hemorrhagic shock in a timely manner when life-saving interventions are still viable. To address this shortcoming of traditional vital signs in detecting hemorrhagic shock, we have attempted to identify metrics that can predict blood loss. We have previously combined feature extraction and machine learning methodologies applied to arterial waveform analysis to develop advanced metrics that have enabled the early and accurate detection of impending shock in a canine model of hemorrhage, including metrics that estimate blood loss such as the Blood Loss Volume Metric, the Percent Estimated Blood Loss metric, and the Hemorrhage Area metric. Importantly, these metrics were able to identify impending shock well before traditional vital signs, such as blood pressure, were altered enough to identify shock. Here, we apply these advanced metrics developed using data from a canine model to data collected from a swine model of controlled hemorrhage as an interim step towards showing their relevance to human medicine. Based on the performance of these advanced metrics, we conclude that the framework for developing these metrics in the previous canine model remains applicable when applied to a swine model and results in accurate performance in these advanced metrics. The success of these advanced metrics in swine, which share physiological similarities to humans, shows promise in developing advanced blood loss metrics for humans, which would result in increased positive casualty outcomes due to hemorrhage in civilian and military medicine.
Collapse
Affiliation(s)
| | | | | | | | | | - Eric J. Snider
- Organ Support and Automation Technologies Group, U.S. Army Institute of Surgical Research, Joint Base San Antonio, Fort Sam Houston, San Antonio, TX 78234, USA; (J.M.G.)
| |
Collapse
|
38
|
Liu J, Pan L, Wang S, Li Y, Wu Y, Luan J, Yang K. Predicting laboratory aspirin resistance in Chinese stroke patients using machine learning models by GP1BA polymorphism. Pharmacogenomics 2024; 25:539-550. [PMID: 39440554 DOI: 10.1080/14622416.2024.2411939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Accepted: 09/30/2024] [Indexed: 10/25/2024] Open
Abstract
This study aims to use machine learning model to predict laboratory aspirin resistance (AR) in Chinese stroke patients by incorporating patient characteristics and single nucleotide polymorphisms of GP1BA and LTC4S. 2405 patients were analyzed to measure the Mutation frequency of GP1BA rs6065 and LTC4S rs730012. 112 patients with first-stroke arteriostenosis were prospectively enrolled to establish machine learning model. GP1BA rs6065 mutation frequency is 5.26% and LTC4S rs730012 is 14.78%. GP1BA rs6065 CT patients have more sensitivity to aspirin than CC genotype. Simple linear regression identified significant associations with age, smoking, HDL and GP1BA rs6065. Random forest (RF) and extreme gradient boosting (XGBoost) demonstrated predictive capabilities for AR. Findings suggest pre-identifying GP1BA rs6065 could optimize aspirin treatment, enabling personalized care and future research avenues.
Collapse
Affiliation(s)
- Jun Liu
- Department of Neurology, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, P.R. China
| | - Linkun Pan
- Department of Pharmacy, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, P.R. China
| | - Sheng Wang
- Department of Pharmacy, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, P.R. China
| | - Yueran Li
- Department of Pharmacy, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, P.R. China
| | - Yilai Wu
- Department of Pharmacy, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, P.R. China
| | - Jiajie Luan
- Department of Pharmacy, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, P.R. China
| | - Kui Yang
- Department of Pharmacy, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, P.R. China
| |
Collapse
|
39
|
Steeg S, Bickley H, Clements C, Quinlivan LM, Barlow S, Monaghan E, Naylor F, Smith J, Mughal F, Robinson C, Gnani S, Kapur N. Care gaps among people presenting to the hospital following self-harm: observational study of three emergency departments in England. BMJ Open 2024; 14:e085672. [PMID: 39438110 PMCID: PMC11499793 DOI: 10.1136/bmjopen-2024-085672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Accepted: 08/30/2024] [Indexed: 10/25/2024] Open
Abstract
OBJECTIVES This study aims to examine the proportions of patients referred to mental health, social and voluntary, community and social enterprise (VCSE) services and general practice and to assess care gaps among people presenting to the hospital following self-harm. DESIGN Population-based observational study. Data were extracted from hospital records. SETTING Three emergency departments (EDs) in Manchester, UK. PARTICIPANTS 26 090 patients aged 15+ years who presented to participating EDs following self-harm and who received a psychosocial assessment by a mental health specialist. PRIMARY AND SECONDARY OUTCOME MEASURES Primary outcome measures are as follows: care gaps, estimated from the proportion of patients with evidence of social and mental health needs with no new or active referral to mental health, social and VCSE services. Secondary outcome measures are as follows: proportions of referrals by groups of patients, estimated mental health and social needs of patients. Indicators of mental health and social need were developed with academic clinicians (psychiatrist, general practitioner and social worker) and expert lived experience contributors. RESULTS 96.2% (25 893/26 909) of individuals were estimated as having mental health needs. Among this group, 29.9% (6503/21 719) had no new or active referral to mental health services (indicating a care gap). Mental healthcare gaps were greater in men and those who were aged under 35 years, from a black, South Asian or Chinese ethnic group, living in the most deprived areas and had no mental health diagnosis, or alcohol, substance misuse, anxiety or trauma-related disorder. 52.8% (14 219/26 909) had social needs, with care gaps greater for men, individuals aged 45-64 and those who were unemployed or had a diagnosed mental disorder. CONCLUSIONS Care gaps were higher among hospital-presenting groups known to have increased risks of suicide: men, those in middle age, unemployed individuals and those misusing substances. Improved access to mental health, social and VCSE services and general practice care is vital to reduce inequities in access to self-harm aftercare.
Collapse
Affiliation(s)
- Sarah Steeg
- NIHR School for Primary Care Research, University of Manchester, Manchester, UK
- Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
- Centre for Mental Health and Safety, Division of Psychology and Mental Health, School of Health Sciences, University of Manchester, Manchester, UK
- NIHR Greater Manchester Patient Safety Research Collaboration, School of Health Sciences, University of Manchester, Manchester, UK
| | - Harriet Bickley
- Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
- Centre for Mental Health and Safety, Division of Psychology and Mental Health, School of Health Sciences, University of Manchester, Manchester, UK
| | - Caroline Clements
- Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
- Centre for Mental Health and Safety, Division of Psychology and Mental Health, School of Health Sciences, University of Manchester, Manchester, UK
| | - Leah M Quinlivan
- Centre for Mental Health and Safety, Division of Psychology and Mental Health, School of Health Sciences, University of Manchester, Manchester, UK
- NIHR Greater Manchester Patient Safety Research Collaboration, School of Health Sciences, University of Manchester, Manchester, UK
| | - Steven Barlow
- NIHR Greater Manchester Patient Safety Research Collaboration, School of Health Sciences, University of Manchester, Manchester, UK
| | - Elizabeth Monaghan
- NIHR Greater Manchester Patient Safety Research Collaboration, School of Health Sciences, University of Manchester, Manchester, UK
| | - Fiona Naylor
- NIHR Greater Manchester Patient Safety Research Collaboration, School of Health Sciences, University of Manchester, Manchester, UK
| | - Jonathan Smith
- NIHR Greater Manchester Patient Safety Research Collaboration, School of Health Sciences, University of Manchester, Manchester, UK
| | - Faraz Mughal
- NIHR Greater Manchester Patient Safety Research Collaboration, School of Health Sciences, University of Manchester, Manchester, UK
- School of Medicine, Keele University, Keele, UK
| | - Catherine Robinson
- Social Care and Society, School of Health Sciences, University of Manchester, Manchester, UK
| | - Shamini Gnani
- Department of Primary Care and Public Health, Imperial College London, London, UK
| | - Navneet Kapur
- Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
- Centre for Mental Health and Safety, Division of Psychology and Mental Health, School of Health Sciences, University of Manchester, Manchester, UK
- NIHR Greater Manchester Patient Safety Research Collaboration, School of Health Sciences, University of Manchester, Manchester, UK
- Mersey Care NHS Foundation Trust, Liverpool, UK
| |
Collapse
|
40
|
Kefeli J, Berkowitz J, Acitores Cortina JM, Tsang KK, Tatonetti NP. Generalizable and automated classification of TNM stage from pathology reports with external validation. Nat Commun 2024; 15:8916. [PMID: 39414770 PMCID: PMC11484761 DOI: 10.1038/s41467-024-53190-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 10/04/2024] [Indexed: 10/18/2024] Open
Abstract
Cancer staging is an essential clinical attribute informing patient prognosis and clinical trial eligibility. However, it is not routinely recorded in structured electronic health records. Here, we present BB-TEN: Big Bird - TNM staging Extracted from Notes, a generalizable method for the automated classification of TNM stage directly from pathology report text. We train a BERT-based model using publicly available pathology reports across approximately 7000 patients and 23 cancer types. We explore the use of different model types, with differing input sizes, parameters, and model architectures. Our final model goes beyond term-extraction, inferring TNM stage from context when it is not included in the report text explicitly. As external validation, we test our model on almost 8000 pathology reports from Columbia University Medical Center, finding that our trained model achieved an AU-ROC of 0.815-0.942. This suggests that our model can be applied broadly to other institutions without additional institution-specific fine-tuning.
Collapse
Affiliation(s)
- Jenna Kefeli
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Jacob Berkowitz
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Cedars-Sinai Cancer, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Jose M Acitores Cortina
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Cedars-Sinai Cancer, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Kevin K Tsang
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
- Cedars-Sinai Cancer, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Nicholas P Tatonetti
- Department of Systems Biology, Columbia University, New York, NY, USA.
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
- Cedars-Sinai Cancer, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
- Department of Biomedical Informatics, Columbia University, New York, NY, USA.
| |
Collapse
|
41
|
Mbarek L, Chen S, Jin A, Pan Y, Meng X, Yang X, Xu Z, Jiang Y, Wang Y. Predicting 3-month poor functional outcomes of acute ischemic stroke in young patients using machine learning. Eur J Med Res 2024; 29:494. [PMID: 39385211 PMCID: PMC11466038 DOI: 10.1186/s40001-024-02056-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 09/09/2024] [Indexed: 10/12/2024] Open
Abstract
BACKGROUND Prediction of short-term outcomes in young patients with acute ischemic stroke (AIS) may assist in making therapy decisions. Machine learning (ML) is increasingly used in healthcare due to its high accuracy. This study aims to use a ML-based predictive model for poor 3-month functional outcomes in young AIS patients and to compare the predictive performance of ML models with the logistic regression model. METHODS We enrolled AIS patients aged between 18 and 50 years from the Third Chinese National Stroke Registry (CNSR-III), collected between 2015 and 2018. A modified Rankin Scale (mRS) ≥ 3 was a poor functional outcome at 3 months. Four ML tree models were developed: The extreme Gradient Boosting (XGBoost), Light Gradient Boosted Machine (lightGBM), Random Forest (RF), and The Gradient Boosting Decision Trees (GBDT), compared with logistic regression. We assess the model performance based on both discrimination and calibration. RESULTS A total of 2268 young patients with a mean age of 44.3 ± 5.5 years were included. Among them, (9%) had poor functional outcomes. The mRS at admission, living alone conditions, and high National Institutes of Health Stroke Scale (NIHSS) at discharge remained independent predictors of poor 3-month outcomes. The best AUC in the test group was XGBoost (AUC = 0.801), followed by GBDT, RF, and lightGBM (AUCs of 0.795, 0, 794, and 0.792, respectively). The XGBoost, RF, and lightGBM models were significantly better than logistic regression (P < 0.05). CONCLUSIONS ML outperformed logistic regression, where XGBoost the boost was the best model for predicting poor functional outcomes in young AIS patients. It is important to consider living alone conditions with high severity scores to improve stroke prognosis.
Collapse
Affiliation(s)
- Lamia Mbarek
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No.119 South 4th Ring West Road, Fengtai District, Beijing, 100070, China
| | - Siding Chen
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No.119 South 4th Ring West Road, Fengtai District, Beijing, 100070, China
- Changping Laboratory, Beijing, China
| | - Aoming Jin
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No.119 South 4th Ring West Road, Fengtai District, Beijing, 100070, China
| | - Yuesong Pan
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No.119 South 4th Ring West Road, Fengtai District, Beijing, 100070, China
| | - Xia Meng
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No.119 South 4th Ring West Road, Fengtai District, Beijing, 100070, China
| | - Xiaomeng Yang
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
| | - Zhe Xu
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No.119 South 4th Ring West Road, Fengtai District, Beijing, 100070, China
| | - Yong Jiang
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No.119 South 4th Ring West Road, Fengtai District, Beijing, 100070, China.
- Changping Laboratory, Beijing, China.
- Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, Beihang University and Capital Medical University, Beijing, 100091, China.
| | - Yongjun Wang
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China.
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, No.119 South 4th Ring West Road, Fengtai District, Beijing, 100070, China.
- Changping Laboratory, Beijing, China.
- Research Unit of Artificial Intelligence in Cerebrovascular Disease, Chinese Academy of Medical Sciences, Beijing, 2019RU018, China.
- Beijing Advanced Innovation Centre for Big Data-Based Precision Medicine, Beihang University, Capital Medical University, Beijing, China.
- Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Beijing, China.
- Advanced Innovation Center for Human Brain Protection, Capital Medical University, Beijing, China.
| |
Collapse
|
42
|
Zhang W, Zhu Y, Tong L, Wei G, Zhang H. Leverage machine learning to identify key measures in hospital operations management: a retrospective study to explore feasibility and performance of four common algorithms. BMC Med Inform Decis Mak 2024; 24:286. [PMID: 39367415 PMCID: PMC11451234 DOI: 10.1186/s12911-024-02689-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 09/18/2024] [Indexed: 10/06/2024] Open
Abstract
BACKGROUND Measures in operations management are pivotal for monitoring and assessing various aspects of hospital performance. Existing literature highlights the importance of regularly updating key management measures to reflect changing trends and organizational goals. Advancements in machine learning (ML) have presented promising opportunities for enhancing the process of updating operations management measures. However, their specific application and performance remain relatively unexplored. We aimed to investigate the feasibility and effectiveness of using common ML techniques to identify and update key measures in hospital operations management. METHODS Historical data on 43 measures on financial balance and quality of care under 4 categories were retrieved from the BI system of a regional health system in Central China. The dataset included 17 surgical and 15 non-surgical departments over 48 months. Four common ML techniques, linear models (LM), random forest (RF), partial least squares (PLS), and neural networks (NN), were used to identify the most important measures. Ordinary least square was employed to investigate the impact of the top 10 measures. A ground truth validation compared the ML-identified key measures against the humanly decided strategic measures from annual meeting minutes. RESULTS For financial balancing, inpatient treatment revenue was an important measure in 3/4 years, followed by equipment depreciation costs. The measures identified using the same technique differed between years, though RF and PLS yielded relatively consistent results. For quality of care, none of the ML-identified measures repeated over the years. Those consistently important over four years differed almost entirely among four techniques. On ground truth validation, the 2016-2019 ML-identified measures were among the humanly identified measures, with the exception of equipment depreciation from the 2019 dataset. All the ML-identified measures for quality of care failed to coincide with the humanly decided measures. CONCLUSIONS Using ML to identify key hospital operational measures is viable but performance of ML techniques vary considerably. RF performs best among the four techniques in identifying key measures in financial balance. None of the ML techniques seem effective for identifying quality of care measures. ML is suggested as a decision support tool to remind and inspire decision-makers in certain aspects of hospital operations management.
Collapse
Affiliation(s)
- Wantao Zhang
- Department of Operation Management, Huangshi Central Hospital, Affiliated Hospital of Hubei Polytechnic University, #141 Tianjin Road, Huangshi Port District, Huangshi, 435000, Hubei Province, China
| | - Yan Zhu
- Center for Health Statistics and Information, National Health Commission of People's Republic of China, #1 Xizhimen Wainan Road, Xicheng District, Beijing, 100810, China
| | - Liqun Tong
- Huangshi Central Hospital, Affiliated Hospital of Hubei Polytechnic University, #141 Tianjin Road, Huangshi Port District, Huangshi, 435000, Hubei Province, China
| | - Guo Wei
- Department of Pediatrics, Huangshi Central Hospital, Affiliated Hospital of Hubei Polytechnic University, #141 Tianjin Road, Huangshi Port District, Huangshi, 435000, Hubei Province, China
| | - Huajun Zhang
- Department of Operation Management, Huangshi Central Hospital, Affiliated Hospital of Hubei Polytechnic University, #141 Tianjin Road, Huangshi Port District, Huangshi, 435000, Hubei Province, China.
| |
Collapse
|
43
|
Chinni BK, Manlhiot C. Emerging Analytical Approaches for Personalized Medicine Using Machine Learning In Pediatric and Congenital Heart Disease. Can J Cardiol 2024; 40:1880-1896. [PMID: 39097187 DOI: 10.1016/j.cjca.2024.07.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 07/25/2024] [Accepted: 07/26/2024] [Indexed: 08/05/2024] Open
Abstract
Precision and personalized medicine, the process by which patient management is tailored to individual circumstances, are now terms that are familiar to cardiologists, despite it still being an emerging field. Although precision medicine relies most often on the underlying biology and pathophysiology of a patient's condition, personalized medicine relies on digital biomarkers generated through algorithms. Given the complexity of the underlying data, these digital biomarkers are most often generated through machine-learning algorithms. There are a number of analytic considerations regarding the creation of digital biomarkers that are discussed in this review, including data preprocessing, time dependency and gating, dimensionality reduction, and novel methods, both in the realm of supervised and unsupervised machine learning. Some of these considerations, such as sample size requirements and measurements of model performance, are particularly challenging in small and heterogeneous populations with rare outcomes such as children with congenital heart disease. Finally, we review analytic considerations for the deployment of digital biomarkers in clinical settings, including the emerging field of clinical artificial intelligence (AI) operations, computational needs for deployment, efforts to increase the explainability of AI, algorithmic drift, and the needs for distributed surveillance and federated learning. We conclude this review by discussing a recent simulation study that shows that, despite these analytic challenges and complications, the use of digital biomarkers in managing clinical care might have substantial benefits regarding individual patient outcomes.
Collapse
Affiliation(s)
- Bhargava K Chinni
- The Blalock-Taussig-Thomas Pediatric and Congenital Heart Center, Department of Pediatrics, Johns Hopkins School of Medicine, Johns Hopkins University, Baltimore, Maryland, USA
| | - Cedric Manlhiot
- The Blalock-Taussig-Thomas Pediatric and Congenital Heart Center, Department of Pediatrics, Johns Hopkins School of Medicine, Johns Hopkins University, Baltimore, Maryland, USA; Research Institute, SickKids Hospital, Department of Pediatrics, University of Toronto, Toronto, Ontario, Canada.
| |
Collapse
|
44
|
Yang Y, Zhang H, Gichoya JW, Katabi D, Ghassemi M. The limits of fair medical imaging AI in real-world generalization. Nat Med 2024; 30:2838-2848. [PMID: 38942996 PMCID: PMC11485237 DOI: 10.1038/s41591-024-03113-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Accepted: 06/05/2024] [Indexed: 06/30/2024]
Abstract
As artificial intelligence (AI) rapidly approaches human-level performance in medical imaging, it is crucial that it does not exacerbate or propagate healthcare disparities. Previous research established AI's capacity to infer demographic data from chest X-rays, leading to a key concern: do models using demographic shortcuts have unfair predictions across subpopulations? In this study, we conducted a thorough investigation into the extent to which medical AI uses demographic encodings, focusing on potential fairness discrepancies within both in-distribution training sets and external test sets. Our analysis covers three key medical imaging disciplines-radiology, dermatology and ophthalmology-and incorporates data from six global chest X-ray datasets. We confirm that medical imaging AI leverages demographic shortcuts in disease classification. Although correcting shortcuts algorithmically effectively addresses fairness gaps to create 'locally optimal' models within the original data distribution, this optimality is not true in new test settings. Surprisingly, we found that models with less encoding of demographic attributes are often most 'globally optimal', exhibiting better fairness during model evaluation in new test environments. Our work establishes best practices for medical imaging models that maintain their performance and fairness in deployments beyond their initial training contexts, underscoring critical considerations for AI clinical deployments across populations and sites.
Collapse
Affiliation(s)
- Yuzhe Yang
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Haoran Zhang
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Judy W Gichoya
- Department of Radiology, Emory University School of Medicine, Atlanta, GA, USA
| | - Dina Katabi
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Marzyeh Ghassemi
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
- Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
45
|
Li J, Pu S, Shu L, Guo M, He Z. Identification of diagnostic candidate genes in COVID-19 patients with sepsis. Immun Inflamm Dis 2024; 12:e70033. [PMID: 39377750 PMCID: PMC11460023 DOI: 10.1002/iid3.70033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 09/16/2024] [Accepted: 09/19/2024] [Indexed: 10/09/2024] Open
Abstract
PURPOSE Coronavirus Disease 2019 (COVID-19) and sepsis are closely related. This study aims to identify pivotal diagnostic candidate genes in COVID-19 patients with sepsis. PATIENTS AND METHODS We obtained a COVID-19 data set and a sepsis data set from the Gene Expression Omnibus (GEO) database. Identification of differentially expressed genes (DEGs) and module genes using the Linear Models for Microarray Data (LIMMA) and weighted gene co-expression network analysis (WGCNA), functional enrichment analysis, protein-protein interaction (PPI) network construction, and machine learning algorithms (least absolute shrinkage and selection operator (LASSO) regression and Random Forest (RF)) were used to identify candidate hub genes for the diagnosis of COVID-19 patients with sepsis. Receiver operating characteristic (ROC) curves were developed to assess the diagnostic value. Finally, the data set GSE28750 was used to verify the core genes and analyze the immune infiltration. RESULTS The COVID-19 data set contained 3,438 DEGs, and 595 common genes were screened in sepsis. sepsis DEGs were mainly enriched in immune regulation. The intersection of DEGs for COVID-19 and core genes for sepsis was 329, which were also mainly enriched in the immune system. After developing the PPI network, 17 node genes were filtered and thirteen candidate hub genes were selected for diagnostic value evaluation using machine learning. All thirteen candidate hub genes have diagnostic value, and 8 genes with an Area Under the Curve (AUC) greater than 0.9 were selected as diagnostic genes. CONCLUSION Five core genes (CD3D, IL2RB, KLRC, CD5, and HLA-DQA1) associated with immune infiltration were identified to evaluate their diagnostic utility COVID-19 patients with sepsis. This finding contributes to the identification of potential peripheral blood diagnostic candidate genes for COVID-19 patients with sepsis.
Collapse
Affiliation(s)
- Jiuang Li
- Department of Critical Care MedicineThe Third Xiangya Hospital, Central South UniversityChangshaHunanChina
| | - Shiqian Pu
- Department of Critical Care MedicineThe Third Xiangya Hospital, Central South UniversityChangshaHunanChina
| | - Lei Shu
- Department of Critical Care MedicineThe Third Xiangya Hospital, Central South UniversityChangshaHunanChina
| | - Mingjun Guo
- Department of Critical Care MedicineThe Third Xiangya Hospital, Central South UniversityChangshaHunanChina
| | - Zhihui He
- Department of Critical Care MedicineThe Third Xiangya Hospital, Central South UniversityChangshaHunanChina
| |
Collapse
|
46
|
Sengupta PP, Dey D, Davies RH, Duchateau N, Yanamala N. Challenges for augmenting intelligence in cardiac imaging. Lancet Digit Health 2024; 6:e739-e748. [PMID: 39214759 DOI: 10.1016/s2589-7500(24)00142-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Revised: 05/15/2024] [Accepted: 06/17/2024] [Indexed: 09/04/2024]
Abstract
Artificial Intelligence (AI), through deep learning, has brought automation and predictive capabilities to cardiac imaging. However, despite considerable investment, tangible health-care cost reductions remain unproven. Although AI holds promise, there has been insufficient time for both methodological development and prospective clinical trials to establish its advantage over human interpretations in terms of its effect on patient outcomes. Challenges such as data scarcity, privacy issues, and ethical concerns impede optimal AI training. Furthermore, the absence of a unified model for the complex structure and function of the heart and evolving domain knowledge can introduce heuristic biases and influence underlying assumptions in model development. Integrating AI into diverse institutional picture archiving and communication systems and devices also presents a clinical hurdle. This hurdle is further compounded by an absence of high-quality labelled data, difficulty sharing data between institutions, and non-uniform and inadequate gold standards for external validations and comparisons of model performance in real-world settings. Nevertheless, there is a strong push in industry and academia for AI solutions in medical imaging. This Series paper reviews key studies and identifies challenges that require a pragmatic change in the approach for using AI for cardiac imaging, whereby AI is viewed as augmented intelligence to complement, not replace, human judgement. The focus should shift from isolated measurements to integrating non-linear and complex data towards identifying disease phenotypes-emphasising pattern recognition where AI excels. Algorithms should enhance imaging reports, enriching patients' understanding, communication between patients and clinicians, and shared decision making. The emergence of professional standards and guidelines is essential to address these developments and ensure the safe and effective integration of AI in cardiac imaging.
Collapse
Affiliation(s)
- Partho P Sengupta
- Division of Cardiovascular Disease and Hypertension, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, USA.
| | - Damini Dey
- Biomedical Imaging Research Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Rhodri H Davies
- Institute of Cardiovascular Science, University College London, London, UK
| | - Nicolas Duchateau
- CREATIS, INSA, CNRS UMR 5220, INSERM U1294, Université Lyon 1, UJM Saint-Etienne, Lyon, France; Institut Universitaire de France, Paris, France
| | - Naveena Yanamala
- Division of Cardiovascular Disease and Hypertension, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, USA
| |
Collapse
|
47
|
Susser D, Schiff DS, Gerke S, Cabrera LY, Cohen IG, Doerr M, Harrod J, Kostick-Quenet K, McNealy J, Meyer MN, Price WN, Wagner JK. Synthetic Health Data: Real Ethical Promise and Peril. Hastings Cent Rep 2024; 54:8-13. [PMID: 39487776 PMCID: PMC11555762 DOI: 10.1002/hast.4911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2024]
Abstract
Researchers and practitioners are increasingly using machine-generated synthetic data as a tool for advancing health science and practice, by expanding access to health data while-potentially-mitigating privacy and related ethical concerns around data sharing. While using synthetic data in this way holds promise, we argue that it also raises significant ethical, legal, and policy concerns, including persistent privacy and security problems, accuracy and reliability issues, worries about fairness and bias, and new regulatory challenges. The virtue of synthetic data is often understood to be its detachment from the data subjects whose measurement data is used to generate it. However, we argue that addressing the ethical issues synthetic data raises might require bringing data subjects back into the picture, finding ways that researchers and data subjects can be more meaningfully engaged in the construction and evaluation of datasets and in the creation of institutional safeguards that promote responsible use.
Collapse
|
48
|
Kapral L, Dibiasi C, Jeremic N, Bartos S, Behrens S, Bilir A, Heitzinger C, Kimberger O. Development and external validation of temporal fusion transformer models for continuous intraoperative blood pressure forecasting. EClinicalMedicine 2024; 75:102797. [PMID: 39281101 PMCID: PMC11402414 DOI: 10.1016/j.eclinm.2024.102797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 08/02/2024] [Accepted: 08/06/2024] [Indexed: 09/18/2024] Open
Abstract
Background During surgery, intraoperative hypotension is associated with postoperative morbidity and should therefore be avoided. Predicting the occurrence of hypotension in advance may allow timely interventions to prevent hypotension. Previous prediction models mostly use high-resolution waveform data, which is often not available. Methods We utilised a novel temporal fusion transformer (TFT) algorithm to predict intraoperative blood pressure trajectories 7 min in advance. We trained the model with low-resolution data (sampled every 15 s) from 73,009 patients who were undergoing general anaesthesia for non-cardiothoracic surgery between January 1, 2017, and December 30, 2020, at the General Hospital of Vienna, Austria. The data set contained information on patient demographics, vital signs, medication, and ventilation. The model was evaluated using an internal (n = 8113) and external test set (n = 5065) obtained from the openly accessible Vital Signs Database. Findings In the internal test set, the mean absolute error for predicting mean arterial blood pressure was 0.376 standard deviations-or 4 mmHg-and 0.622 standard deviations-or 7 mmHg-in the external test set. We also adapted the TFT model to binarily predict the occurrence of hypotension as defined by mean arterial blood pressure < 65 mmHg in the next one, three, five, and 7 min. Here, model discrimination was excellent, with a mean area under the receiver operating characteristic curve (AUROC) of 0.933 in the internal test set and 0.919 in the external test set. Interpretation Our TFT model is capable of accurately forecasting intraoperative arterial blood pressure using only low-resolution data showing a low prediction error. When used for binary prediction of hypotension, we obtained excellent performance. Funding No external funding.
Collapse
Affiliation(s)
- Lorenz Kapral
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
- Technical University Vienna, Department of Informatics, Research Unit Machine Learning, Favoritenstraße 9/11, Vienna 1040 Wien, Austria
| | - Christoph Dibiasi
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
| | - Natasa Jeremic
- Medical University of Vienna, Department of Ophthalmology and Optometry, Währinger Gürtel 18-20, Vienna 1090 Wien, Austria
| | - Stefan Bartos
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
| | - Sybille Behrens
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
| | - Aylin Bilir
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
| | - Clemens Heitzinger
- Technical University Vienna, Department of Informatics, Research Unit Machine Learning, Favoritenstraße 9/11, Vienna 1040 Wien, Austria
| | - Oliver Kimberger
- Medical University of Vienna, Department of Anaesthesia, Intensive Care Medicine and Pain Medicine, Währinger Gürtel 18-20, Vienna 1090, Austria
- Ludwig Boltzmann Institute Digital Health and Patient Safety, Währinger Straße. 104/10, Vienna, 1180 Wien, Austria
| |
Collapse
|
49
|
Tokodi M, Kosztin A, Kovács A, Gellér L, Schwertner WR, Veres B, Behon A, Lober C, Bogale N, Linde C, Normand C, Dickstein K, Merkely B. Machine learning-based prediction of 1-year all-cause mortality in patients undergoing CRT implantation: validation of the SEMMELWEIS-CRT score in the European CRT Survey I dataset. EUROPEAN HEART JOURNAL. DIGITAL HEALTH 2024; 5:563-571. [PMID: 39318695 PMCID: PMC11417478 DOI: 10.1093/ehjdh/ztae051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 04/23/2024] [Accepted: 07/01/2024] [Indexed: 09/26/2024]
Abstract
Aims We aimed to externally validate the SEMMELWEIS-CRT score for predicting 1-year all-cause mortality in the European Cardiac Resynchronization Therapy (CRT) Survey I dataset-a large multi-centre cohort of patients undergoing CRT implantation. Methods and results The SEMMELWEIS-CRT score is a machine learning-based tool trained for predicting all-cause mortality in patients undergoing CRT implantation. This tool demonstrated impressive performance during internal validation but has not yet been validated externally. To this end, we applied it to the data of 1367 patients from the European CRT Survey I dataset. The SEMMELWEIS-CRT predicted 1-year mortality with an area under the receiver operating characteristic curve (AUC) of 0.729 (0.682-0.776), which concurred with the performance measured during internal validation [AUC: 0.768 (0.674-0.861), P = 0.466]. Moreover, the SEMMELWEIS-CRT score outperformed multiple conventional statistics-based risk scores, and we demonstrated that a higher predicted probability is not only associated with a higher risk of death [odds ratio (OR): 1.081 (1.061-1.101), P < 0.001] but also with an increased risk of hospitalizations for any cause [OR: 1.013 (1.002-1.025), P = 0.020] or for heart failure [OR: 1.033 (1.015-1.052), P < 0.001], a less than 5% improvement in left ventricular ejection fraction [OR: 1.033 (1.021-1.047), P < 0.001], and lack of improvement in New York Heart Association functional class compared with baseline [OR: 1.018 (1.006-1.029), P = 0.003]. Conclusion In the European CRT Survey I dataset, the SEMMELWEIS-CRT score predicted 1-year all-cause mortality with good discriminatory power, which confirms the generalizability and demonstrates the potential clinical utility of this machine learning-based risk stratification tool.
Collapse
Affiliation(s)
- Márton Tokodi
- Heart and Vascular Centre, Semmelweis University, 68 Városmajor Street, 1122 Budapest, Hungary
- Department of Surgical Research and Techniques, Semmelweis University, 68 Városmajor Street, 1122 Budapest, Hungary
| | - Annamária Kosztin
- Heart and Vascular Centre, Semmelweis University, 68 Városmajor Street, 1122 Budapest, Hungary
| | - Attila Kovács
- Heart and Vascular Centre, Semmelweis University, 68 Városmajor Street, 1122 Budapest, Hungary
- Department of Surgical Research and Techniques, Semmelweis University, 68 Városmajor Street, 1122 Budapest, Hungary
| | - László Gellér
- Heart and Vascular Centre, Semmelweis University, 68 Városmajor Street, 1122 Budapest, Hungary
| | | | - Boglárka Veres
- Heart and Vascular Centre, Semmelweis University, 68 Városmajor Street, 1122 Budapest, Hungary
| | - Anett Behon
- Heart and Vascular Centre, Semmelweis University, 68 Városmajor Street, 1122 Budapest, Hungary
| | | | - Nigussie Bogale
- Department of Heart Disease, Haukeland University Hospital, Bergen, Norway
| | - Cecilia Linde
- Division of Cardiology, Department of Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Camilla Normand
- Cardiology Division, Stavanger University Hospital, Stavanger, Norway
- Department of Quality and Health Technology, University of Stavanger, Stavanger, Norway
| | - Kenneth Dickstein
- Cardiology Division, Stavanger University Hospital, Stavanger, Norway
- Institute of Internal Medicine, University of Bergen, Bergen, Norway
| | - Béla Merkely
- Heart and Vascular Centre, Semmelweis University, 68 Városmajor Street, 1122 Budapest, Hungary
| |
Collapse
|
50
|
Ong W, Lee A, Tan WC, Fong KTD, Lai DD, Tan YL, Low XZ, Ge S, Makmur A, Ong SJ, Ting YH, Tan JH, Kumar N, Hallinan JTPD. Oncologic Applications of Artificial Intelligence and Deep Learning Methods in CT Spine Imaging-A Systematic Review. Cancers (Basel) 2024; 16:2988. [PMID: 39272846 PMCID: PMC11394591 DOI: 10.3390/cancers16172988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Revised: 08/14/2024] [Accepted: 08/26/2024] [Indexed: 09/15/2024] Open
Abstract
In spinal oncology, integrating deep learning with computed tomography (CT) imaging has shown promise in enhancing diagnostic accuracy, treatment planning, and patient outcomes. This systematic review synthesizes evidence on artificial intelligence (AI) applications in CT imaging for spinal tumors. A PRISMA-guided search identified 33 studies: 12 (36.4%) focused on detecting spinal malignancies, 11 (33.3%) on classification, 6 (18.2%) on prognostication, 3 (9.1%) on treatment planning, and 1 (3.0%) on both detection and classification. Of the classification studies, 7 (21.2%) used machine learning to distinguish between benign and malignant lesions, 3 (9.1%) evaluated tumor stage or grade, and 2 (6.1%) employed radiomics for biomarker classification. Prognostic studies included three (9.1%) that predicted complications such as pathological fractures and three (9.1%) that predicted treatment outcomes. AI's potential for improving workflow efficiency, aiding decision-making, and reducing complications is discussed, along with its limitations in generalizability, interpretability, and clinical integration. Future directions for AI in spinal oncology are also explored. In conclusion, while AI technologies in CT imaging are promising, further research is necessary to validate their clinical effectiveness and optimize their integration into routine practice.
Collapse
Affiliation(s)
- Wilson Ong
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
| | - Aric Lee
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
| | - Wei Chuan Tan
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
| | - Kuan Ting Dominic Fong
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
| | - Daoyong David Lai
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
| | - Yi Liang Tan
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
| | - Xi Zhen Low
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
- Department of Diagnostic Radiology, Yong Loo Lin School of Medicine, National University of Singapore, 10 Medical Drive, Singapore 117597, Singapore
| | - Shuliang Ge
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
- Department of Diagnostic Radiology, Yong Loo Lin School of Medicine, National University of Singapore, 10 Medical Drive, Singapore 117597, Singapore
| | - Andrew Makmur
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
- Department of Diagnostic Radiology, Yong Loo Lin School of Medicine, National University of Singapore, 10 Medical Drive, Singapore 117597, Singapore
| | - Shao Jin Ong
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
- Department of Diagnostic Radiology, Yong Loo Lin School of Medicine, National University of Singapore, 10 Medical Drive, Singapore 117597, Singapore
| | - Yong Han Ting
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
- Department of Diagnostic Radiology, Yong Loo Lin School of Medicine, National University of Singapore, 10 Medical Drive, Singapore 117597, Singapore
| | - Jiong Hao Tan
- National University Spine Institute, Department of Orthopaedic Surgery, National University Health System, 1E, Lower Kent Ridge Road, Singapore 119228, Singapore
| | - Naresh Kumar
- National University Spine Institute, Department of Orthopaedic Surgery, National University Health System, 1E, Lower Kent Ridge Road, Singapore 119228, Singapore
| | - James Thomas Patrick Decourcy Hallinan
- Department of Diagnostic Imaging, National University Hospital, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
- Department of Diagnostic Radiology, Yong Loo Lin School of Medicine, National University of Singapore, 10 Medical Drive, Singapore 117597, Singapore
| |
Collapse
|