Lal A, Pinevich Y, Gajic O, Herasevich V, Pickering B. Artificial intelligence and computer simulation models in critical illness. World J Crit Care Med 2020; 9(2): 13-19 [PMID: 32577412 DOI: 10.5492/wjccm.v9.i2.13]
Corresponding Author of This Article
Amos Lal, MBBS, Department of Medicine, Division of Pulmonary and Critical Care Medicine, Multidisciplinary Epidemiology and Translational Research in Intensive Care Group, Mayo Clinic, 200 1st St SW, Rochester, MN 55905, United States. lal.amos@mayo.edu
Research Domain of This Article
Critical Care Medicine
Article-Type of This Article
Minireviews
Open-Access Policy of This Article
This article is an open-access article which was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Amos Lal, Ognjen Gajic, Department of Medicine, Division of Pulmonary and Critical Care Medicine, Rochester, Mayo Clinic, MN 55905, United States
Amos Lal, Yuliya Pinevich, Ognjen Gajic, Vitaly Herasevich, Brian Pickering, Multidisciplinary Epidemiology and Translational Research in Intensive Care Group, Mayo Clinic, Rochester, MN 55905, United States
Yuliya Pinevich, Vitaly Herasevich, Brian Pickering, Department of Anesthesiology and Perioperative Medicine, Division of Critical Care, Mayo Clinic, Rochester, MN 55905, United States
Author contributions: All authors equally contributed to this paper.
Conflict-of-interest statement: The authors declare that they have no conflict of interest.
Open-Access: This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
Corresponding author: Amos Lal, MBBS, Department of Medicine, Division of Pulmonary and Critical Care Medicine, Multidisciplinary Epidemiology and Translational Research in Intensive Care Group, Mayo Clinic, 200 1st St SW, Rochester, MN 55905, United States. lal.amos@mayo.edu
Received: December 31, 2019 Peer-review started: December 31, 2019 First decision: March 28, 2020 Revised: April 21, 2020 Accepted: May 12, 2020 Article in press: May 12, 2020 Published online: June 5, 2020 Processing time: 156 Days and 12.6 Hours
Abstract
Widespread implementation of electronic health records has led to the increased use of artificial intelligence (AI) and computer modeling in clinical medicine. The early recognition and treatment of critical illness are central to good outcomes but are made difficult by, among other things, the complexity of the environment and the often non-specific nature of the clinical presentation. Increasingly, AI applications are being proposed as decision supports for busy or distracted clinicians, to address this challenge. Data driven “associative” AI models are built from retrospective data registries with missing data and imprecise timing. Associative AI models lack transparency, often ignore causal mechanisms, and, while potentially useful in improved prognostication, have thus far had limited clinical applicability. To be clinically useful, AI tools need to provide bedside clinicians with actionable knowledge. Explicitly addressing causal mechanisms not only increases validity and replicability of the model, but also adds transparency and helps gain trust from the bedside clinicians for real world use of AI models in teaching and patient care.
Core tip: Widespread implementation of electronic health records coupled with increased computer power has led to the increased use of artificial intelligence and computer modeling in clinical medicine. To be clinically useful, artificial intelligence models need to be built on accurate data, take into consideration causal mechanisms, and provide actionable information at the point of care.
Citation: Lal A, Pinevich Y, Gajic O, Herasevich V, Pickering B. Artificial intelligence and computer simulation models in critical illness. World J Crit Care Med 2020; 9(2): 13-19
The complex nature of critical illness calls for an exploration of alternative approaches to assist clinicians in their timely diagnosis and management. Artificial intelligence (AI) applications have transformed various human domains from economics to traffic and have recently been introduced into health care.
AI IN HEALTH CARE
Widespread implementation of electronic health records (EHRs) has led to the increased use of AI and computer modeling in clinical medicine. The hope is that these techniques will prove superior to traditional epidemiologic and statistical approaches and will unlock insights that lead to the development of new treatment recommendations and prediction models. AI can be defined as the field of computer science that enables computers to perform the human cognitive tasks[1]. The interest in AI and systems science methodologies in the research community has grown rapidly in recent years[2]. Specific AI applications of interest to critical care include machine and deep learning algorithms, “in silico” simulation models, and “digital twins”.
Machine learning
Machine learning (ML) is an application of AI that develops statistical analysis models using computational technologies applied to big data[3]. The following learning techniques could be used: (1) Supervised learning techniques include but are not limited to linear regression, decision trees, and Naive Bayes. The models developed based on these are normally used for anomaly detection with the use of algorithm approximating a known output with a higher accuracy from a labeled data set, for example: Electrocardiogram interpretation by the automated machine or detection of a lung nodule from a chest X ray or a CT scan based on pattern recognition[4,5]. The aim of models developed using this technique is to decipher rules and latent relationships within data. “Support Vector Machine” is an example of supervised ML algorithm which is used for both classification and regression challenges and give a different dimension to the ensemble models. They are crucial in cases which require high predictive power but these algorithms are hard to visualize due to the complexity in formulation; (2) Unsupervised learning: Unsupervised ML models are developed using clustering techniques which includes segmenting data by some shared attributes, detecting anomalies that do not fit to any group and simplifying datasets by aggregating variables with similar attributes. The main goal is to study and determine the intrinsic and often hidden structure of the data. These models use algorithms on unlabeled data with no outputs to predict but are exploratory and intend to find naturally occurring patterns within the data[6]. This technique can be condensed in two major types of problems that unsupervised ML models try to solve, clustering and dimensionality reduction; (3) Semi-supervised learning uses a dataset with unlabeled as well as labeled data to increase the learning precision and appropriate prediction of label function. Further the model is trained and retrained with the estimated labels from the previous step[7]. These semi-supervised ML models are commonly used in medicine such as in voice recognition (medical dictation applications), data mining, and video surveillance (used in e-ICUs)[8,9]; and (4) Reinforcement learning: Reinforcement ML algorithms learn by observing the result of an action taken by the algorithm and applying a similar algorithm where the data are limited or missing[10]. The algorithm iteratively learns from previous response (reward or penalty) and acts with a goal to receive maximum reward in the future.
Deep learning
Deep learning (DL) refers to the automatic determination and processing of the parameters in a network, on the basis of experience. DL is a ML technique that is designed with multiple layers of neurons, including input and output layers, and so-called “hidden layers”[11]. This idea of hidden layers (neural network) is inherited from a popular engineering and cognitive science topic since the 1980s[12,13]. The input data is passed through the layers, and the complexity of output function increases from layer to layer. In the recent past, the use of DL models in medicine has introduced the idea of data analytic modeling from expert-driven feature to data-driven feature. Large and complex databases (with longitudinal event sequences and continuous data points) have made it possible to train complex DL models. These models developed from large and complex databases with multiple hidden neural layers provide limited transparency to the users and are aptly described as “black box” models. The user of “black box” AI knows inputs and understands outcomes of the model, but how the output value was generated is unknown. These DL models are most commonly utilized in the field of medicine for following categories of analytical tasks: (1) Disease detection or classification, where DL models are used to detect a specific disease(s) with the help of data mining from EHR[14]; (2) Sequential prediction of clinical events, where DL models predict future clinical events learning from the previous event sequences[15]; (3) Concept embedding, where DL models derive feature representation of clinical concepts algorithmically from the EHR data[16]; (4) Data augmentation, where DL models create realistic data elements for the use in clinical research or otherwise based on real EHR data[17]; and (5) EHR data privacy, where DL models derive techniques to protect patient EHR privacy by de-identification[18].
In simpler words, it would be easier to understand the relationship of AI, ML and DL by visualizing them as 3 concentric circles with DL being the innermost circle which is a subset of ML. ML in turn is a part of the greater all-encompassing concept of AI (thus AI fits inside both ML and DL).
In silico simulation models and digital twins
“In silico” experimentation or simulation involves mathematical and computer based exemplifications to construct models[19]. Computer based experiments can then be carried out to conduct investigations of hypotheses in a virtual environment without actually involving human subjects. The Archimedes model illustrates the use of mathematical techniques to reproduce the complex nature of disease[20]. The core model is a set of ordinary differential equations, which represent the physiologic, clinical, and social pathways that are relevant to diabetes and diabetes-related complications. The use of causal pathways (i.e., Disease Acyclic Graphs) distinguishes Archimedes from conventional, associative AI models[17]. Digital twin is a type of simulation model that combines current data from the object with its simulation model to enhance insight and assist with decision making[21,22]. The digital twin has proven to be effective in industry and transportation, such as gas turbine fleet, rail fleet, and production line. The advantage of this approach is the ability to get the representative operational updates from the real-world object that allows model to give an accurate prediction and to give the feedback to the real-world state directly to make operational changes.
AI IN CRITICAL CARE
Critical illness offers a number of advantages for the developers of AI models compared to chronic disease, such as the availability of large quantities of qualitative and quantitative data and relatively short trajectory of critical illness to a stable outcome. This results in the possible iterative testing of hypotheses raised by simulation modeling in independent patient cohorts. For example, recently, a group of computer scientists and clinicians from the Imperial College, London, United Kingdom used an AI approach to develop a decision support model aptly named AI Clinician[23]. Using reinforcement learning (RL), AI Clinician is designed to assist with optimal treatment interventions for sepsis in real-time. It was developed and validated in two clinical databases: MIMIC-III and e-ICU research database[24,25]. Similar methodology has recently been applied to the continuous prediction of acute kidney injury (AKI)[26]. Tools that are developed based on the current AI models have low specificity in predicting the intervention points for real life sepsis patients. This is one of the major obstacles faced by AI models for treating the critically ill patients. While most of the currently devised models are based on the retrospective data from the data banks, the accuracy and performance of these algorithms on real-time data may not achieve the same level. Patient privacy concerns and question of responsibility may preclude rapid integration of AI models into current ICU practice. High heterogeneity of patients and their specific needs could be easily illustrated by managing a patient on mechanical ventilation. “Intelligent” ventilation modes may do more harm than good without thorough supervision by a specialist.
The above examples highlight a new approach to predictive and prognostic analytics in the area of critical care. Although these models yielded clinically plausible results, major shortcomings limit inferences and use in the real world of bedside clinical medicine. First, built exclusively from retrospective EHR data, the models suffer from missing data and imprecise timing (back charting) particularly during the initial, golden hours of critical illness. Not unlike retrospective studies using traditional methods (logistic regression), the output results are only hypothesis raising and require prospective confirmation.
PROGNOSTIC (ASSOCIATIVE) VS PREDICTIVE (ACTIONABLE) AI MODELS
While offering marginal improvements in performance over traditional epidemiological or logistic regression approaches, associative AI models generally underperform in the live clinical setting and struggle to breach the threshold of usefulness for most clinicians[27]. Even accurate prognostic enrichment (classifying patients with high or low likelihood of death or AKI) is of limited value to the bedside clinician. For example, the prediction model of AKI does not provide any predictive enrichment with regards to potential intervention[26]. For example, will my patient benefit from a red cell transfusion, or continuous vs intermittent renal replacement?
Predicting the risk vs the benefit of a particular treatment (i.e., actionable AI) is more difficult. Differences between associative and inquisitive/actionable AI are highlighted in Table 1. In contrast to “black box“ associative AI, actionable AI models should explicitly address causal relationships[28]. Directed acyclic graphs – (DAG) approach has been increasingly used to address causal relationships in different research domains[29]. DAGs facilitate integration of expert knowledge into data driven AI models and are well suited for building advanced AI algorithms and simulation models.
Table 1 Differences between associative artificial intelligence and actionable artificial intelligence models.
Models based on associative artificial intelligence
Models based on actionable artificial intelligence
These applications are built using available historical public or institutional data repositories[26,31,32].
These applications are built more often on the prospectively collected data points, predicting risk vs benefit of a particular treatment or intervention[17,30,33,34].
Developed using the data points that are collected prospectively in real-time[30,34].
Purely data driven associative models often without explicit consideration of causal pathways[37-39].
These models are developed with an understanding based on the underlying causal pathways, therefore providing greater clinical utility and accuracy[40-42].
Representative examples: Development and validation of a data driven tool to predict sepsis based on vital signs by Mao et al[43]. Provides no actionable benefit to the bedside clinician. Similarly, a model developed to predict AKI in a patient based on retrospectively collected dataset from electronic health records by Tomasev et al[26]. The model was associated with high false positive alerts (2 false positive alerts for each true alert).
Representative examples: Improving the safety of ventilator care by avoiding ventilator-induced lung injury. Electronic algorithm based on near real-time data and notification of bedside providers giving actionable information, developed by Herasevich et al[33]. Artificial neural network based model developed for forecasting ICP for medical decision support, by Zhang et al[42]. This model provided actionable treatment planning for patients based on the predicted future trends of ICP.
Bayesian networks are DAGs whose nodes represent variables in the Bayesian sense: They may be observable quantities, latent variables, unknown parameters or hypotheses. Edges represent conditional dependencies; nodes that are not connected (no path connects one node to another) represent variables that are conditionally independent of each other. Each node is associated with a probability function that takes, as input, a particular set of values for the node's parent variables, and gives (as output) the probability (or probability distribution, if applicable) of the variable represented by the node. Directed acyclic graphical model is a probabilistic graphical model (a type of statistical model) that represents a set of variables and their conditional dependencies – also known as the Bayesian network Model.
Unidirectional arrows of DAGs are based on known causal effects (and prior knowledge) (Figure 1). DAGs enable clear representation and better understanding of the key concepts of exposure, outcome, causation, confounding, and bias. DAGs are built as simple integers of physiology as a basis to building complex patterns for seamless functionality of a simulation model and AI application. One of the advantages of using multiple basic DAGs to build a complex model is that, the model can be easily disassembled as individual components (DAGs) to ensure that the complex model can be better understood and refined as necessary.
In a complex critical care environment clinicians are challenged with making decisions with a high degree of uncertainty under time constraints. Data driven associative AI models hold promise for better prognostication and to augment the diagnostic process but thus far have not been proven useful for bedside clinicians. Transparency of the model in terms of analytics and algorithms is important for patient safety and to earn the trust of the treating clinician[30]. Actionable AI models are more challenging to build and require explicit consideration of causal mechanisms. Accurate prediction of the response to treatment or intervention without exposing the patients to potential risks is an ultimate AI challenge for the benefit of patient and clinicians alike.
ACKNOWLEDGEMENTS
We thank Mrs Ann Johnson for help with proofreading and language editing of the manuscript.
Gold M, Beitsch L, Essien J. For the public’s health: The role of measurement in action and accountability. National Academies Press website. 2011.
Available from: https://www.nap.edu/read/13005.
[PubMed] [DOI][Cited in This Article: ]
El Naqa I, Murphy MJ. What Is Machine Learning?
What Is Machine Learning? In: El Naqa I, Li R, Murphy M, editors. Machine Learning in Radiation Oncology: Theory and Applications. Cham: Springer International Publishing, 2015: 3-11.
[PubMed] [DOI][Cited in This Article: ]
Rumelhart DE, Mcclelland JL. Parallel distributed processing: Explorations in the microstructure of cognition.
Bradford: MIT Press, 1986.
[PubMed] [DOI][Cited in This Article: ]
Tomašev N, Glorot X, Rae JW, Zielinski M, Askham H, Saraiva A, Mottram A, Meyer C, Ravuri S, Protsyuk I, Connell A, Hughes CO, Karthikesalingam A, Cornebise J, Montgomery H, Rees G, Laing C, Baker CR, Peterson K, Reeves R, Hassabis D, King D, Suleyman M, Back T, Nielson C, Ledsam JR, Mohamed S. A clinically applicable approach to continuous prediction of future acute kidney injury.Nature. 2019;572:116-119.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 662][Cited by in F6Publishing: 520][Article Influence: 104.0][Reference Citation Analysis (0)]
Lederer DJ, Bell SC, Branson RD, Chalmers JD, Marshall R, Maslove DM, Ost DE, Punjabi NM, Schatz M, Smyth AR, Stewart PW, Suissa S, Adjei AA, Akdis CA, Azoulay É, Bakker J, Ballas ZK, Bardin PG, Barreiro E, Bellomo R, Bernstein JA, Brusasco V, Buchman TG, Chokroverty S, Collop NA, Crapo JD, Fitzgerald DA, Hale L, Hart N, Herth FJ, Iwashyna TJ, Jenkins G, Kolb M, Marks GB, Mazzone P, Moorman JR, Murphy TM, Noah TL, Reynolds P, Riemann D, Russell RE, Sheikh A, Sotgiu G, Swenson ER, Szczesniak R, Szymusiak R, Teboul JL, Vincent JL. Control of Confounding and Reporting of Results in Causal Inference Studies. Guidance for Authors from Editors of Respiratory, Sleep, and Critical Care Journals.Ann Am Thorac Soc. 2019;16:22-28.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 314][Cited by in F6Publishing: 458][Article Influence: 114.5][Reference Citation Analysis (0)]
Ginestra JC, Giannini HM, Schweickert WD, Meadows L, Lynch MJ, Pavan K, Chivers CJ, Draugelis M, Donnelly PJ, Fuchs BD, Umscheid CA. Clinician Perception of a Machine Learning-Based Early Warning System Designed to Predict Severe Sepsis and Septic Shock.Crit Care Med. 2019;47:1477-1484.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 62][Cited by in F6Publishing: 76][Article Influence: 19.0][Reference Citation Analysis (0)]
Herasevich V, Tsapenko M, Kojicic M, Ahmed A, Kashyap R, Venkata C, Shahjehan K, Thakur SJ, Pickering BW, Zhang J, Hubmayr RD, Gajic O. Limiting ventilator-induced lung injury through individual electronic medical record surveillance.Crit Care Med. 2011;39:34-39.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 62][Cited by in F6Publishing: 65][Article Influence: 5.0][Reference Citation Analysis (0)]
Giannini HM, Ginestra JC, Chivers C, Draugelis M, Hanish A, Schweickert WD, Fuchs BD, Meadows L, Lynch M, Donnelly PJ, Pavan K, Fishman NO, Hanson CW, Umscheid CA. A Machine Learning Algorithm to Predict Severe Sepsis and Septic Shock: Development, Implementation, and Impact on Clinical Practice.Crit Care Med. 2019;47:1485-1492.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 92][Cited by in F6Publishing: 126][Article Influence: 31.5][Reference Citation Analysis (0)]
Gao Y, Xu A, Hu PJH, Cheng TH. Incorporating association rule networks in feature category-weighted naive Bayes model to support weaning decision making.Decis Support Syst. 2017;96:27-38.
[PubMed] [DOI][Cited in This Article: ]
Zheng B, Zhang J, Yoon SW, Lam SS, Khasawneh M, Poranki S. Predictive modeling of hospital readmissions using metaheuristics and data mining.Expert Sys Appl. 2015;42:7110-120.
[PubMed] [DOI][Cited in This Article: ]
Chen L, Dubrawski A, Wang D, Fiterau M, Guillame-Bert M, Bose E, Kaynar AM, Wallace DJ, Guttendorf J, Clermont G, Pinsky MR, Hravnak M. Using Supervised Machine Learning to Classify Real Alerts and Artifact in Online Multisignal Vital Sign Monitoring Data.Crit Care Med. 2016;44:e456-e463.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 48][Cited by in F6Publishing: 50][Article Influence: 7.1][Reference Citation Analysis (0)]
Li A, Lewis M, Lebiere C, Sycara K, Khatib SS, Tang Y, Siedsma M, Morrison D, editors. A computational model based on human performance for fluid management in critical care.
2016 IEEE Symposium Series on Computational Intelligence (SSCI); 2002, December 6-9, Athens, Greece.
[PubMed] [DOI][Cited in This Article: ]
Mao Q, Jay M, Hoffman JL, Calvert J, Barton C, Shimabukuro D, Shieh L, Chettipally U, Fletcher G, Kerem Y, Zhou Y, Das R. Multicentre validation of a sepsis prediction algorithm using only vital sign data in the emergency department, general ward and ICU.BMJ Open. 2018;8:e017833.
[PubMed] [DOI][Cited in This Article: ][Cited by in Crossref: 178][Cited by in F6Publishing: 166][Article Influence: 27.7][Reference Citation Analysis (0)]