1
|
Shao L, Wang J, Li L, Liu Y, Liu Z, Song B, Li S. ADEPT: An advanced data exploration and processing tool for clinical data insights. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2025; 268:108860. [PMID: 40403533 DOI: 10.1016/j.cmpb.2025.108860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2025] [Revised: 05/12/2025] [Accepted: 05/16/2025] [Indexed: 05/24/2025]
Abstract
BACKGROUND AND OBJECTIVE The rapid growth of clinical data creates challenges in analysis and interpretation for medical professionals. To address these issues, we developed the Advanced Data Exploration and Processing Tool (ADEPT), integrating data preprocessing, modeling, visualization, and statistical reporting to resolve common problems like inaccurate terminology, outliers, and missing values. METHODS ADEPT incorporates advanced preprocessing, including standardizing numerical values, detecting outliers via Isolation Forest and DBSCAN, and filling missing data with KNN and MissForest. Tokenized text features are processed through keyword-based classification and K-means clustering. Five machine learning models-Gradient Boosting Machine, Random Forest, Extreme Gradient Boosting, Logistic Regression, and Support Vector Machine-are combined with a dynamic voting mechanism. Performance was assessed using precision, sensitivity, and specificity. RESULTS ADEPT demonstrated substantial performance improvements, with the Area Under the Curve (AUC) increasing by over 14 %. Key results include enhanced precision, sensitivity, and specificity, validating the tool's ability to extract valuable insights from complex datasets. CONCLUSIONS ADEPT offers a comprehensive solution for automated clinical data analysis, combining rigorous preprocessing with advanced modeling. Its dynamic voting mechanism and integrated tools enhance accuracy and interpretability, addressing critical challenges in clinical data management and decision-making.
Collapse
Affiliation(s)
- Lingyu Shao
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou 221004, Jiangsu, China.
| | - Jiarui Wang
- The Second Affiliated Hospital of Xuzhou Medical University, Xuzhou 221006, Jiangsu, China.
| | - Lin Li
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou 221004, Jiangsu, China.
| | - Yujie Liu
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou 221004, Jiangsu, China
| | - Zhaoqing Liu
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou 221004, Jiangsu, China
| | - Boming Song
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou 221004, Jiangsu, China.
| | - Shuyan Li
- School of Medical Informatics and Engineering, Xuzhou Medical University, Xuzhou 221004, Jiangsu, China.
| |
Collapse
|
2
|
Lin N, Abbas-Aghababazadeh F, Su J, Wu AJ, Lin C, Shi W, Xu W, Haibe-Kains B, Liu FF, Kwan JYY. Development of Machine Learning Models for Predicting Radiation Dermatitis in Breast Cancer Patients Using Clinical Risk Factors, Patient-Reported Outcomes, and Serum Cytokine Biomarkers. Clin Breast Cancer 2025:S1526-8209(25)00048-5. [PMID: 40155248 DOI: 10.1016/j.clbc.2025.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2024] [Revised: 02/27/2025] [Accepted: 03/01/2025] [Indexed: 04/01/2025]
Abstract
BACKGROUND Radiation dermatitis (RD) is a significant side effect of radiotherapy experienced by breast cancer patients. Severe symptoms include desquamation or ulceration of irradiated skin, which impacts quality of life and increases healthcare costs. Early identification of patients at risk for severe RD can facilitate preventive management and reduce severe symptoms. This study evaluated the utility of subjective and objective factors, such as patient-reported outcomes (PROs) and serum cytokines, for predicting RD in breast cancer patients. The performance of machine learning (ML) and logistic regression-based models were compared. PATIENTS AND METHODS Data from 147 breast cancer patients who underwent radiotherapy was analyzed to develop prognostic models. ML algorithms, including neural networks, random forest, XGBoost, and logistic regression, were employed to predict clinically significant Grade 2+ RD. Clinical factors, PROs, and cytokine biomarkers were incorporated into the risk models. Model performance was evaluated using nested cross-validation with separate loops for hyperparameter tuning and calculating performance metrics. RESULTS Feature selection identified 18 predictors of Grade 2+ RD including smoking, radiotherapy boost, reduced motivation, and the cytokines interleukin-4, interleukin-17, interleukin-1RA, interferon-gamma, and stromal cell-derived factor-1a. Incorporating these predictors, the XGBoost model achieved the highest performance with an area under the curve (AUC) of 0.780 (95% CI: 0.701-0.854). This was not significantly improved over the logistic regression model, which demonstrated an AUC of 0.714 (95% CI: 0.629-0.798). CONCLUSION Clinical risk factors, PROs, and serum cytokine levels provide complementary prognostic information for predicting severe RD in breast cancer patients undergoing radiotherapy. ML and logistic regression models demonstrated comparable performance for predicting clinically significant RD with AUC>0.70.
Collapse
Affiliation(s)
- Neil Lin
- Institute of Medical Science, Temerty Faculty of Medicine, University of Toronto, Toronto, Canada; Temerty Faculty of Medicine, University of Toronto, Toronto, Canada
| | - Farnoosh Abbas-Aghababazadeh
- Princess Margaret Bioinformatics and Computational Genomics Laboratory, University Health Network, Toronto, Canada
| | - Jie Su
- Biostatistics Division, Princess Margaret Cancer Centre, Toronto, Canada
| | - Alison J Wu
- Temerty Faculty of Medicine, University of Toronto, Toronto, Canada
| | - Cherie Lin
- Institute of Medical Science, Temerty Faculty of Medicine, University of Toronto, Toronto, Canada
| | - Wei Shi
- Research Institute, Princess Margaret Cancer Centre, Toronto, Canada
| | - Wei Xu
- Biostatistics Division, Princess Margaret Cancer Centre, Toronto, Canada
| | - Benjamin Haibe-Kains
- Princess Margaret Bioinformatics and Computational Genomics Laboratory, University Health Network, Toronto, Canada; Research Institute, Princess Margaret Cancer Centre, Toronto, Canada; Department of Computer Science, University of Toronto, Toronto, Canada; Ontario Institute for Cancer Research, Toronto, Canada; Vector Institute for Artificial Intelligence, Toronto, Canada; Department of Medical Biophysics, Temerty Faculty of Medicine, University of Toronto, Toronto, Canada
| | - Fei-Fei Liu
- Research Institute, Princess Margaret Cancer Centre, Toronto, Canada; Department of Radiation Oncology, Temerty Faculty of Medicine, University of Toronto, Toronto, Canada; Department of Medical Biophysics, Temerty Faculty of Medicine, University of Toronto, Toronto, Canada; Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, Canada
| | - Jennifer Y Y Kwan
- Institute of Medical Science, Temerty Faculty of Medicine, University of Toronto, Toronto, Canada; Research Institute, Princess Margaret Cancer Centre, Toronto, Canada; Department of Radiation Oncology, Temerty Faculty of Medicine, University of Toronto, Toronto, Canada; Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, Canada.
| |
Collapse
|
3
|
D'Auria D, Bettini F, Tognarelli S, Calvanese D, Menciassi A. Technologies and main functionalities of the telemonitoring application reCOVeryaID. Front Big Data 2024; 7:1360092. [PMID: 39104732 PMCID: PMC11298383 DOI: 10.3389/fdata.2024.1360092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 07/08/2024] [Indexed: 08/07/2024] Open
Abstract
The COVID-19 pandemic has highlighted the need to take advantage of specific and effective patient telemonitoring platforms, with specific reference to the constant monitoring of vital parameters of patients most at risk. Among the various applications developed in Italy, certainly there is reCOVeryaID, a web application aimed at remotely monitoring patients potentially, currently or no longer infected with COVID-19. Therefore, in this paper we present a system model, consisting of a multi-platform intelligent telemonitoring application, that enables remote monitoring and provision of integrated home care to both patients symptomatic, asymptomatic and pre-symptomatic with severe acute respiratory infectious disease or syndrome caused by viruses belonging to the Coronavirus family, as well as simply to people with respiratory problems and/or related diseases (chronic obstructive pulmonary disease or asthma). In fact, in this paper we focus on exposing the technologies and various functionalities offered by the system, which constitute the practical implementation of the theoretical framework described in detail in another paper. Specifically, the reCOVeryaID telemonitoring application is a stand-alone, knowledge base-supported application that can promptly react and inform physicians if dangerous trends in a patient's short- and long-term vital signs are detected, thus enabling them to be monitored continuously, both in the hospital and at home. The paper also reports an evaluation of user satisfaction, carried out by actual patients and medical doctors.
Collapse
Affiliation(s)
- Daniela D'Auria
- Faculty of Engineering, Free University of Bozen-Bolzano, Bolzano, Italy
| | - Fabio Bettini
- Faculty of Engineering, Free University of Bozen-Bolzano, Bolzano, Italy
| | | | - Diego Calvanese
- Faculty of Engineering, Free University of Bozen-Bolzano, Bolzano, Italy
- Department of Computing Science, Umeå University, Umeå, Sweden
| | | |
Collapse
|
4
|
Scott IA, De Guzman KR, Falconer N, Canaris S, Bonilla O, McPhail SM, Marxen S, Van Garderen A, Abdel-Hafez A, Barras M. Evaluating automated machine learning platforms for use in healthcare. JAMIA Open 2024; 7:ooae031. [PMID: 38863963 PMCID: PMC11165368 DOI: 10.1093/jamiaopen/ooae031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 03/06/2024] [Accepted: 04/22/2024] [Indexed: 06/13/2024] Open
Abstract
Objective To describe development and application of a checklist of criteria for selecting an automated machine learning (Auto ML) platform for use in creating clinical ML models. Materials and Methods Evaluation criteria for selecting an Auto ML platform suited to ML needs of a local health district were developed in 3 steps: (1) identification of key requirements, (2) a market scan, and (3) an assessment process with desired outcomes. Results The final checklist comprising 21 functional and 6 non-functional criteria was applied to vendor submissions in selecting a platform for creating a ML heparin dosing model as a use case. Discussion A team of clinicians, data scientists, and key stakeholders developed a checklist which can be adapted to ML needs of healthcare organizations, the use case providing a relevant example. Conclusion An evaluative checklist was developed for selecting Auto ML platforms which requires validation in larger multi-site studies.
Collapse
Affiliation(s)
- Ian A Scott
- Centre for Health Services Research, University of Queensland, Brisbane, 4102, Australia
- Department of Internal Medicine and Clinical Epidemiology, Princess Alexandra Hospital, Brisbane, 4102, Australia
| | - Keshia R De Guzman
- Department of Pharmacy, Princess Alexandra Hospital, Brisbane, 4102, Australia
- School of Pharmacy, The University of Queensland, Brisbane, 4102, Australia
| | - Nazanin Falconer
- Department of Pharmacy, Princess Alexandra Hospital, Brisbane, 4102, Australia
- School of Pharmacy, The University of Queensland, Brisbane, 4102, Australia
| | - Stephen Canaris
- Digital Health and Informatics, Metro South Health, Brisbane, 4102, Australia
| | - Oscar Bonilla
- Digital Health and Informatics, Metro South Health, Brisbane, 4102, Australia
| | - Steven M McPhail
- Digital Health and Informatics, Metro South Health, Brisbane, 4102, Australia
- Australian Centre for Health Services Innovation and Centre for Healthcare Transformation, School of Public Health and Social Work, Queensland University of Technology, Brisbane, 4059, Australia
| | - Sven Marxen
- Pharmacy Service, Logan and Beaudesert Hospitals, Logan, 4131, Australia
| | - Aaron Van Garderen
- Digital Health and Informatics, Metro South Health, Brisbane, 4102, Australia
- Pharmacy Service, Logan and Beaudesert Hospitals, Logan, 4131, Australia
| | - Ahmad Abdel-Hafez
- Digital Health and Informatics, Metro South Health, Brisbane, 4102, Australia
- Australian Centre for Health Services Innovation and Centre for Healthcare Transformation, School of Public Health and Social Work, Queensland University of Technology, Brisbane, 4059, Australia
| | - Michael Barras
- Department of Pharmacy, Princess Alexandra Hospital, Brisbane, 4102, Australia
- School of Pharmacy, The University of Queensland, Brisbane, 4102, Australia
| |
Collapse
|
5
|
Sokolski M, Trenson S, Reszka K, Urban S, Sokolska JM, Biering-Sørensen T, Højbjerg Lassen MC, Skaarup KG, Basic C, Mandalenakis Z, Ablasser K, Rainer PP, Wallner M, Rossi VA, Lilliu M, Loncar G, Cakmak HA, Ruschitzka F, Flammer AJ. Phenotype clustering of hospitalized high-risk patients with COVID-19 - a machine learning approach within the multicentre, multinational PCHF-COVICAV registry. Cardiol J 2024; 31:512-521. [PMID: 38832553 PMCID: PMC11374323 DOI: 10.5603/cj.98489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2023] [Accepted: 04/10/2024] [Indexed: 06/05/2024] Open
Abstract
IMTRODUCTION The high-risk population of patients with cardiovascular (CV) disease or risk factors (RF) suffering from COVID-19 is heterogeneous. Several predictors for impaired prognosis have been identified. However, with machine learning (ML) approaches, certain phenotypes may be confined to classify the affected population and to predict outcome. This study aimed to phenotype patients using unsupervised ML technique within the International Postgraduate Course Heart Failure Registry for patients hospitalized with COVID-19 and Cardiovascular disease and/or RF (PCHF-COVICAV). MATERIAL AND METHODS Patients from the eight centres with follow-up data available from the PCHF-COVICAV registry were included in this ML analysis (K-medoids algorithm). RESULTS Out of 617 patients included into the prospective part of the registry, 458 [median age: 76 (IQR:65-84) years, 55% male] were analyzed and 46 baseline variables, including demographics, clinical status, comorbidities and biochemical characteristics were incorporated into the ML. Three clusters were extracted by this ML method. Cluster 1 (n = 181) represents mainly women with the least number of overall comorbidities and cardiovascular RF. Cluster 2 (n = 227) is characterized mainly by men with non-CV conditions and less severe symptoms of infection. Cluster 3 (n=50) mainly represents men with the highest prevalence of cardiac comorbidities and RF, more extensive inflammation and organ dysfunction with the highest 6-month all-cause mortality risk. CONCLUSIONS The ML process has identified three important clinical clusters from hospitalized COVID-19 CV and/or RF patients. The cluster of males with severe CV disease, particularly HF, and multiple RF presenting with increased inflammation had a particularly poor outcome.
Collapse
Affiliation(s)
- Mateusz Sokolski
- Wroclaw Medical University, Faculty of Medicine, Institute of Heart Diseases, Wroclaw, Poland and Intitute of Heart Diseases, University Hospital, Wroclaw, Poland.
| | - Sander Trenson
- Department of Cardiology, Sint-Jan Hospital Bruges, Bruges, Belgium
| | - Konrad Reszka
- Wroclaw Medical University, Faculty of Medicine, Institute of Heart Diseases, Wroclaw, Poland and Intitute of Heart Diseases, University Hospital, Wroclaw, Poland
| | - Szymon Urban
- Wroclaw Medical University, Faculty of Medicine, Institute of Heart Diseases, Wroclaw, Poland and Intitute of Heart Diseases, University Hospital, Wroclaw, Poland
| | - Justyna M Sokolska
- Wroclaw Medical University, Faculty of Medicine, Institute of Heart Diseases, Wroclaw, Poland and Intitute of Heart Diseases, University Hospital, Wroclaw, Poland
| | - Tor Biering-Sørensen
- Department of Cardiology, Copenhagen University Hospital - Herlev & Gentofte, Copenhagen, Denmark
| | - Mats C Højbjerg Lassen
- Department of Cardiology, Copenhagen University Hospital - Herlev & Gentofte, Copenhagen, Denmark
| | | | - Carmen Basic
- Department of Molecular and Clinical Medicine, Institute of Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Zacharias Mandalenakis
- Department of Molecular and Clinical Medicine, Institute of Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | | | - Peter P Rainer
- Division of Cardiology, Medical University of Graz, Austria
| | - Markus Wallner
- Division of Cardiology, Medical University of Graz, Austria
- Cardiovascular Research Center, Lewis Katz School of Medicine, Temple University, Philadelphia, United States
- Center for Biomarker Research in Medicine, CBmed GmbH, Graz, Austria
| | - Valentina A Rossi
- Department of Cardiology, University Heart Center, University Hospital, Zurich, Switzerland
| | - Marzia Lilliu
- Division of Infectious Diseases, Azienda ULSS 9, M. Magalini Hospital, Villafranca di Verona, Verona, Italy
| | - Goran Loncar
- Institute for Cardiovascular Diseases Dedinje, Faculty of Medicine, University of Belgrade, Belgrade, Serbia
| | - Huseyin A Cakmak
- Department of Cardiology, Mustafakemalpasa State Hospital, Bursa, Türkiye
| | - Frank Ruschitzka
- Department of Cardiology, University Heart Center, University Hospital, Zurich, Switzerland
| | - Andreas J Flammer
- Department of Cardiology, University Heart Center, University Hospital, Zurich, Switzerland
| |
Collapse
|
6
|
Chen M, Wu Y, Wingerd B, Liu Z, Xu J, Thakkar S, Pedersen TJ, Donnelly T, Mann N, Tong W, Wolfinger RD, Bao W. Automatic text classification of drug-induced liver injury using document-term matrix and XGBoost. Front Artif Intell 2024; 7:1401810. [PMID: 38887604 PMCID: PMC11181907 DOI: 10.3389/frai.2024.1401810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Accepted: 05/17/2024] [Indexed: 06/20/2024] Open
Abstract
Introduction Regulatory agencies generate a vast amount of textual data in the review process. For example, drug labeling serves as a valuable resource for regulatory agencies, such as U.S. Food and Drug Administration (FDA) and Europe Medical Agency (EMA), to communicate drug safety and effectiveness information to healthcare professionals and patients. Drug labeling also serves as a resource for pharmacovigilance and drug safety research. Automated text classification would significantly improve the analysis of drug labeling documents and conserve reviewer resources. Methods We utilized artificial intelligence in this study to classify drug-induced liver injury (DILI)-related content from drug labeling documents based on FDA's DILIrank dataset. We employed text mining and XGBoost models and utilized the Preferred Terms of Medical queries for adverse event standards to simplify the elimination of common words and phrases while retaining medical standard terms for FDA and EMA drug label datasets. Then, we constructed a document term matrix using weights computed by Term Frequency-Inverse Document Frequency (TF-IDF) for each included word/term/token. Results The automatic text classification model exhibited robust performance in predicting DILI, achieving cross-validation AUC scores exceeding 0.90 for both drug labels from FDA and EMA and literature abstracts from the Critical Assessment of Massive Data Analysis (CAMDA). Discussion Moreover, the text mining and XGBoost functions demonstrated in this study can be applied to other text processing and classification tasks.
Collapse
Affiliation(s)
- Minjun Chen
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| | - Yue Wu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| | - Byron Wingerd
- JMP Statistical Discovery LLC, Cary, NC, United States
| | - Zhichao Liu
- Boehringer Ingelheim Pharmaceuticals, Inc., Ridgefield, CT, United States
| | - Joshua Xu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| | - Shraddha Thakkar
- Department of Pharmaceutical Sciences, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | | | - Tom Donnelly
- JMP Statistical Discovery LLC, Cary, NC, United States
| | - Nicholas Mann
- Department of Mathematics, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Weida Tong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| | | | - Wenjun Bao
- JMP Statistical Discovery LLC, Cary, NC, United States
| |
Collapse
|
7
|
Shayegan MJ. A brief review and scientometric analysis on ensemble learning methods for handling COVID-19. Heliyon 2024; 10:e26694. [PMID: 38420425 PMCID: PMC10901105 DOI: 10.1016/j.heliyon.2024.e26694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 02/07/2024] [Accepted: 02/19/2024] [Indexed: 03/02/2024] Open
Abstract
Numerous efforts and research have been conducted worldwide to combat the coronavirus disease 2019 (COVID-19) pandemic. In this regard, some researchers have focused on deep and machine-learning approaches to discover more about this disease. There have been many articles on using ensemble learning methods for COVID-19 detection. Still, there seems to be no scientometric analysis or a brief review of these researches. Hence, a combined method of scientometric analysis and brief review was used to study the published articles that employed an ensemble learning approach to detect COVID-19. This research used both methods to overcome their limitations, leading to enhanced and reliable outcomes. The related articles were retrieved from the Scopus database. Then a two-step procedure was employed. A concise review of the collected articles was conducted. Then they underwent scientometric and bibliometric analyses. The findings revealed that convolutional neural network (CNN) is the mostly employed algorithm, while support vector machine (SVM), random forest, Resnet, DenseNet, and visual geometry group (VGG) were also frequently used. Additionally, China has had a significant presence in the numerous top-ranking categories of this field of research. Both study phases yielded valuable results and rankings.
Collapse
|
8
|
Valdez ZI, Díaz LA, Vizcardo Cornejo M, Ravelo-García AG. A Permutation Entropy analysis to determine significant daily intervals to improve risk stratification tasks from COVID patients. Biomed Phys Eng Express 2024; 10:025010. [PMID: 38198732 DOI: 10.1088/2057-1976/ad1d0a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 01/10/2024] [Indexed: 01/12/2024]
Abstract
SARS-CoV-2 infection has a wide range of clinical manifestations making its diagnosis difficult, which is an important problem to solve. We evaluated heart rate data extracted from the Stanford University database. The data set considers heart rate and step records of 118 patients, where 90 correspond to healthy individuals and 28 patients with COVID. Each daily record was divided into 5-minute segments, providing 288 data per patient. The date of symptom onset was considered as a reference point to extract subsets of data whose variability was considerable, such as 30 days before the date and 30 days after it. Each of the 60 segments of 288 data per patient was treated using Permutation Entropy, Approximate Entropy, Spectral Entropy and Singular Value Decomposition Entropy. The average of the data from each group was used to construct the circadian profiles which were analyzed using the Mann-Whitney-Wilcoxon test, determining the most relevant 5-minute segments, whose p-value was less than 0.05. In this way, the Spectral Entropy was discarded as it did not show any significantly different segment. The efficiency of the method was reflected in the performance of a logistic model for binary classification proposed in this work, which reflected an accuracy of 94.12% in the PE case, 88% in the ApEn case and 94% in the SVDE case. The proposed analysis turns out to be highly efficient when detecting significant segments that allow improving the classification tasks carried out by Machine Learning models, which provides a basis for the study of statistics such as entropy to delimit databases and improve the performance of classifier models.
Collapse
Affiliation(s)
- Zayd Isaac Valdez
- Escuela de Física, Universidad Nacional de San Agustin de Arequipa, Av. Independencia s/n, Arequipa, 04002, Arequipa, Peru
| | - Luz Alexandra Díaz
- Escuela de Física, Universidad Nacional de San Agustin de Arequipa, Av. Independencia s/n, Arequipa, 04002, Arequipa, Peru
| | - Miguel Vizcardo Cornejo
- Escuela de Física, Universidad Nacional de San Agustin de Arequipa, Av. Independencia s/n, Arequipa, 04002, Arequipa, Peru
| | - Antonio G Ravelo-García
- Departmento de Señales y Comunicaciones, Universidad Las Palmas de Gran Canaria, Campus Tafira, Las Palmas de Gran Canaria, 35017, Las Palmas, Spain
| |
Collapse
|
9
|
Ahmad Amshi H, Prasad R, Sharma BK, Yusuf SI, Sani Z. How can machine learning predict cholera: insights from experiments and design science for action research. JOURNAL OF WATER AND HEALTH 2024; 22:21-35. [PMID: 38295070 PMCID: wh_2023_026 DOI: 10.2166/wh.2023.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2024]
Abstract
Cholera is a leading cause of mortality in Nigeria. The two most significant predictors of cholera are a lack of access to clean water and poor sanitary conditions. Other factors such as natural disasters, illiteracy, and internal conflicts that drive people to seek sanctuary in refugee camps may contribute to the spread of cholera in Nigeria. The aim of this research is to develop a cholera outbreak risk prediction (CORP) model using machine learning tools and data science. In this study, we developed a CORP model using design science perspectives and machine learning to detect cholera outbreaks in Nigeria. Nonnegative matrix factorization (NMF) was used for dimensionality reduction, and synthetic minority oversampling technique (SMOTE) was used for data balancing. Outliers were detected using density-based spatial clustering of applications with noise (DBSCAN) were removed improving the overall performance of the model, and the extreme-gradient boost algorithm was used for prediction. The findings revealed that the CORP model outcomes resulted in the best accuracy of 99.62%, Matthews's correlation coefficient of 0.976, and area under the curve of 99.2%, which were improved compared with the previous findings. The developed model can be helpful to healthcare providers in predicting possible cholera outbreaks.
Collapse
Affiliation(s)
- Hauwa Ahmad Amshi
- African University of Science and Technology, Abuja, Nigeria E-mail:
| | - Rajesh Prasad
- Department of Computer Science and Engineering, Ajay Kumar Garg Engineering College, Ghaziabad, India
| | | | | | | |
Collapse
|
10
|
Adedinsewo D, Dugan J, Johnson PW, Douglass EJ, Morales-Lara AC, Parkulo MA, Ting HH, Cooper LT, Scott LR, Valverde AM, Padmanabhan D, Peters NS, Bachtiger P, Kelshiker M, Fernandez-Aviles F, Atienza F, Glotzer TV, Lahiri MK, Dominic P, Attia ZI, Kapa S, Noseworthy PA, Pereira NL, Cruz J, Berbari EF, Carter RE, Friedman PA. RApid Throughput Screening for Asymptomatic COVID-19 Infection With an Electrocardiogram: A Prospective Observational Study. MAYO CLINIC PROCEEDINGS. DIGITAL HEALTH 2023; 1:455-466. [PMID: 40206301 PMCID: PMC11975729 DOI: 10.1016/j.mcpdig.2023.07.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/11/2025]
Abstract
Objective To evaluate the ability of a neural network to identify severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection using point-of-care electrocardiography obtained with a portable device. Patient and Methods We enrolled 2827 patients in a prospective observational study, from December 10, 2020, through June 4, 2021, to determine the accuracy of a point-of-care, handheld, smartphone-compatible, artificial intelligence-enabled electrocardiography (ECG) (POC AI-ECG) in detecting asymptomatic SARS-CoV-2 infection using a modified version of an existing deep learning model framework trained on 12-lead ECG data. Results Study participants were 48% (n=1067) female, 79% (n=1749) White, and 7% (n=153) endorsed previous COVID-19 infection. We found the POC AI-ECG algorithm was ineffective for detecting asymptomatic SARS-CoV-2 infection (area under curve, 0.56; 95% CI, 0.46-0.66), failing to adequately discriminate between ECGs performed among participants who tested positive compared to those who tested negative. Conclusion Contrary to the prior 12-lead ECG study, a POC AI-ECG failed to reliably identify asymptomatic SARS-CoV-2 infection among adults. This study underscores the importance of prospective testing, assuring similar populations, and using similar signals or data when developing AI-ECG tools. Trial registration clinicaltrials.gov Identifier: NCT04725097.
Collapse
Affiliation(s)
| | - Jennifer Dugan
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | - Patrick W. Johnson
- Department of Quantitative Health Sciences, Mayo Clinic, Jacksonville, FL
| | - Erika J. Douglass
- Department of Cardiovascular Medicine, Mayo Clinic, Jacksonville, FL
| | | | - Mark A. Parkulo
- Department of Community Internal Medicine, Mayo Clinic, Jacksonville, FL
| | - Henry H. Ting
- Department of Cardiovascular Medicine, Mayo Clinic, Jacksonville, FL
| | - Leslie T. Cooper
- Department of Cardiovascular Medicine, Mayo Clinic, Jacksonville, FL
| | - Luis R. Scott
- Department of Cardiovascular Medicine, Mayo Clinic, Phoenix, AZ
| | | | - Deepak Padmanabhan
- Sri Jayadeva Institute of Cardiovascular Sciences and Research, Bangalore, India
| | - Nicholas S. Peters
- National Heart and Lung Institute and Centre for Cardiac Engineering, Imperial College London, and Imperial College Healthcare NHS Trust, United Kingdom
| | - Patrik Bachtiger
- National Heart and Lung Institute and Centre for Cardiac Engineering, Imperial College London, and Imperial College Healthcare NHS Trust, United Kingdom
| | - Mihir Kelshiker
- National Heart and Lung Institute and Centre for Cardiac Engineering, Imperial College London, and Imperial College Healthcare NHS Trust, United Kingdom
| | | | - Felipe Atienza
- Hospital General Universitario Gregorio Marañon, Madrid, Spain
| | | | - Marc K. Lahiri
- Heart and Vascular Institute, Henry Ford Hospital, Detroit, MI
| | - Paari Dominic
- Louisiana State University Health Sciences Center, Shreveport, LA
| | - Zachi I. Attia
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | - Suraj Kapa
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | | | | | - Jessica Cruz
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | - Elie F. Berbari
- Division of Public Health, Infectious Diseases and Occupational Medicine, Mayo Clinic, Rochester, MN
| | - Rickey E. Carter
- Department of Quantitative Health Sciences, Mayo Clinic, Jacksonville, FL
| | - Paul A. Friedman
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| |
Collapse
|
11
|
Tang SY, Chen TH, Kuo KL, Huang JN, Kuo CT, Chu YC. Using artificial intelligence algorithms to predict the overall survival of hemodialysis patients during the COVID-19 pandemic: A prospective cohort study. J Chin Med Assoc 2023; 86:1020-1027. [PMID: 37713313 DOI: 10.1097/jcma.0000000000000994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 09/17/2023] Open
Abstract
BACKGROUND Hemodialysis (HD) patients are a vulnerable population at high risk for severe complications from COVID-19. The impact of partial COVID-19 vaccination on the survival of HD patients remains uncertain. This prospective cohort study was designed to use artificial intelligence algorithms to predict the survival impact of partial COVID-19 vaccination in HD patients. METHODS A cohort of 433 HD patients was used to develop machine-learning models based on a subset of clinical features assessed between July 1, 2021, and April 29, 2022. The patient cohort was randomly split into training (80%) and testing (20%) sets for model development and evaluation. Machine-learning models, including categorical boosting (CatBoost), light gradient boosting machines (LightGBM), RandomForest, and extreme gradient boosting models (XGBoost), were applied to evaluate their discriminative performance using the patient cohorts. RESULTS Among these models, LightGBM achieved the highest F1 score of 0.95, followed by CatBoost, RandomForest, and XGBoost, with area under the receiver operating characteristic curve values of 0.94 on the testing dataset. The SHapley Additive explanation summary plot derived from the XGBoost model indicated that key features such as age, albumin, and vaccination details had a significant impact on survival. Furthermore, the fully vaccinated group exhibited higher levels of anti-spike (S) receptor-binding domain antibodies. CONCLUSION This prospective cohort study involved using artificial intelligence algorithms to predict overall survival in HD patients during the COVID-19 pandemic. These predictive models assisted in identifying high-risk individuals and guiding vaccination strategies for HD patients, ultimately improving overall prognosis. Further research is warranted to validate and refine these predictive models in larger and more diverse populations of HD patients.
Collapse
Affiliation(s)
- Shao-Yu Tang
- Division of Nephrology, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, Taipei, Taiwan, ROC
| | - Tz-Heng Chen
- Division of Nephrology, Department of Medicine, Taipei Veterans General Hospital, Taipei, Taiwan, ROC
- School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan, ROC
- Institute of Emergency and Critical Care Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan, ROC
| | - Ko-Lin Kuo
- Division of Nephrology, Taipei Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, Taipei, Taiwan, ROC
| | - Jue-Ni Huang
- Information Management Office, Taipei Veterans General Hospital, Taipei, Taiwan, ROC
| | - Chen-Tsung Kuo
- Information Management Office, Taipei Veterans General Hospital, Taipei, Taiwan, ROC
- Big Data Center, Taipei Veterans General Hospital, Taipei, Taiwan, ROC
- Department of Information Management, National Taipei University of Nursing and Health Sciences, Taipei, Taiwan, ROC
| | - Yuan-Chia Chu
- Information Management Office, Taipei Veterans General Hospital, Taipei, Taiwan, ROC
- Big Data Center, Taipei Veterans General Hospital, Taipei, Taiwan, ROC
- Department of Information Management, National Taipei University of Nursing and Health Sciences, Taipei, Taiwan, ROC
| |
Collapse
|
12
|
D'Auria D, Russo R, Fedele A, Addabbo F, Calvanese D. An intelligent telemonitoring application for coronavirus patients: reCOVeryaID. Front Big Data 2023; 6:1205766. [PMID: 37790086 PMCID: PMC10543687 DOI: 10.3389/fdata.2023.1205766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 08/29/2023] [Indexed: 10/05/2023] Open
Abstract
The COVID-19 emergency underscored the importance of resolving crucial issues of territorial health monitoring, such as overloaded phone lines, doctors exposed to infection, chronically ill patients unable to access hospitals, etc. In fact, it often happened that people would call doctors/hospitals just out of anxiety, not realizing that they were clogging up communications, thus causing problems for those who needed them most; such people, often elderly, have often felt lonely and abandoned by the health care system because of poor telemedicine. In addition, doctors were unable to follow up on the most serious cases or make sure that others did not worsen. Thus, uring the first pandemic wave we had the idea to design a system that could help people alleviate their fears and be constantly monitored by doctors both in hospitals and at home; consequently, we developed reCOVeryaID, a telemonitoring application for coronavirus patients. It is an autonomous application supported by a knowledge base that can react promptly and inform medical doctors if dangerous trends in the patient's short- and long-term vital signs are detected. In this paper, we also validate the knowledge-base rules in real-world settings by testing them on data from real patients infected with COVID-19.
Collapse
Affiliation(s)
- Daniela D'Auria
- Faculty of Engineering, Free University of Bozen-Bolzano, Bolzano, Italy
| | | | | | | | - Diego Calvanese
- Faculty of Engineering, Free University of Bozen-Bolzano, Bolzano, Italy
- Department of Computing Science, Umeå University, Umeå, Sweden
| |
Collapse
|
13
|
Kablan R, Miller HA, Suliman S, Frieboes HB. Evaluation of stacked ensemble model performance to predict clinical outcomes: A COVID-19 study. Int J Med Inform 2023; 175:105090. [PMID: 37172507 PMCID: PMC10165871 DOI: 10.1016/j.ijmedinf.2023.105090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 04/17/2023] [Accepted: 05/03/2023] [Indexed: 05/15/2023]
Abstract
BACKGROUND The application of machine learning (ML) to analyze clinical data with the goal to predict patient outcomes has garnered increasing attention. Ensemble learning has been used in conjunction with ML to improve predictive performance. Although stacked generalization (stacking), a type of heterogeneous ensemble of ML models, has emerged in clinical data analysis, it remains unclear how to define the best model combinations for strong predictive performance. This study develops a methodology to evaluate the performance of "base" learner models and their optimized combination using "meta" learner models in stacked ensembles to accurately assess performance in the context of clinical outcomes. METHODS De-identified COVID-19 data was obtained from the University of Louisville Hospital, where a retrospective chart review was performed from March 2020 to November 2021. Three differently-sized subsets using features from the overall dataset were chosen to train and evaluate ensemble classification performance. The number of base learners chosen from several algorithm families coupled with a complementary meta learner was varied from a minimum of 2 to a maximum of 8. Predictive performance of these combinations was evaluated in terms of mortality and severe cardiac event outcomes using area-under-the-receiver-operating-characteristic (AUROC), F1, balanced accuracy, and kappa. RESULTS The results highlight the potential to accurately predict clinical outcomes, such as severe cardiac events with COVID-19, from routinely acquired in-hospital patient data. Meta learners Generalized Linear Model (GLM), Multi-Layer Perceptron (MLP), and Partial Least Squares (PLS) had the highest AUROC for both outcomes, while K-Nearest Neighbors (KNN) had the lowest. Performance trended lower in the training set as the number of features increased, and exhibited less variance in both training and validation across all feature subsets as the number of base learners increased. CONCLUSION This study offers a methodology to robustly evaluate ensemble ML performance when analyzing clinical data.
Collapse
Affiliation(s)
- Rianne Kablan
- Department of Bioengineering, University of Louisville, Louisville, KY, USA
| | - Hunter A Miller
- Department of Bioengineering, University of Louisville, Louisville, KY, USA
| | | | - Hermann B Frieboes
- Department of Bioengineering, University of Louisville, Louisville, KY, USA; James Graham Brown Cancer Center, University of Louisville, Louisville, KY, USA; Center for Predictive Medicine, University of Louisville, Louisville, KY, USA.
| |
Collapse
|
14
|
Mondal R, Do MD, Ahmed NU, Walke D, Micheel D, Broneske D, Saake G, Heyer R. Decision tree learning in Neo4j on homogeneous and unconnected graph nodes from biological and clinical datasets. BMC Med Inform Decis Mak 2023; 22:347. [PMID: 36879243 PMCID: PMC9988195 DOI: 10.1186/s12911-023-02112-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Accepted: 01/13/2023] [Indexed: 03/08/2023] Open
Abstract
BACKGROUND Graph databases enable efficient storage of heterogeneous, highly-interlinked data, such as clinical data. Subsequently, researchers can extract relevant features from these datasets and apply machine learning for diagnosis, biomarker discovery, or understanding pathogenesis. METHODS To facilitate machine learning and save time for extracting data from the graph database, we developed and optimized Decision Tree Plug-in (DTP) containing 24 procedures to generate and evaluate decision trees directly in the graph database Neo4j on homogeneous and unconnected nodes. RESULTS Creation of the decision tree for three clinical datasets directly in the graph database from the nodes required between 0.059 and 0.099 s, while calculating the decision tree with the same algorithm in Java from CSV files took 0.085-0.112 s. Furthermore, our approach was faster than the standard decision tree implementations in R (0.62 s) and equal to Python (0.08 s), also using CSV files as input for small datasets. In addition, we have explored the strengths of DTP by evaluating a large dataset (approx. 250,000 instances) to predict patients with diabetes and compared the performance against algorithms generated by state-of-the-art packages in R and Python. By doing so, we have been able to show competitive results on the performance of Neo4j, in terms of quality of predictions as well as time efficiency. Furthermore, we could show that high body-mass index and high blood pressure are the main risk factors for diabetes. CONCLUSION Overall, our work shows that integrating machine learning into graph databases saves time for additional processes as well as external memory, and could be applied to a variety of use cases, including clinical applications. This provides user with the advantages of high scalability, visualization and complex querying.
Collapse
Affiliation(s)
- Rahul Mondal
- Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany.
| | - Minh Dung Do
- Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
| | - Nasim Uddin Ahmed
- Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
| | - Daniel Walke
- Faculty of Process and Systems Engineering, Otto-von-Guericke-University Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
| | - Daniel Micheel
- German Centre for Higher Education Research and Science Studies, Lange Laube 12, 30159, Hannover, Germany
| | - David Broneske
- German Centre for Higher Education Research and Science Studies, Lange Laube 12, 30159, Hannover, Germany
- Research Group Databases and Software Engineering, Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
| | - Gunter Saake
- Research Group Databases and Software Engineering, Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
| | - Robert Heyer
- Research Group Databases and Software Engineering, Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
| |
Collapse
|
15
|
Altantawy DA, Kishk SS. Equilibrium-based COVID-19 diagnosis from routine blood tests: A sparse deep convolutional model. EXPERT SYSTEMS WITH APPLICATIONS 2023; 213:118935. [PMID: 36210961 PMCID: PMC9527205 DOI: 10.1016/j.eswa.2022.118935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 06/21/2022] [Accepted: 09/27/2022] [Indexed: 06/16/2023]
Abstract
SARS-CoV2 (COVID-19) is the virus that causes the pandemic that has severely impacted human society with a massive death toll worldwide. Hence, there is a persistent need for fast and reliable automatic tools to help health teams in making clinical decisions. Predictive models could potentially ease the strain on healthcare systems by early and reliable screening of COVID-19 patients which helps to combat the spread of the disease. Recent studies have reported some key advantages of employing routine blood tests for initial screening of COVID-19 patients. Thus, in this paper, we propose a novel COVID-19 prediction model based on routine blood tests. In this model, we depend on exploiting the real dependency among the employed feature pool by a sparsification procedure. In this sparse domain, a hybrid feature selection mechanism is proposed. This mechanism fuses the selected features from two perspectives, the first is Pearson correlation and the second is a new Minkowski-based equilibrium optimizer (MEO). Then, the selected features are fed into a new 1D Convolutional Neural Network (1DCNN) for a final diagnosis decision. The proposed prediction model is tested with a new public dataset from San Raphael Hospital, Milan, Italy, i.e., OSR dataset which has two sub-datasets. According to the experimental results, the proposed model outperforms the state-of-the-art techniques with an average testing accuracy of 98.5% while we employ only less than half the size of the feature pool, i.e., we need only less than half the given blood tests in the employed dataset to get a final diagnosis decision.
Collapse
Affiliation(s)
- Doaa A Altantawy
- Electronics and Communications Engineering Department, Faculty of Engineering, Mansoura University, 60 El-Gomhoria Street, Mansoura, Egypt
| | - Sherif S Kishk
- Electronics and Communications Engineering Department, Faculty of Engineering, Mansoura University, 60 El-Gomhoria Street, Mansoura, Egypt
| |
Collapse
|
16
|
Paul SG, Saha A, Biswas AA, Zulfiker MS, Arefin MS, Rahman MM, Reza AW. Combating Covid-19 using machine learning and deep learning: Applications, challenges, and future perspectives. ARRAY 2023; 17:100271. [PMID: 36530931 PMCID: PMC9737520 DOI: 10.1016/j.array.2022.100271] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 12/05/2022] [Accepted: 12/07/2022] [Indexed: 12/14/2022] Open
Abstract
COVID-19, a worldwide pandemic that has affected many people and thousands of individuals have died due to COVID-19, during the last two years. Due to the benefits of Artificial Intelligence (AI) in X-ray image interpretation, sound analysis, diagnosis, patient monitoring, and CT image identification, it has been further researched in the area of medical science during the period of COVID-19. This study has assessed the performance and investigated different machine learning (ML), deep learning (DL), and combinations of various ML, DL, and AI approaches that have been employed in recent studies with diverse data formats to combat the problems that have arisen due to the COVID-19 pandemic. Finally, this study shows the comparison among the stand-alone ML and DL-based research works regarding the COVID-19 issues with the combinations of ML, DL, and AI-based research works. After in-depth analysis and comparison, this study responds to the proposed research questions and presents the future research directions in this context. This review work will guide different research groups to develop viable applications based on ML, DL, and AI models, and will also guide healthcare institutes, researchers, and governments by showing them how these techniques can ease the process of tackling the COVID-19.
Collapse
Affiliation(s)
- Showmick Guha Paul
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Arpa Saha
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Al Amin Biswas
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh,Corresponding author
| | - Md. Sabab Zulfiker
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Mohammad Shamsul Arefin
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh,Department of Computer Science and Engineering, Chittagong University of Engineering and Technology, Chittagong, Bangladesh
| | - Md. Mahfujur Rahman
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Ahmed Wasif Reza
- Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
| |
Collapse
|
17
|
Zhong F, Zhao W, Wang L, Shen Y. Clinical application of Mycobacterium RT-PCR assay using various specimens for the rapid detection of lymph node tuberculosis: A diagnostic accuracy study. Medicine (Baltimore) 2023; 102:e33065. [PMID: 36827006 PMCID: PMC11309714 DOI: 10.1097/md.0000000000033065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 01/26/2023] [Accepted: 02/01/2023] [Indexed: 02/25/2023] Open
Abstract
To evaluate the diagnostic accuracy of the Capital Bio Mycobacterium real-time polymerase chain reaction assay Capital Bio assay for lymph node (LN) tuberculosis (LNTB), and to further compare the effect of different types of LN specimens on the detection capability of the test. We retrospectively analyzed the medical records of LNTB patients who met the inclusion criteria. The sensitivity, specificity, positive predictive value, negative predictive value, and area under the curve of Capital Bio assay were calculated to evaluate its diagnostic accuracy compared with the final clinical diagnosis as reference standard. Three hundred sixty-four patients were included in the study. The overall sensitivity, specificity, positive predictive value, negative predictive value, and area under the curve of the Capital Bio assay for LNTB were 74.4%, 100.0%, 100.0%, 34.9%, and 0.87, respectively. For the pus specimens, these values for Capital Bio assay were 93.2%, 100.0%, 100.0%, 27.3%, 0.97, respectively. For the core needle biopsy specimens, these values were 65.9%, 100.0%, 100.0%, 33.3%, and 0.83, respectively. For the fine-needle aspiration specimens, these values were 60.0%, 100.0%, 100.0%, 53.9%, and 0.80, respectively. For the tissue, these values were 59.3%, 100.0%, 100.0%, 33.3%, 0.80, respectively. The Capital Bio assay had good effective for the diagnosis of LNTB. Compared to LN fine-needle aspiration and core needle biopsy specimens and tissue specimens, pus specimens were more suitable for molecular testing and had the best diagnostic efficacy.
Collapse
Affiliation(s)
- Fangming Zhong
- Zhejiang Tuberculosis Diagnosis and Treatment Center, Affiliated Hangzhou Chest Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Wuchen Zhao
- Zhejiang Tuberculosis Diagnosis and Treatment Center, Affiliated Hangzhou Chest Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Linhua Wang
- Department of Hospital Infection, Affiliated Hangzhou Chest Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Yi Shen
- Operation Center, Affiliated Hangzhou Chest Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| |
Collapse
|
18
|
Using machine learning on clinical data to identify unexpected patterns in groups of COVID-19 patients. Sci Rep 2023; 13:2236. [PMID: 36755135 PMCID: PMC9906583 DOI: 10.1038/s41598-022-26294-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 12/13/2022] [Indexed: 02/10/2023] Open
Abstract
As clinicians are faced with a deluge of clinical data, data science can play an important role in highlighting key features driving patient outcomes, aiding in the development of new clinical hypotheses. Insight derived from machine learning can serve as a clinical support tool by connecting care providers with reliable results from big data analysis that identify previously undetected clinical patterns. In this work, we show an example of collaboration between clinicians and data scientists during the COVID-19 pandemic, identifying sub-groups of COVID-19 patients with unanticipated outcomes or who are high-risk for severe disease or death. We apply a random forest classifier model to predict adverse patient outcomes early in the disease course, and we connect our classification results to unsupervised clustering of patient features that may underpin patient risk. The paradigm for using data science for hypothesis generation and clinical decision support, as well as our triaged classification approach and unsupervised clustering methods to determine patient cohorts, are applicable to driving rapid hypothesis generation and iteration in a variety of clinical challenges, including future public health crises.
Collapse
|
19
|
Carvalho RMS, Oliveira D, Pesquita C. Knowledge Graph Embeddings for ICU readmission prediction. BMC Med Inform Decis Mak 2023; 23:12. [PMID: 36658526 PMCID: PMC9850812 DOI: 10.1186/s12911-022-02070-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 11/28/2022] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND Intensive Care Unit (ICU) readmissions represent both a health risk for patients,with increased mortality rates and overall health deterioration, and a financial burden for healthcare facilities. As healthcare became more data-driven with the introduction of Electronic Health Records (EHR), machine learning methods have been applied to predict ICU readmission risk. However, these methods disregard the meaning and relationships of data objects and work blindly over clinical data without taking into account scientific knowledge and context. Ontologies and Knowledge Graphs can help bridge this gap between data and scientific context, as they are computational artefacts that represent the entities of a domain and their relationships to each other in a formalized way. METHODS AND RESULTS We have developed an approach that enriches EHR data with semantic annotations to ontologies to build a Knowledge Graph. A patient's ICU stay is represented by Knowledge Graph embeddings in a contextualized manner, which are used by machine learning models to predict 30-days ICU readmissions. This approach is based on several contributions: (1) an enrichment of the MIMIC-III dataset with patient-oriented annotations to various biomedical ontologies; (2) a Knowledge Graph that defines patient data with biomedical ontologies; (3) a predictive model of ICU readmission risk that uses Knowledge Graph embeddings; (4) a variant of the predictive model that targets different time points during an ICU stay. Our predictive approaches outperformed both a baseline and state-of-the-art works achieving a mean Area Under the Receiver Operating Characteristic Curve of 0.827 and an Area Under the Precision-Recall Curve of 0.691. The application of this novel approach to help clinicians decide whether a patient can be discharged has the potential to prevent the readmission of [Formula: see text] of Intensive Care Unit patients, without unnecessarily prolonging the stay of those who would not require it. CONCLUSION The coupling of semantic annotation and Knowledge Graph embeddings affords two clear advantages: they consider scientific context and they are able to build representations of EHR information of different types in a common format. This work demonstrates the potential for impact that integrating ontologies and Knowledge Graphs into clinical machine learning applications can have.
Collapse
Affiliation(s)
- Ricardo M. S. Carvalho
- grid.9983.b0000 0001 2181 4263LASIGE, Faculty of Sciences, University of Lisbon, Lisbon, Portugal
| | - Daniela Oliveira
- grid.9983.b0000 0001 2181 4263LASIGE, Faculty of Sciences, University of Lisbon, Lisbon, Portugal
| | - Catia Pesquita
- grid.9983.b0000 0001 2181 4263LASIGE, Faculty of Sciences, University of Lisbon, Lisbon, Portugal
| |
Collapse
|
20
|
Jeyananthan P. SARS-CoV-2 Diagnosis Using Transcriptome Data: A Machine Learning Approach. SN COMPUTER SCIENCE 2023; 4:218. [PMID: 36844504 PMCID: PMC9936926 DOI: 10.1007/s42979-023-01703-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 01/24/2023] [Indexed: 05/02/2023]
Abstract
SARS-CoV-2 pandemic is the big issue of the whole world right now. The health community is struggling to rescue the public and countries from this spread, which revives time to time with different waves. Even the vaccination seems to be not prevents this spread. Accurate identification of infected people on time is essential these days to control the spread. So far, Polymerase chain reaction (PCR) and rapid antigen tests are widely used in this identification, accepting their own drawbacks. False negative cases are the menaces in this scenario. To avoid these problems, this study uses machine learning techniques to build a classification model with higher accuracy to filter the COVID-19 cases from the non-COVID individuals. Transcriptome data of the SARS-CoV-2 patients along with the control are used in this stratification using three different feature selection algorithms and seven classification models. Differently expressed genes also studied between these two groups of people and used in this classification. Results shows that mutual information (or DEGs) along with naïve Bayes (or SVM) gives the best accuracy (0.98 ± 0.04) among these methods. Supplementary Information The online version contains supplementary material available at 10.1007/s42979-023-01703-6.
Collapse
|
21
|
Ershadi MM, Rise ZR. Fusing clinical and image data for detecting the severity level of hospitalized symptomatic COVID-19 patients using hierarchical model. RESEARCH ON BIOMEDICAL ENGINEERING 2023; 39:209-232. [PMCID: PMC9957693 DOI: 10.1007/s42600-023-00268-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 02/08/2023] [Indexed: 02/05/2024]
Abstract
Purpose Based on medical reports, it is hard to find levels of different hospitalized symptomatic COVID-19 patients according to their features in a short time. Besides, there are common and special features for COVID-19 patients at different levels based on physicians’ knowledge that make diagnosis difficult. For this purpose, a hierarchical model is proposed in this paper based on experts’ knowledge, fuzzy C-mean (FCM) clustering, and adaptive neuro-fuzzy inference system (ANFIS) classifier. Methods Experts considered a special set of features for different groups of COVID-19 patients to find their treatment plans. Accordingly, the structure of the proposed hierarchical model is designed based on experts’ knowledge. In the proposed model, we applied clustering methods to patients’ data to determine some clusters. Then, we learn classifiers for each cluster in a hierarchical model. Regarding different common and special features of patients, FCM is considered for the clustering method. Besides, ANFIS had better performances than other classification methods. Therefore, FCM and ANFIS were considered to design the proposed hierarchical model. FCM finds the membership degree of each patient’s data based on common and special features of different clusters to reinforce the ANFIS classifier. Next, ANFIS identifies the need of hospitalized symptomatic COVID-19 patients to ICU and to find whether or not they are in the end-stage (mortality target class). Two real datasets about COVID-19 patients are analyzed in this paper using the proposed model. One of these datasets had only clinical features and another dataset had both clinical and image features. Therefore, some appropriate features are extracted using some image processing and deep learning methods. Results According to the results and statistical test, the proposed model has the best performance among other utilized classifiers. Its accuracies based on clinical features of the first and second datasets are 92% and 90% to find the ICU target class. Extracted features of image data increase the accuracy by 94%. Conclusion The accuracy of this model is even better for detecting the mortality target class among different classifiers in this paper and the literature review. Besides, this model is compatible with utilized datasets about COVID-19 patients based on clinical data and both clinical and image data, as well. Highlights • A new hierarchical model is proposed using ANFIS classifiers and FCM clustering method in this paper. Its structure is designed based on experts’ knowledge and real medical process. FCM reinforces the ANFIS classification learning phase based on the features of COVID-19 patients. • Two real datasets about COVID-19 patients are studied in this paper. One of these datasets has both clinical and image data. Therefore, appropriate features are extracted based on its image data and considered with available meaningful clinical data. Different levels of hospitalized symptomatic COVID-19 patients are considered in this paper including the need of patients to ICU and whether or not they are in end-stage. • Well-known classification methods including case-based reasoning (CBR), decision tree, convolutional neural networks (CNN), K-nearest neighbors (KNN), learning vector quantization (LVQ), multi-layer perceptron (MLP), Naive Bayes (NB), radial basis function network (RBF), support vector machine (SVM), recurrent neural networks (RNN), fuzzy type-I inference system (FIS), and adaptive neuro-fuzzy inference system (ANFIS) are designed for these datasets and their results are analyzed for different random groups of the train and test data; • According to unbalanced utilized datasets, different performances of classifiers including accuracy, sensitivity, specificity, precision, F-score, and G-mean are compared to find the best classifier. ANFIS classifiers have the best results for both datasets. • To reduce the computational time, the effects of the Principal Component Analysis (PCA) feature reduction method are studied on the performances of the proposed model and classifiers. According to the results and statistical test, the proposed hierarchical model has the best performances among other utilized classifiers. Graphical Abstract Supplementary Information The online version contains supplementary material available at 10.1007/s42600-023-00268-w.
Collapse
Affiliation(s)
- Mohammad Mahdi Ershadi
- Department of Industrial Engineering and Management Systems, Amirkabir University of Technology, No. 350, Hafez Ave, Valiasr Square, Tehran, 1591634311 Iran
| | - Zeinab Rahimi Rise
- Department of Industrial Engineering and Management Systems, Amirkabir University of Technology, No. 350, Hafez Ave, Valiasr Square, Tehran, 1591634311 Iran
| |
Collapse
|
22
|
Marques JG, Guedes LA, da Costa Abreu MC. Evaluating Time Influence over Performance of Machine-Learning-Based Diagnosis: A Case Study of COVID-19 Pandemic in Brazil. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 20:136. [PMID: 36612458 PMCID: PMC9819042 DOI: 10.3390/ijerph20010136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 12/13/2022] [Accepted: 12/15/2022] [Indexed: 06/17/2023]
Abstract
Efficiently recognising severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) symptoms enables a quick and accurate diagnosis to be made, and helps in mitigating the spread of the coronavirus disease 2019. However, the emergence of new variants has caused constant changes in the symptoms associate with COVID-19. These constant changes directly impact the performance of machine-learning-based diagnose. In this context, considering the impact of these changes in symptoms over time is necessary for accurate diagnoses. Thus, in this study, we propose a machine-learning-based approach for diagnosing COVID-19 that considers the importance of time in model predictions. Our approach analyses the performance of XGBoost using two different time-based strategies for model training: month-to-month and accumulated strategies. The model was evaluated using known metrics: accuracy, precision, and recall. Furthermore, to explain the impact of feature changes on model prediction, feature importance was measured using the SHAP technique, an XAI technique. We obtained very interesting results: considering time when creating a COVID-19 diagnostic prediction model is advantageous.
Collapse
Affiliation(s)
- Julliana Gonçalves Marques
- Department of Informatics and Applied Mathematics, Federal University of Rio Grande do Norte, Natal 59078-970, Brazil
| | - Luiz Affonso Guedes
- Department of Computer Engineering and Automation, Federal University of Rio Grande do Norte, Natal 59078-970, Brazil
| | | |
Collapse
|
23
|
Martínez JA, Alonso-Bernáldez M, Martínez-Urbistondo D, Vargas-Nuñez JA, Ramírez de Molina A, Dávalos A, Ramos-Lopez O. Machine learning insights concerning inflammatory and liver-related risk comorbidities in non-communicable and viral diseases. World J Gastroenterol 2022; 28:6230-6248. [PMID: 36504554 PMCID: PMC9730439 DOI: 10.3748/wjg.v28.i44.6230] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Revised: 10/07/2022] [Accepted: 11/16/2022] [Indexed: 11/25/2022] Open
Abstract
The liver is a key organ involved in a wide range of functions, whose damage can lead to chronic liver disease (CLD). CLD accounts for more than two million deaths worldwide, becoming a social and economic burden for most countries. Among the different factors that can cause CLD, alcohol abuse, viruses, drug treatments, and unhealthy dietary patterns top the list. These conditions prompt and perpetuate an inflammatory environment and oxidative stress imbalance that favor the development of hepatic fibrogenesis. High stages of fibrosis can eventually lead to cirrhosis or hepatocellular carcinoma (HCC). Despite the advances achieved in this field, new approaches are needed for the prevention, diagnosis, treatment, and prognosis of CLD. In this context, the scientific com-munity is using machine learning (ML) algorithms to integrate and process vast amounts of data with unprecedented performance. ML techniques allow the integration of anthropometric, genetic, clinical, biochemical, dietary, lifestyle and omics data, giving new insights to tackle CLD and bringing personalized medicine a step closer. This review summarizes the investigations where ML techniques have been applied to study new approaches that could be used in inflammatory-related, hepatitis viruses-induced, and coronavirus disease 2019-induced liver damage and enlighten the factors involved in CLD development.
Collapse
Affiliation(s)
- J Alfredo Martínez
- Precision Nutrition and Cardiometabolic Health, Madrid Institute of Advanced Studies-Food Institute, Madrid 28049, Spain
| | - Marta Alonso-Bernáldez
- Precision Nutrition and Cardiometabolic Health, Madrid Institute of Advanced Studies-Food Institute, Madrid 28049, Spain
| | | | - Juan A Vargas-Nuñez
- Servicio de Medicina Interna, Hospital Universitario Puerta de Hierro Majadahonda, Madrid 28222, Majadahonda, Spain
| | - Ana Ramírez de Molina
- Molecular Oncology and Nutritional Genomics of Cancer, Madrid Institute of Advanced Studies-Food Institute, Madrid 28049, Spain
| | - Alberto Dávalos
- Laboratory of Epigenetics of Lipid Metabolism, Madrid Institute of Advanced Studies-Food Institute, Madrid 28049, Spain
| | - Omar Ramos-Lopez
- Medicine and Psychology School, Autonomous University of Baja California, Tijuana 22390, Baja California, Mexico
| |
Collapse
|
24
|
Reagan KL, Pires J, Quach N, Gilor C. Evaluation of a machine learning tool to screen for hypoadrenocorticism in dogs presenting to a teaching hospital. J Vet Intern Med 2022; 36:1942-1946. [PMID: 36259689 DOI: 10.1111/jvim.16566] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Accepted: 09/23/2022] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND Dogs with hypoadrenocorticism (HA) have clinical signs and clinicopathologic abnormalities that can be mistaken as other diseases. In dogs with a differential diagnosis of HA, a machine learning model (MLM) has been validated to discriminate between HA and other diseases. This MLM has not been evaluated as a screening tool for a broader group of dogs. HYPOTHESIS An MLM can accurately screen dogs for HA. ANIMALS Dogs (n = 1025) examined at a veterinary hospital. METHODS Dogs that presented to a tertiary referral hospital that had a CBC and serum chemistry panel were enrolled. A trained MLM was applied to clinicopathologic data and in dogs that were MLM positive for HA, diagnosis was confirmed by measurement of serum cortisol. RESULTS Twelve dogs were MLM positive for HA and had further cortisol testing. Five had HA confirmed (true positive), 4 of which were treated for mineralocorticoid and glucocorticoid deficiency, and 1 was treated for glucocorticoid deficiency alone. Three MLM positive dogs had baseline cortisol ≤2 μg/dL but were euthanized or administered glucocorticoid treatment without confirming the diagnosis with an ACTH-stimulation test (classified as "undetermined"), and in 4, HA was ruled out (false positives). The positive likelihood ratio of the MLM was 145 to 254. All dogs diagnosed with HA by attending clinicians tested positive by the MLM. CONCLUSIONS AND CLINICAL IMPORTANCE This MLM can robustly predict HA status when indiscriminately screening all dogs with blood work. In this group of dogs with a low prevalence of HA, the false positive rates were clinically acceptable.
Collapse
Affiliation(s)
- Krystle L Reagan
- Department of Medicine and Epidemiology, School of Veterinary Medicine, University of California-Davis, Davis, California, USA
| | - Jully Pires
- Veterinary Medical Teaching Hospital, School of Veterinary Medicine, University of California-Davis, Davis, California, USA
| | - Nina Quach
- Veterinary Medical Teaching Hospital, School of Veterinary Medicine, University of California-Davis, Davis, California, USA
| | - Chen Gilor
- Department of Small Animal Clinical sciences, College of Veterinary Medicine, University of Florida, Gainesville, Florida, USA
| |
Collapse
|
25
|
Sinwar D, Dhaka VS, Tesfaye BA, Raghuwanshi G, Kumar A, Maakar SK, Agrawal S. Artificial Intelligence and Deep Learning Assisted Rapid Diagnosis of COVID-19 from Chest Radiographical Images: A Survey. CONTRAST MEDIA & MOLECULAR IMAGING 2022; 2022:1306664. [PMID: 36304775 PMCID: PMC9581633 DOI: 10.1155/2022/1306664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 09/06/2022] [Accepted: 09/27/2022] [Indexed: 01/26/2023]
Abstract
Artificial Intelligence (AI) has been applied successfully in many real-life domains for solving complex problems. With the invention of Machine Learning (ML) paradigms, it becomes convenient for researchers to predict the outcome based on past data. Nowadays, ML is acting as the biggest weapon against the COVID-19 pandemic by detecting symptomatic cases at an early stage and warning people about its futuristic effects. It is observed that COVID-19 has blown out globally so much in a short period because of the shortage of testing facilities and delays in test reports. To address this challenge, AI can be effectively applied to produce fast as well as cost-effective solutions. Plenty of researchers come up with AI-based solutions for preliminary diagnosis using chest CT Images, respiratory sound analysis, voice analysis of symptomatic persons with asymptomatic ones, and so forth. Some AI-based applications claim good accuracy in predicting the chances of being COVID-19-positive. Within a short period, plenty of research work is published regarding the identification of COVID-19. This paper has carefully examined and presented a comprehensive survey of more than 110 papers that came from various reputed sources, that is, Springer, IEEE, Elsevier, MDPI, arXiv, and medRxiv. Most of the papers selected for this survey presented candid work to detect and classify COVID-19, using deep-learning-based models from chest X-Rays and CT scan images. We hope that this survey covers most of the work and provides insights to the research community in proposing efficient as well as accurate solutions for fighting the pandemic.
Collapse
Affiliation(s)
- Deepak Sinwar
- Department of Computer and Communication Engineering, Manipal University Jaipur, Jaipur, India
| | - Vijaypal Singh Dhaka
- Department of Computer and Communication Engineering, Manipal University Jaipur, Jaipur, India
| | - Biniyam Alemu Tesfaye
- Department of Computer Science, College of Informatics, Bule Hora University, Bule Hora, Ethiopia
| | - Ghanshyam Raghuwanshi
- Department of Computer and Communication Engineering, Manipal University Jaipur, Jaipur, India
| | - Ashish Kumar
- Department of Mathematics and Statistics, Manipal University Jaipur, Jaipur, India
| | - Sunil Kr. Maakar
- School of Computing Science & Engineering, Galgotias University, Greater Noida, India
| | - Sanjay Agrawal
- Department of Electrical Engineering, Rajkiya Engineering College, Akbarpur, Ambedkar Nagar, India
| |
Collapse
|
26
|
Molina-Mora JA, González A, Jiménez-Morgan S, Cordero-Laurent E, Brenes H, Soto-Garita C, Sequeira-Soto J, Duarte-Martínez F. Clinical Profiles at the Time of Diagnosis of SARS-CoV-2 Infection in Costa Rica During the Pre-vaccination Period Using a Machine Learning Approach. PHENOMICS (CHAM, SWITZERLAND) 2022; 2:312-322. [PMID: 35692458 PMCID: PMC9173838 DOI: 10.1007/s43657-022-00058-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 04/22/2022] [Accepted: 04/27/2022] [Indexed: 04/16/2023]
Abstract
The clinical manifestations of COVID-19, caused by the SARS-CoV-2, define a large spectrum of symptoms that are mainly dependent on the human host conditions. In Costa Rica, more than 169,000 cases and 2185 deaths were reported during the year 2020, the pre-vaccination period. To describe the clinical presentations at the time of diagnosis of SARS-CoV-2 infection in Costa Rica during the pre-vaccination period, we implemented a symptom-based clustering using machine learning to identify clusters or clinical profiles at the population level among 18,974 records of positive cases. Profiles were compared based on symptoms, risk factors, viral load, and genomic features of the SARS-CoV-2 sequence. A total of 18 symptoms at time of diagnosis of SARS-CoV-2 infection were reported with a frequency > 1%, and those were used to identify seven clinical profiles with a specific composition of clinical manifestations. In the comparison between clusters, a lower viral load was found for the asymptomatic group, while the risk factors and the SARS-CoV-2 genomic features were distributed among all the clusters. No other distribution patterns were found for age, sex, vital status, and hospitalization. In conclusion, during the pre-vaccination time in Costa Rica, the symptoms at the time of diagnosis of SARS-CoV-2 infection were described in clinical profiles. The host co-morbidities and the SARS-CoV-2 genotypes are not specific of a particular profile, rather they are present in all the groups, including asymptomatic cases. In addition, this information can be used for decision-making by the local healthcare institutions (first point of contact with health professionals, case definition, or infrastructure). In further analyses, these results will be compared against the profiles of cases during the vaccination period. Supplementary Information The online version contains supplementary material available at 10.1007/s43657-022-00058-x.
Collapse
Affiliation(s)
- Jose Arturo Molina-Mora
- Centro de Investigación en Enfermedades Tropicales (CIET) and Facultad de Microbiología, Universidad de Costa Rica, San José, 2060 Costa Rica
| | - Alejandra González
- Instituto Costarricense de Investigación y Enseñanza en Nutrición y Salud (INCIENSA), Tres Ríos, 30301 Costa Rica
| | | | - Estela Cordero-Laurent
- Instituto Costarricense de Investigación y Enseñanza en Nutrición y Salud (INCIENSA), Tres Ríos, 30301 Costa Rica
| | - Hebleen Brenes
- Instituto Costarricense de Investigación y Enseñanza en Nutrición y Salud (INCIENSA), Tres Ríos, 30301 Costa Rica
| | - Claudio Soto-Garita
- Instituto Costarricense de Investigación y Enseñanza en Nutrición y Salud (INCIENSA), Tres Ríos, 30301 Costa Rica
| | - Jorge Sequeira-Soto
- Instituto Costarricense de Investigación y Enseñanza en Nutrición y Salud (INCIENSA), Tres Ríos, 30301 Costa Rica
| | - Francisco Duarte-Martínez
- Instituto Costarricense de Investigación y Enseñanza en Nutrición y Salud (INCIENSA), Tres Ríos, 30301 Costa Rica
| |
Collapse
|
27
|
Martinez-Millana A, Saez-Saez A, Tornero-Costa R, Azzopardi-Muscat N, Traver V, Novillo-Ortiz D. Artificial intelligence and its impact on the domains of universal health coverage, health emergencies and health promotion: An overview of systematic reviews. Int J Med Inform 2022; 166:104855. [PMID: 35998421 PMCID: PMC9551134 DOI: 10.1016/j.ijmedinf.2022.104855] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Revised: 08/01/2022] [Accepted: 08/11/2022] [Indexed: 12/04/2022]
Abstract
BACKGROUND Artificial intelligence is fueling a new revolution in medicine and in the healthcare sector. Despite the growing evidence on the benefits of artificial intelligence there are several aspects that limit the measure of its impact in people's health. It is necessary to assess the current status on the application of AI towards the improvement of people's health in the domains defined by WHO's Thirteenth General Programme of Work (GPW13) and the European Programme of Work (EPW), to inform about trends, gaps, opportunities, and challenges. OBJECTIVE To perform a systematic overview of systematic reviews on the application of artificial intelligence in the people's health domains as defined in the GPW13 and provide a comprehensive and updated map on the application specialties of artificial intelligence in terms of methodologies, algorithms, data sources, outcomes, predictors, performance, and methodological quality. METHODS A systematic search in MEDLINE, EMBASE, Cochrane and IEEEXplore was conducted between January 2015 and June 2021 to collect systematic reviews using a combination of keywords related to the domains of universal health coverage, health emergencies protection, and better health and wellbeing as defined by the WHO's PGW13 and EPW. Eligibility criteria was based on methodological quality and the inclusion of practical implementation of artificial intelligence. Records were classified and labeled using ICD-11 categories into the domains of the GPW13. Descriptors related to the area of implementation, type of modeling, data entities, outcomes and implementation on care delivery were extracted using a structured form and methodological aspects of the included reviews studies was assessed using the AMSTAR checklist. RESULTS The search strategy resulted in the screening of 815 systematic reviews from which 203 were assessed for eligibility and 129 were included in the review. The most predominant domain for artificial intelligence applications was Universal Health Coverage (N = 98) followed by Health Emergencies (N = 16) and Better Health and Wellbeing (N = 15). Neoplasms area on Universal Health Coverage was the disease area featuring most of the applications (21.7 %, N = 28). The reviews featured analytics primarily over both public and private data sources (67.44 %, N = 87). The most used type of data was medical imaging (31.8 %, N = 41) and predictors based on regions of interest and clinical data. The most prominent subdomain of Artificial Intelligence was Machine Learning (43.4 %, N = 56), in which Support Vector Machine method was predominant (20.9 %, N = 27). Regarding the purpose, the application of Artificial Intelligence I is focused on the prediction of the diseases (36.4 %, N = 47). With respect to the validation, more than a half of the reviews (54.3 %, N = 70) did not report a validation procedure and, whenever available, the main performance indicator was the accuracy (28.7 %, N = 37). According to the methodological quality assessment, a third of the reviews (34.9 %, N = 45) implemented methods for analysis the risk of bias and the overall AMSTAR score below was 5 (4.01 ± 1.93) on all the included systematic reviews. CONCLUSION Artificial intelligence is being used for disease modelling, diagnose, classification and prediction in the three domains of GPW13. However, the evidence is often limited to laboratory and the level of adoption is largely unbalanced between ICD-11 categoriesand diseases. Data availability is a determinant factor on the developmental stage of artificial intelligence applications. Most of the reviewed studies show a poor methodological quality and are at high risk of bias, which limits the reproducibility of the results and the reliability of translating these applications to real clinical scenarios. The analyzed papers show results only in laboratory and testing scenarios and not in clinical trials nor case studies, limiting the supporting evidence to transfer artificial intelligence to actual care delivery.
Collapse
Affiliation(s)
- Antonio Martinez-Millana
- Instituto Universitario de Investigación de Aplicaciones de las Tecnologías de la Información y de las Comunicaciones Avanzadas (ITACA), Universitat Politècnica de València, Camino de Vera S/N, Valencia 46022, Spain
| | - Aida Saez-Saez
- Instituto Universitario de Investigación de Aplicaciones de las Tecnologías de la Información y de las Comunicaciones Avanzadas (ITACA), Universitat Politècnica de València, Camino de Vera S/N, Valencia 46022, Spain
| | - Roberto Tornero-Costa
- Instituto Universitario de Investigación de Aplicaciones de las Tecnologías de la Información y de las Comunicaciones Avanzadas (ITACA), Universitat Politècnica de València, Camino de Vera S/N, Valencia 46022, Spain
| | - Natasha Azzopardi-Muscat
- Division of Country Health Policies and Systems, World Health Organization, Regional Office for Europe, Copenhagen, Denmark
| | - Vicente Traver
- Instituto Universitario de Investigación de Aplicaciones de las Tecnologías de la Información y de las Comunicaciones Avanzadas (ITACA), Universitat Politècnica de València, Camino de Vera S/N, Valencia 46022, Spain
| | - David Novillo-Ortiz
- Division of Country Health Policies and Systems, World Health Organization, Regional Office for Europe, Copenhagen, Denmark.
| |
Collapse
|
28
|
Jalil Z, Abbasi A, Javed AR, Khan MB, Abul Hasanat MH, AlTameem A, AlKhathami M, Jilani Saudagar AK. A Novel Benchmark Dataset for COVID-19 Detection during Third Wave in Pakistan. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:6354579. [PMID: 35990145 PMCID: PMC9391128 DOI: 10.1155/2022/6354579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2022] [Revised: 07/04/2022] [Accepted: 07/13/2022] [Indexed: 11/17/2022]
Abstract
Coronavirus (COVID-19) is a highly severe infection caused by the severe acute respiratory coronavirus 2 (SARS-CoV-2). The polymerase chain reaction (PCR) test is essential to confirm the COVID-19 infection, but it has certain limitations, including paucity of reagents, is computationally time-consuming, and requires expert clinicians. Clinicians suggest that the PCR test is not a reliable automated COVID-19 patient detection system. This study proposed a machine learning-based approach to evaluate the PCR role in COVID-19 detection. We collect real data containing 603 COVID-19 samples from the Pakistan Institute of Medical Sciences (PIMS) Hospital in Islamabad, Pakistan, during the third COVID-19 wave. The experiments are separated into two sets. The first set comprises 24 features, including PCR test results, whereas the second comprises 24 features without PCR test. The findings demonstrate that the decision tree achieves the best detection rate for positive and negative COVID-19 patients in both scenarios. The findings reveal that PCR does not contribute to detecting COVID-19 patients. The findings also aid in the early detection of COVID-19, mainly when PCR test results are insufficient for diagnosing COVID-19 and help developing countries with a paucity of PCR tests and specialist facilities.
Collapse
Affiliation(s)
- Zunera Jalil
- Department of Cyber Security, PAF Complex E-9, Air University, Islamabad, Pakistan
| | - Ahmed Abbasi
- Department of Cyber Security, PAF Complex E-9, Air University, Islamabad, Pakistan
| | - Abdul Rehman Javed
- Department of Cyber Security, PAF Complex E-9, Air University, Islamabad, Pakistan
| | - Muhammad Badruddin Khan
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia
| | - Mozaherul Hoque Abul Hasanat
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia
| | - Abdullah AlTameem
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia
| | - Mohammed AlKhathami
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia
| | - Abdul Khader Jilani Saudagar
- Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia
| |
Collapse
|
29
|
Ali S, Zhou Y, Patterson M. Efficient analysis of COVID-19 clinical data using machine learning models. Med Biol Eng Comput 2022; 60:1881-1896. [PMID: 35507111 PMCID: PMC9066140 DOI: 10.1007/s11517-022-02570-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2021] [Accepted: 03/27/2022] [Indexed: 11/29/2022]
Abstract
Because of the rapid spread of COVID-19 to almost every part of the globe, huge volumes of data and case studies have been made available, providing researchers with a unique opportunity to find trends and make discoveries like never before by leveraging such big data. This data is of many different varieties and can be of different levels of veracity, e.g., precise, imprecise, uncertain, and missing, making it challenging to extract meaningful information from such data. Yet, efficient analyses of this continuously growing and evolving COVID-19 data is crucial to inform - often in real-time - the relevant measures needed for controlling, mitigating, and ultimately avoiding viral spread. Applying machine learning-based algorithms to this big data is a natural approach to take to this aim since they can quickly scale to such data and extract the relevant information in the presence of variety and different levels of veracity. This is important for COVID-19 and potential future pandemics in general. In this paper, we design a straightforward encoding of clinical data (on categorical attributes) into a fixed-length feature vector representation and then propose a model that first performs efficient feature selection from such representation. We apply this approach to two clinical datasets of the COVID-19 patients and then apply different machine learning algorithms downstream for classification purposes. We show that with the efficient feature selection algorithm, we can achieve a prediction accuracy of more than 90% in most cases. We also computed the importance of different attributes in the dataset using information gain. This can help the policymakers focus on only certain attributes to study this disease rather than focusing on multiple random factors that may not be very informative to patient outcomes.
Collapse
Affiliation(s)
- Sarwan Ali
- Georgia State University, Atlanta, GA USA
| | | | | |
Collapse
|
30
|
San-Cristobal R, Martín-Hernández R, Ramos-Lopez O, Martinez-Urbistondo D, Micó V, Colmenarejo G, Villares Fernandez P, Daimiel L, Martínez JA. Longwise Cluster Analysis for the Prediction of COVID-19 Severity within 72 h of Admission: COVID-DATA-SAVE-LIFES Cohort. J Clin Med 2022; 11:3327. [PMID: 35743398 PMCID: PMC9224935 DOI: 10.3390/jcm11123327] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Revised: 06/02/2022] [Accepted: 06/07/2022] [Indexed: 01/27/2023] Open
Abstract
The use of routine laboratory biomarkers plays a key role in decision making in the clinical practice of COVID-19, allowing the development of clinical screening tools for personalized treatments. This study performed a short-term longitudinal cluster from patients with COVID-19 based on biochemical measurements for the first 72 h after hospitalization. Clinical and biochemical variables from 1039 confirmed COVID-19 patients framed on the “COVID Data Save Lives” were grouped in 24-h blocks to perform a longitudinal k-means clustering algorithm to the trajectories. The final solution of the three clusters showed a strong association with different clinical severity outcomes (OR for death: Cluster A reference, Cluster B 12.83 CI: 6.11−30.54, and Cluster C 14.29 CI: 6.66−34.43; OR for ventilation: Cluster-B 2.22 CI: 1.64−3.01, and Cluster-C 1.71 CI: 1.08−2.76), improving the AUC of the models in terms of age, sex, oxygen concentration, and the Charlson Comorbidities Index (0.810 vs. 0.871 with p < 0.001 and 0.749 vs. 0.807 with p < 0.001, respectively). Patient diagnoses and prognoses remarkably diverged between the three clusters obtained, evidencing that data-driven technologies devised for the screening, analysis, prediction, and tracking of patients play a key role in the application of individualized management of the COVID-19 pandemics.
Collapse
Affiliation(s)
- Rodrigo San-Cristobal
- Precision Nutrition and Cardiometabolic Health Researh Program, Institute on Food and Health Sciences (Institute IMDEA Food), 28049 Madrid, Spain; (V.M.); (J.A.M.)
| | - Roberto Martín-Hernández
- Biostatistics & Bioinformatics Unit, Madrid Institute for Advanced Studies (IMDEA) Food, CEI UAM + CSIS, 28049 Madrid, Spain; (R.M.-H.); (G.C.)
| | - Omar Ramos-Lopez
- Medicine and Psychology School, Autonomous University of Baja California, Tijuana 22390, Baja California, Mexico;
| | - Diego Martinez-Urbistondo
- Internal Medicine Department, Hospital Universitario HM Sanchinarro, 28050 Madrid, Spain; (D.M.-U.); (P.V.F.)
| | - Víctor Micó
- Precision Nutrition and Cardiometabolic Health Researh Program, Institute on Food and Health Sciences (Institute IMDEA Food), 28049 Madrid, Spain; (V.M.); (J.A.M.)
| | - Gonzalo Colmenarejo
- Biostatistics & Bioinformatics Unit, Madrid Institute for Advanced Studies (IMDEA) Food, CEI UAM + CSIS, 28049 Madrid, Spain; (R.M.-H.); (G.C.)
| | - Paula Villares Fernandez
- Internal Medicine Department, Hospital Universitario HM Sanchinarro, 28050 Madrid, Spain; (D.M.-U.); (P.V.F.)
| | - Lidia Daimiel
- Nutritional Control of the Epigenome Group, IMDEA Food Institute, CEI UAM + CSIC, 28049 Madrid, Spain;
| | - Jose Alfredo Martínez
- Precision Nutrition and Cardiometabolic Health Researh Program, Institute on Food and Health Sciences (Institute IMDEA Food), 28049 Madrid, Spain; (V.M.); (J.A.M.)
- CIBERobn Physiopathology of Obesity and Nutrition, Institute of Health Carlos III (ISCIII), 28029 Madrid, Spain
| |
Collapse
|
31
|
Clinical and Laboratory Approach to Diagnose COVID-19 Using Machine Learning. Interdiscip Sci 2022; 14:452-470. [PMID: 35133633 PMCID: PMC8846962 DOI: 10.1007/s12539-021-00499-4] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 12/17/2021] [Accepted: 12/23/2021] [Indexed: 12/18/2022]
Abstract
Coronavirus 2 (SARS-CoV-2), often known by the name COVID-19, is a type of acute respiratory syndrome that has had a significant influence on both economy and health infrastructure worldwide. This novel virus is diagnosed utilising a conventional method known as the RT-PCR (Reverse Transcription Polymerase Chain Reaction) test. This approach, however, produces a lot of false-negative and erroneous outcomes. According to recent studies, COVID-19 can also be diagnosed using X-rays, CT scans, blood tests and cough sounds. In this article, we use blood tests and machine learning to predict the diagnosis of this deadly virus. We also present an extensive review of various existing machine-learning applications that diagnose COVID-19 from clinical and laboratory markers. Four different classifiers along with a technique called Synthetic Minority Oversampling Technique (SMOTE) were used for classification. Shapley Additive Explanations (SHAP) method was utilized to calculate the gravity of each feature and it was found that eosinophils, monocytes, leukocytes and platelets were the most critical blood parameters that distinguished COVID-19 infection for our dataset. These classifiers can be utilized in conjunction with RT-PCR tests to improve sensitivity and in emergency situations such as a pandemic outbreak that might happen due to new strains of the virus. The positive results indicate the prospective use of an automated framework that could help clinicians and medical personnel diagnose and screen patients.
Collapse
|
32
|
Ramírez Varela A, Moreno López S, Contreras-Arrieta S, Tamayo-Cabeza G, Restrepo-Restrepo S, Sarmiento-Barbieri I, Caballero-Díaz Y, Jorge Hernandez-Florez L, Mario González J, Salas-Zapata L, Laajaj R, Buitrago-Gutierrez G, de la Hoz-Restrepo F, Vives Florez M, Osorio E, Sofía Ríos-Oliveros D, Behrentz E. Prediction of SARS-CoV-2 infection with a Symptoms-Based model to aid public health decision making in Latin America and other low and middle income settings. Prev Med Rep 2022; 27:101798. [PMID: 35469291 PMCID: PMC9020649 DOI: 10.1016/j.pmedr.2022.101798] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 03/16/2022] [Accepted: 04/17/2022] [Indexed: 11/01/2022] Open
Abstract
Early non-pharmacological interventions are necessary limit the spread of COVID-19. In low to middle-income countries, there are limited resources to face the pandemic. Symptoms (i.e. anosmia) can be used to apply early strategies in suspicious cases. Logistic regression provides interpretability in prediction analysis. Machine learning analysis aids prediction because of its capacity of data synthesis. Symptoms-based models for predicting SARS-CoV-2 infection may improve clinical decision-making and be an alternative to resource allocation in under-resourced settings. In this study we aimed to test a model based on symptoms to predict a positive test result for SARS-CoV-2 infection during the COVID-19 pandemic using logistic regression and a machine-learning approach, in Bogotá, Colombia. Participants from the CoVIDA project were included. A logistic regression using the model was chosen based on biological plausibility and the Akaike Information criterion. Also, we performed an analysis using machine learning with random forest, support vector machine, and extreme gradient boosting. The study included 58,577 participants with a positivity rate of 5.7%. The logistic regression showed that anosmia (aOR = 7.76, 95% CI [6.19, 9.73]), fever (aOR = 4.29, 95% CI [3.07, 6.02]), headache (aOR = 3.29, 95% CI [1.78, 6.07]), dry cough (aOR = 2.96, 95% CI [2.44, 3.58]), and fatigue (aOR = 1.93, 95% CI [1.57, 2.93]) were independently associated with SARS-CoV-2 infection. Our final model had an area under the curve of 0.73. The symptoms-based model correctly identified over 85% of participants. This model can be used to prioritize resource allocation related to COVID-19 diagnosis, to decide on early isolation, and contact-tracing strategies in individuals with a high probability of infection before receiving a confirmatory test result. This strategy has public health and clinical decision-making significance in low- and middle-income settings like Latin America.
Collapse
|
33
|
Ulgen A, Cetin S, Cetin M, Sivgin H, Li W. A composite ranking of risk factors for COVID-19 time-to-event data from a Turkish cohort. Comput Biol Chem 2022; 98:107681. [PMID: 35487152 PMCID: PMC8993420 DOI: 10.1016/j.compbiolchem.2022.107681] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 03/04/2022] [Accepted: 04/04/2022] [Indexed: 02/08/2023]
Abstract
Having a complete and reliable list of risk factors from routine laboratory blood test for COVID-19 disease severity and mortality is important for patient care and hospital management. It is common to use meta-analysis to combine analysis results from different studies to make it more reproducible. In this paper, we propose to run multiple analyses on the same set of data to produce a more robust list of risk factors. With our time-to-event survival data, the standard survival analysis were extended in three directions. The first is to extend from tests and corresponding p-values to machine learning and their prediction performance. The second is to extend from single-variable to multiple-variable analysis. The third is to expand from analyzing time-to-decease data with death as the event of interest to analyzing time-to-hospital-release data to treat early recovery as a meaningful event as well. Our extension of the type of analyses leads to ten ranking lists. We conclude that 20 out of 30 factors are deemed to be reliably associated to faster-death or faster-recovery. Considering correlation among factors and evidenced by stepwise variable selection in random survival forest, 10 ~ 15 factors seem to be able to achieve the optimal prognosis performance. Our final list of risk factors contain calcium, white blood cell and neutrophils count, urea and creatine, d-dimer, red cell distribution widths, age, ferritin, glucose, lactate dehydrogenase, lymphocyte, basophils, anemia related factors (hemoglobin, hematocrit, mean corpuscular hemoglobin concentration), sodium, potassium, eosinophils, and aspartate aminotransferase.
Collapse
Affiliation(s)
- Ayse Ulgen
- Department of Biostatistics, Faculty of Medicine, Girne American University, Karmi, Cyprus
| | - Sirin Cetin
- Department of Biostatistics, Faculty of Medicine, Tokat Gaziosmanpasa University, Turkey
| | - Meryem Cetin
- Department of Medical Microbiology, Faculty of Medicine, Amasya University, Amasya, Turkey
| | - Hakan Sivgin
- Department of Internal Medicine, Faculty of Medicine, Tokat Gaziosmanpaşa University, Turkey
| | - Wentian Li
- The Robert S. Boas Center for Genomics and Human Genetics, The Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA.
| |
Collapse
|
34
|
Supporting Clinical COVID-19 Diagnosis with Routine Blood Tests Using Tree-Based Entropy Structured Self-Organizing Maps. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12105137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Data classification is an automatic or semi-automatic process that, utilizing artificial intelligence algorithms, learns the variable and class relationships of a dataset for use a posteriori in situations where the class result is unknown. For many years, work on this topic has been aimed at increasing the hit rates of algorithms. However, when the problem is restricted to applications in healthcare, besides the concern with performance, it is also necessary to design algorithms whose results are understandable by the specialists responsible for making the decisions. Among the problems in the field of medicine, a current focus is related to COVID-19: AI algorithms may contribute to early diagnosis. Among the available COVID-19 data, the blood test is a typical procedure performed when the patient seeks the hospital, and its use in the diagnosis allows reducing the need for other diagnostic tests that can impact the detection time and add to costs. In this work, we propose using self-organizing map (SOM) to discover attributes in blood test examinations that are relevant for COVID-19 diagnosis. We applied SOM and an entropy calculation in the definition of a hierarchical, semi-supervised and explainable model named TESSOM (tree-based entropy-structured self-organizing maps), in which the main feature is enhancing the investigation of groups of cases with high levels of class overlap, as far as the diagnostic outcome is concerned. Framing the TESSOM algorithm in the context of explainable artificial intelligence (XAI) makes it possible to explain the results to an expert in a simplified way. It is demonstrated in the paper that the use of the TESSOM algorithm to identify attributes of blood tests can help with the identification of COVID-19 cases. It providing a performance increase in 1.489% in multiple scenarios when analyzing 2207 cases from three hospitals in the state of São Paulo, Brazil. This work is a starting point for researchers to identify relevant attributes of blood tests for COVID-19 and to support the diagnosis of other diseases.
Collapse
|
35
|
Rahman MS, Chowdhury AH, Amrin M. Accuracy comparison of ARIMA and XGBoost forecasting models in predicting the incidence of COVID-19 in Bangladesh. PLOS GLOBAL PUBLIC HEALTH 2022; 2:e0000495. [PMID: 36962227 PMCID: PMC10021465 DOI: 10.1371/journal.pgph.0000495] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Accepted: 04/27/2022] [Indexed: 04/19/2023]
Abstract
Accurate predictive time series modelling is important in public health planning and response during the emergence of a novel pandemic. Therefore, the aims of the study are three-fold: (a) to model the overall trend of COVID-19 confirmed cases and deaths in Bangladesh; (b) to generate a short-term forecast of 8 weeks of COVID-19 cases and deaths; (c) to compare the predictive accuracy of the Autoregressive Integrated Moving Average (ARIMA) and eXtreme Gradient Boosting (XGBoost) for precise modelling of non-linear features and seasonal trends of the time series. The data were collected from the onset of the epidemic in Bangladesh from the Directorate General of Health Service (DGHS) and Institute of Epidemiology, Disease Control and Research (IEDCR). The daily confirmed cases and deaths of COVID-19 of 633 days in Bangladesh were divided into several training and test sets. The ARIMA and XGBoost models were established using those training data, and the test sets were used to evaluate each model's ability to forecast and finally averaged all the predictive performances to choose the best model. The predictive accuracy of the models was assessed using the mean absolute error (MAE), mean percentage error (MPE), root mean square error (RMSE) and mean absolute percentage error (MAPE). The findings reveal the existence of a nonlinear trend and weekly seasonality in the dataset. The average error measures of the ARIMA model for both COVID-19 confirmed cases and deaths were lower than XGBoost model. Hence, in our study, the ARIMA model performed better than the XGBoost model in predicting COVID-19 confirmed cases and deaths in Bangladesh. The suggested prediction model might play a critical role in estimating the spread of a novel pandemic in Bangladesh and similar countries.
Collapse
|
36
|
Speech as a Biomarker for COVID-19 Detection Using Machine Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:6093613. [PMID: 35444694 PMCID: PMC9014833 DOI: 10.1155/2022/6093613] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 03/07/2022] [Accepted: 03/21/2022] [Indexed: 11/30/2022]
Abstract
The use of speech as a biomedical signal for diagnosing COVID-19 is investigated using statistical analysis of speech spectral features and classification algorithms based on machine learning. It is established that spectral features of speech, obtained by computing the short-time Fourier Transform (STFT), get altered in a statistical sense as a result of physiological changes. These spectral features are then used as input features to machine learning-based classification algorithms to classify them as coming from a COVID-19 positive individual or not. Speech samples from healthy as well as “asymptomatic” COVID-19 positive individuals have been used in this study. It is shown that the RMS error of statistical distribution fitting is higher in the case of speech samples of COVID-19 positive speech samples as compared to the speech samples of healthy individuals. Five state-of-the-art machine learning classification algorithms have also been analyzed, and the performance evaluation metrics of these algorithms are also presented. The tuning of machine learning model parameters is done so as to minimize the misclassification of COVID-19 positive individuals as being COVID-19 negative since the cost associated with this misclassification is higher than the opposite misclassification. The best performance in terms of the “recall” metric is observed for the Decision Forest algorithm which gives a recall value of 0.7892.
Collapse
|
37
|
An Improved Machine-Learning Approach for COVID-19 Prediction Using Harris Hawks Optimization and Feature Analysis Using SHAP. Diagnostics (Basel) 2022; 12:diagnostics12051023. [PMID: 35626179 PMCID: PMC9139459 DOI: 10.3390/diagnostics12051023] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Revised: 04/02/2022] [Accepted: 04/16/2022] [Indexed: 02/01/2023] Open
Abstract
A healthcare monitoring system needs the support of recent technologies such as artificial intelligence (AI), machine learning (ML), and big data, especially during the COVID-19 pandemic. This global pandemic has already taken millions of lives. Both infected and uninfected people have generated big data where AI and ML can use to combat and detect COVID-19 at an early stage. Motivated by this, an improved ML framework for the early detection of this disease is proposed in this paper. The state-of-the-art Harris hawks optimization (HHO) algorithm with an improved objective function is proposed and applied to optimize the hyperparameters of the ML algorithms, namely HHO-based eXtreme gradient boosting (HHOXGB), light gradient boosting (HHOLGB), categorical boosting (HHOCAT), random forest (HHORF) and support vector classifier (HHOSVC). An ensemble technique was applied to these optimized ML models to improve the prediction performance. Our proposed method was applied to publicly available big COVID-19 data and yielded a prediction accuracy of 92.38% using the ensemble model. In contrast, HHOXGB provided the highest accuracy of 92.23% as a single optimized model. The performance of the proposed method was compared with the traditional algorithms and other ML-based methods. In both cases, our proposed method performed better. Furthermore, not only the classification improvement, but also the features are analyzed in terms of feature importance calculated by SHapely adaptive exPlanations (SHAP) values. A graphical user interface is also discussed as a potential tool for nonspecialist users such as clinical staff and nurses. The processed data, trained model, and codes related to this study are available at GitHub.
Collapse
|
38
|
Zhao Y, Zhang R, Zhong Y, Wang J, Weng Z, Luo H, Chen C. Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients. Front Cell Infect Microbiol 2022; 12:838749. [PMID: 35521216 PMCID: PMC9063041 DOI: 10.3389/fcimb.2022.838749] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 03/07/2022] [Indexed: 01/22/2023] Open
Abstract
The Coronavirus Disease 2019 (COVID-19) has spread all over the world and impacted many people’s lives. The characteristics of COVID-19 and other types of pneumonia have both similarities and differences, which confused doctors initially to separate and understand them. Here we presented a retrospective analysis for both COVID-19 and other types of pneumonia by combining the COVID-19 clinical data, eICU and MIMIC-III databases. Machine learning models, including logistic regression, random forest, XGBoost and deep learning neural networks, were developed to predict the severity of COVID-19 infections as well as the mortality of pneumonia patients in intensive care units (ICU). Statistical analysis and feature interpretation, including the analysis of two-level attention mechanisms on both temporal and non-temporal features, were utilized to understand the associations between different clinical variables and disease outcomes. For the COVID-19 data, the XGBoost model obtained the best performance on the test set (AUROC = 1.000 and AUPRC = 0.833). On the MIMIC-III and eICU pneumonia datasets, our deep learning model (Bi-LSTM_Attn) was able to identify clinical variables associated with death of pneumonia patients (AUROC = 0.924 and AUPRC = 0.802 for 24-hour observation window and 12-hour prediction window). The results highlighted clinical indicators, such as the lymphocyte counts, that may help the doctors to predict the disease progression and outcomes for both COVID-19 and other types of pneumonia.
Collapse
Affiliation(s)
- Yu Zhao
- College of Computer and Data Science, Fuzhou University, Fuzhou, China
- Centre for Big Data Research in Burns and Trauma, Fuzhou University, Fuzhou, China
| | - Rusen Zhang
- Department of Cardiovascular Medicine, Affiliated Fuzhou First Hospital of Fujian Medical University, Fuzhou, China
| | - Yi Zhong
- College of Computer and Data Science, Fuzhou University, Fuzhou, China
- Centre for Big Data Research in Burns and Trauma, Fuzhou University, Fuzhou, China
| | - Jingjing Wang
- Department of Critical Care Medicine, Union Hospital of Fujian Medical University, Fuzhou, China
| | - Zuquan Weng
- Centre for Big Data Research in Burns and Trauma, Fuzhou University, Fuzhou, China
- College of Biological Science and Engineering, Fuzhou University, Fuzhou, China
- *Correspondence: Zuquan Weng, ; Heng Luo, ; Cunrong Chen,
| | - Heng Luo
- College of Computer and Data Science, Fuzhou University, Fuzhou, China
- Centre for Big Data Research in Burns and Trauma, Fuzhou University, Fuzhou, China
- MetaNovas Biotech Inc., Foster City, CA, United States
- *Correspondence: Zuquan Weng, ; Heng Luo, ; Cunrong Chen,
| | - Cunrong Chen
- Department of Critical Care Medicine, Union Hospital of Fujian Medical University, Fuzhou, China
- *Correspondence: Zuquan Weng, ; Heng Luo, ; Cunrong Chen,
| |
Collapse
|
39
|
Xu F, Lou K, Chen C, Chen Q, Wang D, Wu J, Zhu W, Tan W, Zhou Y, Liu Y, Wang B, Zhang X, Zhang Z, Zhang J, Sun M, Zhang G, Dai G, Hu H. An original deep learning model using limited data for COVID-19 discrimination: A multi-center study. Med Phys 2022; 49:3874-3885. [PMID: 35305027 PMCID: PMC9088453 DOI: 10.1002/mp.15549] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 12/24/2021] [Accepted: 02/07/2022] [Indexed: 11/06/2022] Open
Abstract
OBJECTIVES Artificial intelligence (AI) has been proved to be a highly efficient tool for COVID-19 diagnosis, but the large data size and heavy label force required for algorithm development and the poor generalizability of AI algorithms, to some extent, limit the application of AI technology in clinical practice. The aim of this study is to develop an AI algorithm with high robustness using limited chest CT data for COVID-19 discrimination. METHODS A three dimensional algorithm that combined multi-instance learning (MIL) with the long and short-term memory (LSTM) architecture (3DMTM) was developed for differentiating COVID-19 from community acquired pneumonia (CAP) while logistic regression (LR), k-nearest neighbor (KNN), support vector machine (SVM) and a three dimensional convolutional neural network (3D CNN) set for comparison. Totally, 515 patients with or without COVID-19 between December 2019 and March 2020 from 5 different hospitals were recruited and divided into relatively large (150 COVID-19 and 183 CAP cases) and relatively small datasets (17 COVID-19 and 35 CAP cases) for either training or validation and another independent dataset (37 COVID-19 and 93 CAP cases) for external test. Area under the receiver operating characteristic curve (AUC), sensitivity, specificity, precision, accuracy, F1 score, and G-Mean were utilized for performance evaluation. RESULTS In the external test cohort, the relatively large data-based 3DMTM-LD achieved an AUC of 0.956 (95%CI, 0.929∼0.982) with 86.2% and 98.0% for its sensitivity and specificity. 3DMTM-SD got an AUC of 0.937 (95%CI, 0.909∼0.965) while the AUC of 3DCM-SD decreased dramatically to 0.714 (95%CI, 0.649∼0.780) with training data reduction. KNN-MMSD, LR-MMSD, SVM-MMSD and 3DCM-MMSD benefited significantly from the inclusion of clinical information while models trained with relatively large dataset got slight performance improvement in COVID-19 discrimination. 3DMTM, trained with either CT or multi-modal data, presented comparably excellent performance in COVID-19 discrimination. CONCLUSIONS The 3DMTM algorithm presented excellent robustness for COVID-19 discrimination with limited CT data. 3DMTM based on CT data performed comparably in COVID-19 discrimination with that trained with multi-modal information. Clinical information could improve the performance of KNN, LR, SVM and 3DCM in COVID-19 discrimination, especially in the scenario with limited data for training. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Fangyi Xu
- Department of Radiology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, No.3, Qingchun East Road, Shangcheng District, Hangzhou, Zhejiang, China
| | - Kaihua Lou
- Department of Radiology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, No.3, Qingchun East Road, Shangcheng District, Hangzhou, Zhejiang, China
| | - Chao Chen
- Department of Radiology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, No.3, Qingchun East Road, Shangcheng District, Hangzhou, Zhejiang, China
| | - Qingqing Chen
- Department of Radiology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, No.3, Qingchun East Road, Shangcheng District, Hangzhou, Zhejiang, China
| | - Dawei Wang
- Institute of Advanced Research, Infervision Medical Technology Co., Ltd., 18F, Building E. Yuanyang International Center, Chaoyang District, Beijing, China
| | - Jiangfen Wu
- Institute of Advanced Research, Infervision Medical Technology Co., Ltd., 18F, Building E. Yuanyang International Center, Chaoyang District, Beijing, China
| | - Wenchao Zhu
- Department of Radiology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, No.3, Qingchun East Road, Shangcheng District, Hangzhou, Zhejiang, China
| | - Weixiong Tan
- Institute of Advanced Research, Infervision Medical Technology Co., Ltd., 18F, Building E. Yuanyang International Center, Chaoyang District, Beijing, China
| | - Yong Zhou
- Department of Pulmonary and Critical Care Medicine, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, No.3, Qingchun East Road, Shangcheng District, Hangzhou, Zhejiang, China.,China National Respiratory Regional Medical Center (East China), No.3, Qingchun East Road, Shangcheng District, Hangzhou, Zhejiang, China
| | - Yongjiu Liu
- Department of Radiology, JINGMEN NO.1 PEOPLE'S HOSPITAL, No.168, Xiangshan Road, Dongbao District, Jingmen, Hubei, China
| | - Bing Wang
- Department of Radiology, JINGMEN NO.1 PEOPLE'S HOSPITAL, No.168, Xiangshan Road, Dongbao District, Jingmen, Hubei, China
| | - Xiaoguo Zhang
- Department of respiratory medicine, Jinan Infectious Disease Hospital, Shandong University, No.22029, Jingshi Road, Shizhong District, Jinan, China
| | - Zhongfa Zhang
- Department of respiratory medicine, Jinan Infectious Disease Hospital, Shandong University, No.22029, Jingshi Road, Shizhong District, Jinan, China
| | - Jianjun Zhang
- Department of Radiology, Zhejiang Hospital, No.12, Lingyin Road, Xihu District, Hangzhou, China
| | - Mingxia Sun
- Department of Radiology, Zhejiang Hospital, No.12, Lingyin Road, Xihu District, Hangzhou, China
| | - Guohua Zhang
- Department of Radiology, TAIZHOU NO.1 PEOPLE'S HOSPITAL, No.218, Hengjie Road, Huangyan District, Taizhou, Zhejiang, China
| | - Guojiao Dai
- Department of Radiology, TAIZHOU NO.1 PEOPLE'S HOSPITAL, No.218, Hengjie Road, Huangyan District, Taizhou, Zhejiang, China
| | - Hongjie Hu
- Department of Radiology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, No.3, Qingchun East Road, Shangcheng District, Hangzhou, Zhejiang, China
| |
Collapse
|
40
|
Lau KYY, Ng KS, Kwok KW, Tsia KKM, Sin CF, Lam CW, Vardhanabhuti V. An Unsupervised Machine Learning Clustering and Prediction of Differential Clinical Phenotypes of COVID-19 Patients Based on Blood Tests—A Hong Kong Population Study. Front Med (Lausanne) 2022; 8:764934. [PMID: 35284429 PMCID: PMC8907521 DOI: 10.3389/fmed.2021.764934] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 12/27/2021] [Indexed: 01/08/2023] Open
Abstract
Background To better understand the different clinical phenotypes across the disease spectrum in patients with COVID-19 using an unsupervised machine learning clustering approach. Materials and Methods A population-based retrospective study was conducted utilizing demographics, clinical characteristics, comorbidities, and clinical outcomes of 7,606 COVID-19–positive patients on admission to public hospitals in Hong Kong in the year 2020. An unsupervised machine learning clustering was used to explore this large cohort. Results Four clusters of differing clinical phenotypes based on data at initial admission was derived in which 86.6% of the deceased cases were aggregated in one of the clusters without prior knowledge of their clinical outcomes. Other distinctive clinical characteristics of this cluster were old age and high concurrent comorbidities as well as laboratory characteristics of lower hemoglobin/hematocrit levels, higher neutrophil, C-reactive protein, lactate dehydrogenase, and creatinine levels. The clinical patterns captured by the cluster analysis was validated on other temporally distinct cohorts in 2021. The phenotypes aligned with existing literature. Conclusion The study demonstrated the usefulness of unsupervised machine learning techniques with the potential to uncover latent clinical phenotypes. It could serve as a more robust classification for patient triaging and patient-tailored treatment strategies.
Collapse
Affiliation(s)
- Kitty Yu-Yeung Lau
- Biomedical Engineering Programme, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Kei-Shing Ng
- Department of Diagnostic Radiology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Ka-Wai Kwok
- Department of Mechanical Engineering, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Kevin Kin-Man Tsia
- Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Chun-Fung Sin
- Department of Pathology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Ching-Wan Lam
- Department of Pathology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Varut Vardhanabhuti
- Department of Diagnostic Radiology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
- *Correspondence: Varut Vardhanabhuti
| |
Collapse
|
41
|
Xiong Y, Ma Y, Ruan L, Li D, Lu C, Huang L. Comparing different machine learning techniques for predicting COVID-19 severity. Infect Dis Poverty 2022; 11:19. [PMID: 35177120 PMCID: PMC8851750 DOI: 10.1186/s40249-022-00946-4] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 02/09/2022] [Indexed: 12/28/2022] Open
Abstract
Background Coronavirus disease 2019 (COVID-19) is still ongoing spreading globally, machine learning techniques were used in disease diagnosis and to predict treatment outcomes, which showed favorable performance. The present study aims to predict COVID-19 severity at admission by different machine learning techniques including random forest (RF), support vector machine (SVM), and logistic regression (LR). Feature importance to COVID-19 severity were further identified. Methods A retrospective design was adopted in the JinYinTan Hospital from January 26 to March 28, 2020, eighty-six demographic, clinical, and laboratory features were selected with LassoCV method, Spearman’s rank correlation, experts’ opinions, and literature evaluation. RF, SVM, and LR were performed to predict severe COVID-19, the performance of the models was compared by the area under curve (AUC). Additionally, feature importance to COVID-19 severity were analyzed by the best performance model. Results A total of 287 patients were enrolled with 36.6% severe cases and 63.4% non-severe cases. The median age was 60.0 years (interquartile range: 49.0–68.0 years). Three models were established using 23 features including 1 clinical, 1 chest computed tomography (CT) and 21 laboratory features. Among three models, RF yielded better overall performance with the highest AUC of 0.970 than SVM of 0.948 and LR of 0.928, RF also achieved a favorable sensitivity of 96.7%, specificity of 69.5%, and accuracy of 84.5%. SVM had sensitivity of 93.9%, specificity of 79.0%, and accuracy of 88.5%. LR also achieved a favorable sensitivity of 92.3%, specificity of 72.3%, and accuracy of 85.2%. Additionally, chest-CT had highest importance to illness severity, and the following features were neutrophil to lymphocyte ratio, lactate dehydrogenase, and D-dimer, respectively. Conclusions Our results indicated that RF could be a useful predictive tool to identify patients with severe COVID-19, which may facilitate effective care and further optimize resources. Graphical Abstract ![]()
Supplementary Information The online version contains supplementary material available at 10.1186/s40249-022-00946-4.
Collapse
Affiliation(s)
- Yibai Xiong
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, No. 16, Nanxiao Street, Dongzhimen, Dongcheng District, Beijing, 100700, Beijing, China
| | - Yan Ma
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, No. 16, Nanxiao Street, Dongzhimen, Dongcheng District, Beijing, 100700, Beijing, China
| | - Lianguo Ruan
- Department of Infectious Diseases, JinYinTan Hospital, Wuhan, 430040, China
| | - Dan Li
- Information Center, Chinese Center for Disease Control and Prevention, Beijing, 102206, China
| | - Cheng Lu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, No. 16, Nanxiao Street, Dongzhimen, Dongcheng District, Beijing, 100700, Beijing, China.
| | - Luqi Huang
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, No. 16, Nanxiao Street, Dongzhimen, Dongcheng District, Beijing, 100700, Beijing, China.
| | | |
Collapse
|
42
|
Xu Z, Su C, Xiao Y, Wang F. Artificial intelligence for COVID-19: battling the pandemic with computational intelligence. INTELLIGENT MEDICINE 2022; 2:13-29. [PMID: 34697578 PMCID: PMC8529224 DOI: 10.1016/j.imed.2021.09.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 09/15/2021] [Accepted: 09/29/2021] [Indexed: 12/15/2022]
Abstract
The new coronavirus disease 2019 (COVID-19) has become a global pandemic leading to over 180 million confirmed cases and nearly 4 million deaths until June 2021, according to the World Health Organization. Since the initial report in December 2019 , COVID-19 has demonstrated a high transmission rate (with an R0 > 2), a diverse set of clinical characteristics (e.g., high rate of hospital and intensive care unit admission rates, multi-organ dysfunction for critically ill patients due to hyperinflammation, thrombosis, etc.), and a tremendous burden on health care systems around the world. To understand the serious and complex diseases and develop effective control, treatment, and prevention strategies, researchers from different disciplines have been making significant efforts from different aspects including epidemiology and public health, biology and genomic medicine, as well as clinical care and patient management. In recent years, artificial intelligence (AI) has been introduced into the healthcare field to aid clinical decision-making for disease diagnosis and treatment such as detecting cancer based on medical images, and has achieved superior performance in multiple data-rich application scenarios. In the COVID-19 pandemic, AI techniques have also been used as a powerful tool to overcome the complex diseases. In this context, the goal of this study is to review existing studies on applications of AI techniques in combating the COVID-19 pandemic. Specifically, these efforts can be grouped into the fields of epidemiology, therapeutics, clinical research, social and behavioral studies and are summarized. Potential challenges, directions, and open questions are discussed accordingly, which may provide new insights into addressing the COVID-19 pandemic and would be helpful for researchers to explore more related topics in the post-pandemic era.
Collapse
Affiliation(s)
- Zhenxing Xu
- Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, New York 10065, United States
| | - Chang Su
- Department of Health Service Administration and Policy, Temple University, Philadelphia 19122, United States
| | - Yunyu Xiao
- Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, New York 10065, United States
| | - Fei Wang
- Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, New York 10065, United States
| |
Collapse
|
43
|
He F, Page JH, Weinberg KR, Mishra A. The Development and Validation of Simplified Machine Learning Algorithms to Predict Prognosis of Hospitalized Patients With COVID-19: Multicenter, Retrospective Study. J Med Internet Res 2022; 24:e31549. [PMID: 34951865 PMCID: PMC8785956 DOI: 10.2196/31549] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Revised: 07/26/2021] [Accepted: 12/19/2021] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND The current COVID-19 pandemic is unprecedented; under resource-constrained settings, predictive algorithms can help to stratify disease severity, alerting physicians of high-risk patients; however, there are only few risk scores derived from a substantially large electronic health record (EHR) data set, using simplified predictors as input. OBJECTIVE The objectives of this study were to develop and validate simplified machine learning algorithms that predict COVID-19 adverse outcomes; to evaluate the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and calibration of the algorithms; and to derive clinically meaningful thresholds. METHODS We performed machine learning model development and validation via a cohort study using multicenter, patient-level, longitudinal EHRs from the Optum COVID-19 database that provides anonymized, longitudinal EHR from across the United States. The models were developed based on clinical characteristics to predict 28-day in-hospital mortality, intensive care unit (ICU) admission, respiratory failure, and mechanical ventilator usages at inpatient setting. Data from patients who were admitted from February 1, 2020, to September 7, 2020, were randomly sampled into development, validation, and test data sets; data collected from September 7, 2020, to November 15, 2020, were reserved as the postdevelopment prospective test data set. RESULTS Of the 3.7 million patients in the analysis, 585,867 patients were diagnosed or tested positive for SARS-CoV-2, and 50,703 adult patients were hospitalized with COVID-19 between February 1 and November 15, 2020. Among the study cohort (n=50,703), there were 6204 deaths, 9564 ICU admissions, 6478 mechanically ventilated or EMCO patients, and 25,169 patients developed acute respiratory distress syndrome or respiratory failure within 28 days since hospital admission. The algorithms demonstrated high accuracy (AUC 0.89, 95% CI 0.89-0.89 on the test data set [n=10,752]), consistent prediction through the second wave of the pandemic from September to November (AUC 0.85, 95% CI 0.85-0.86) on the postdevelopment prospective test data set [n=14,863], great clinical relevance, and utility. Besides, a comprehensive set of 386 input covariates from baseline or at admission were included in the analysis; the end-to-end pipeline automates feature selection and model development. The parsimonious model with only 10 input predictors produced comparably accurate predictions; these 10 predictors (age, blood urea nitrogen, SpO2, systolic and diastolic blood pressures, respiration rate, pulse, temperature, albumin, and major cognitive disorder excluding stroke) are commonly measured and concordant with recognized risk factors for COVID-19. CONCLUSIONS The systematic approach and rigorous validation demonstrate consistent model performance to predict even beyond the period of data collection, with satisfactory discriminatory power and great clinical utility. Overall, the study offers an accurate, validated, and reliable prediction model based on only 10 clinical features as a prognostic tool to stratifying patients with COVID-19 into intermediate-, high-, and very high-risk groups. This simple predictive tool is shared with a wider health care community, to enable service as an early warning system to alert physicians of possible high-risk patients, or as a resource triaging tool to optimize health care resources.
Collapse
Affiliation(s)
- Fang He
- Amgen Inc, Center for Observational Research, South San Francisco, CA, United States
- Amgen Inc, Digital Health & Innovation, Thousand Oaks, CA, United States
| | - John H Page
- Amgen Inc, Center for Observational Research, Thousand Oaks, CA, United States
| | - Kerry R Weinberg
- Amgen Inc, Digital Health & Innovation, Thousand Oaks, CA, United States
| | - Anirban Mishra
- Amgen Inc, Digital Health & Innovation, Thousand Oaks, CA, United States
| |
Collapse
|
44
|
Boddu RSK, Karmakar P, Bhaumik A, Nassa VK, Bhattacharya S. Analyzing the impact of machine learning and artificial intelligence and its effect on management of lung cancer detection in covid-19 pandemic. MATERIALS TODAY: PROCEEDINGS 2022; 56:2213-2216. [PMID: 34877264 PMCID: PMC8641302 DOI: 10.1016/j.matpr.2021.11.549] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Cancer victims, particularly those with lung cancer, are more susceptible and at higher danger of COVID-19 and associated consequences as a result of their compromised immune systems, which makes them particularly sensitive. Because of a variety of circumstances, cancer patients' diagnosis, treatment, and aftercare are very complicated and time-consuming during an epidemic. In such circumstances, advances in artificial intelligence (AI) and machine learning algorithms (ML) offer the capacity to boost cancer sufferer diagnosis, therapy, and care via the use of cutting technologies. For example, using clinical and imaging data combined with machine learning methods, the researchers may be able to distinguish among lung alterations induced by corona virus and those produced by immunotherapy and radiation. During this epidemic, artificial intelligence (AI) may be utilized to guarantee that the appropriate individuals are recruited in cancer clinical trials more quickly and effectively than in the past, which was done in a conventional and complicated manner. In order to better care for cancer patients and find novel and more effective therapies, It is critical that we move beyond traditional research methods and use artificial intelligence (AI) and machine learning to update our research (ML). Artificial intelligence (AI) and machine learning (ML) are being utilised to help with several aspects of the COVID-19 epidemic, such as epidemiology, molecular research and medication development, medical diagnosis and treatment, and socioeconomics. The use of artificial intelligence (AI) and machine learning (ML) in the diagnosis and treatment of COVID-19 patients is also being investigated. The combination of artificial intelligence and machine learning in COVID-19 may help to identify positive patients more quickly. In order to understand the dynamics of an epidemic that is relevant to artificial intelligence, when used in different patient groups, AI-based algorithms can quickly detect CT scans with COVID-19 linked pneumonia, as well as discriminate non-COVID connected pneumonia with high specificity and accuracy. It is possible to utilize the existing difficulties and future views presented in this study to guide an optimal implementation of AI and machine learning technologies in an epidemic.
Collapse
Affiliation(s)
| | - Partha Karmakar
- Government of West Bengal, Bikash Bhawan, Salt Lake, Kolkata, W.B., India
| | - Ankan Bhaumik
- Dept. of Applied Mathematics With Oceanology and Computer Programming, Vidyasagar University, Midnapore, India
| | - Vinay Kumar Nassa
- Department of Computer Science Engineering, South Point Group of Institutions Sonepat, Haryana, India
| | | |
Collapse
|
45
|
Ilu SY, Rajesh P, Mohammed H. Prediction of COVID-19 using long short-term memory by integrating principal component analysis and clustering techniques. INFORMATICS IN MEDICINE UNLOCKED 2022; 31:100990. [PMID: 35677733 PMCID: PMC9164683 DOI: 10.1016/j.imu.2022.100990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Revised: 06/01/2022] [Accepted: 06/01/2022] [Indexed: 11/27/2022] Open
Abstract
Severe acute respiratory syndrome coronavirus (SARS-COV) is a major family of viruses that cause infections in both animals and humans, including common cold, coronavirus disease (COVID-19), severe acute respiratory syndrome (SARS), and Middle East respiratory syndrome. This study primarily aims to predict the number of COVID-19 positive cases in 36 states of Nigeria using a long short-term memory (LSTM) algorithm of deep learning. The proposed approach employs K-means clustering to detect outliers and principal component analysis (PCA) to select important features from the dataset. The LSTM was chosen because of its non-linear characteristics to handle the dataset. As COVID-19 cases follow non-linear characteristics, LSTM is the most suitable algorithm for predicting their numbers. For comparison, several types of machine learning algorithms, such as naive Bayes, XG-boost, and SVM, were employed. After the comparison, LSTM was observed to be superior among all algorithms.
Collapse
|
46
|
Čolaković A, Avdagić-Golub E, Begović M, Memić B, Hasković-Džubur A. Application of machine learning in the fight against the COVID-19 pandemic: A review. ACTA FACULTATIS MEDICAE NAISSENSIS 2022. [DOI: 10.5937/afmnai39-38354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Introduction: Machine learning (ML) plays a significant role in the fight against the COVID-19 (officially known as SARS-CoV-2) pandemic. ML techniques enable the rapid detection of patterns and trends in large datasets. Therefore, ML provides efficient methods to generate knowledge from structured and unstructured data. This potential is particularly significant when the pandemic affects all aspects of human life. It is necessary to collect a large amount of data to identify methods to prevent the spread of infection, early detection, reduction of consequences, and finding appropriate medicine. Modern information and communication technologies (ICT) such as the Internet of Things (IoT) allow the collection of large amounts of data from various sources. Thus, we can create predictive ML-based models for assessments, predictions, and decisions. Methods: This is a review article based on previous studies and scientifically proven knowledge. In this paper, bibliometric data from authoritative databases of research publications (Web of Science, Scopus, PubMed) are combined for bibliometric analyses in the context of ML applications for COVID-19. Aim: This paper reviews some ML-based applications used for mitigating COVID-19. We aimed to identify and review ML potentials and solutions for mitigating the COVID-19 pandemic as well as to present some of the most commonly used ML techniques, algorithms, and datasets applied in the context of COVID-19. Also, we provided some insights into specific emerging ideas and open issues to facilitate future research. Conclusion: ML is an effective tool for diagnosing and early detection of symptoms, predicting the spread of a pandemic, developing medicines and vaccines, etc.
Collapse
|
47
|
An Overview of Supervised Machine Learning Methods and Data Analysis for COVID-19 Detection. JOURNAL OF HEALTHCARE ENGINEERING 2021; 2021:4733167. [PMID: 34853669 PMCID: PMC8629644 DOI: 10.1155/2021/4733167] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 08/16/2021] [Accepted: 10/11/2021] [Indexed: 12/16/2022]
Abstract
Methods Our analysis and machine learning algorithm is based on most cited two clinical datasets from the literature: one from San Raffaele Hospital Milan Italia and the other from Hospital Israelita Albert Einstein São Paulo Brasilia. The datasets were processed to select the best features that most influence the target, and it turned out that almost all of them are blood parameters. EDA (Exploratory Data Analysis) methods were applied to the datasets, and a comparative study of supervised machine learning models was done, after which the support vector machine (SVM) was selected as the one with the best performance. Results SVM being the best performant is used as our proposed supervised machine learning algorithm. An accuracy of 99.29%, sensitivity of 92.79%, and specificity of 100% were obtained with the dataset from Kaggle (https://www.kaggle.com/einsteindata4u/covid19) after applying optimization to SVM. The same procedure and work were performed with the dataset taken from San Raffaele Hospital (https://zenodo.org/record/3886927#.YIluB5AzbMV). Once more, the SVM presented the best performance among other machine learning algorithms, and 92.86%, 93.55%, and 90.91% for accuracy, sensitivity, and specificity, respectively, were obtained. Conclusion The obtained results, when compared with others from the literature based on these same datasets, are superior, leading us to conclude that our proposed solution is reliable for the COVID-19 diagnosis.
Collapse
|
48
|
Chang Z, Zhan Z, Zhao Z, You Z, Liu Y, Yan Z, Fu Y, Liang W, Zhao L. Application of artificial intelligence in COVID-19 medical area: a systematic review. J Thorac Dis 2021; 13:7034-7053. [PMID: 35070385 PMCID: PMC8743418 DOI: 10.21037/jtd-21-747] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 09/02/2021] [Indexed: 01/08/2023]
Abstract
BACKGROUND Coronavirus disease 2019 (COVID-19) has caused a large-scale global epidemic, impacting international politics and the economy. At present, there is no particularly effective medicine and treatment plan. Therefore, it is urgent and significant to find new technologies to diagnose early, isolate early, and treat early. Multimodal data drove artificial intelligence (AI) can potentially be the option. During the COVID-19 Pandemic, AI provided cutting-edge applications in disease, medicine, treatment, and target recognition. This paper reviewed the literature on the intersection of AI and medicine to analyze and compare different AI model applications in the COVID-19 Pandemic, evaluate their effectiveness, show their advantages and differences, and introduce the main models and their characteristics. METHODS We searched PubMed, arXiv, medRxiv, and Google Scholar through February 2020 to identify studies on AI applications in the medical areas for the COVID-19 Pandemic. RESULTS We summarize the main AI applications in six areas: (I) epidemiology, (II) diagnosis, (III) progression, (IV) treatment, (V) psychological health impact, and (VI) data security. The ongoing development in AI has significantly improved prediction, contact tracing, screening, diagnosis, treatment, medication, and vaccine development for the COVID-19 Pandemic and reducing human intervention in medical practice. DISCUSSION This paper provides strong advice for using AI-based auxiliary tools for related applications of human diseases. We also discuss the clinicians' role in the further development of AI. They and AI researchers can integrate AI technology with current clinical processes and information systems into applications. In the future, AI personnel and medical workers will further cooperate closely.
Collapse
Affiliation(s)
- Zhoulin Chang
- College of Mechanical and Electrical Engineering, Guangdong University of Science and Technology, Dongguan, China
| | - Zhiqing Zhan
- The Third Clinical College, Guangzhou Medical University, Guangzhou, China
| | - Zifan Zhao
- Nanshan College, Guangzhou Medical University, Guangzhou, China
| | - Zhixuan You
- Nanshan College, Guangzhou Medical University, Guangzhou, China
| | - Yang Liu
- School of Information Engineering, Zhengzhou University, Zhengzhou, China
| | - Zhihong Yan
- Kuangji Medical Technology (Guangdong Hengqin) Co., Ltd., Zhuhai, China
| | - Yong Fu
- Kuangji Medical Technology (Guangdong Hengqin) Co., Ltd., Zhuhai, China
| | - Wenhua Liang
- Department of Thoracic Surgery, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Lei Zhao
- Department of Physiology, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, China
| |
Collapse
|
49
|
Early Stage Identification of COVID-19 Patients in Mexico Using Machine Learning: A Case Study for the Tijuana General Hospital. INFORMATION 2021. [DOI: 10.3390/info12120490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Background: The current pandemic caused by SARS-CoV-2 is an acute illness of global concern. SARS-CoV-2 is an infectious disease caused by a recently discovered coronavirus. Most people who get sick from COVID-19 experience either mild, moderate, or severe symptoms. In order to help make quick decisions regarding treatment and isolation needs, it is useful to determine which significant variables indicate infection cases in the population served by the Tijuana General Hospital (Hospital General de Tijuana). An Artificial Intelligence (Machine Learning) mathematical model was developed in order to identify early-stage significant variables in COVID-19 patients. Methods: The individual characteristics of the study subjects included age, gender, age group, symptoms, comorbidities, diagnosis, and outcomes. A mathematical model that uses supervised learning algorithms, allowing the identification of the significant variables that predict the diagnosis of COVID-19 with high precision, was developed. Results: Automatic algorithms were used to analyze the data: for Systolic Arterial Hypertension (SAH), the Logistic Regression algorithm showed results of 91.0% in area under ROC (AUC), 80% accuracy (CA), 80% F1 and 80% Recall, and 80.1% precision for the selected variables, while for Diabetes Mellitus (DM) with the Logistic Regression algorithm it obtained 91.2% AUC, 89.2% accuracy, 88.8% F1, 89.7% precision, and 89.2% recall for the selected variables. The neural network algorithm showed better results for patients with Obesity, obtaining 83.4% AUC, 91.4% accuracy, 89.9% F1, 90.6% precision, and 91.4% recall. Conclusions: Statistical analyses revealed that the significant predictive symptoms in patients with SAH, DM, and Obesity were more substantial in fatigue and myalgias/arthralgias. In contrast, the third dominant symptom in people with SAH and DM was odynophagia.
Collapse
|
50
|
Komolafe TE, Cao Y, Nguchu BA, Monkam P, Olaniyi EO, Sun H, Zheng J, Yang X. Diagnostic Test Accuracy of Deep Learning Detection of COVID-19: A Systematic Review and Meta-Analysis. Acad Radiol 2021; 28:1507-1523. [PMID: 34649779 PMCID: PMC8445811 DOI: 10.1016/j.acra.2021.08.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Revised: 06/18/2021] [Accepted: 08/12/2021] [Indexed: 12/11/2022]
Abstract
RATIONALE AND OBJECTIVE To perform a meta-analysis to compare the diagnostic test accuracy (DTA) of deep learning (DL) in detecting coronavirus disease 2019 (COVID-19), and to investigate how network architecture and type of datasets affect DL performance. MATERIALS AND METHODS We searched PubMed, Web of Science and Inspec from January 1, 2020, to December 3, 2020, for retrospective and prospective studies on deep learning detection with at least reported sensitivity and specificity. Pooled DTA was obtained using random-effect models. Sub-group analysis between studies was also carried out for data source and network architectures. RESULTS The pooled sensitivity and specificity were 91% (95% confidence interval [CI]: 88%, 93%; I2 = 69%) and 92% (95% CI: 88%, 94%; I2 = 88%), respectively for 19 studies. The pooled AUC and diagnostic odds ratio (DOR) were 0.95 (95% CI: 0.88, 0.92) and 112.5 (95% CI: 57.7, 219.3; I2 = 90%) respectively. The overall accuracy, recall, F1-score, LR+ and LR- are 89.5%, 89.5%, 89.7%, 23.13 and 0.13. Sub-group analysis shows that the sensitivity and DOR significantly vary with the type of network architectures and sources of data with low heterogeneity are (I2 = 0%) and (I2 = 18%) for ResNet architecture and single-source datasets, respectively. CONCLUSION The diagnosis of COVID-19 via deep learning has achieved incredible performance, and the source of datasets, as well as network architectures, strongly affect DL performance.
Collapse
Affiliation(s)
- Temitope Emmanuel Komolafe
- School of Biomedical Engineering (Suzhou) (T.E.K.,Y.C., H.S.), Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China; Department of Medical Imaging (T.E.K.,Y.C., H.S., J.Z., X.Y.), Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, 215163, China; Hefei National Lab for Physical Sciences at the Microscale and Centres for Biomedical Engineering (B.A.N.), University of Science and Technology of China, Hefei, 230026, China; EasySignal Group, Department of Automation (P.M.), Tsinghua University, Beijing 100084, China; Department of Biomedical Engineering (E.O.O.), Shenzhen University, Shenzhen, 518060, China; Jinhua Laboratory (X.Y.), Foshan, 528000, China
| | - Yuzhu Cao
- School of Biomedical Engineering (Suzhou) (T.E.K.,Y.C., H.S.), Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China; Department of Medical Imaging (T.E.K.,Y.C., H.S., J.Z., X.Y.), Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, 215163, China; Hefei National Lab for Physical Sciences at the Microscale and Centres for Biomedical Engineering (B.A.N.), University of Science and Technology of China, Hefei, 230026, China; EasySignal Group, Department of Automation (P.M.), Tsinghua University, Beijing 100084, China; Department of Biomedical Engineering (E.O.O.), Shenzhen University, Shenzhen, 518060, China; Jinhua Laboratory (X.Y.), Foshan, 528000, China
| | - Benedictor Alexander Nguchu
- School of Biomedical Engineering (Suzhou) (T.E.K.,Y.C., H.S.), Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China; Department of Medical Imaging (T.E.K.,Y.C., H.S., J.Z., X.Y.), Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, 215163, China; Hefei National Lab for Physical Sciences at the Microscale and Centres for Biomedical Engineering (B.A.N.), University of Science and Technology of China, Hefei, 230026, China; EasySignal Group, Department of Automation (P.M.), Tsinghua University, Beijing 100084, China; Department of Biomedical Engineering (E.O.O.), Shenzhen University, Shenzhen, 518060, China; Jinhua Laboratory (X.Y.), Foshan, 528000, China
| | - Patrice Monkam
- School of Biomedical Engineering (Suzhou) (T.E.K.,Y.C., H.S.), Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China; Department of Medical Imaging (T.E.K.,Y.C., H.S., J.Z., X.Y.), Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, 215163, China; Hefei National Lab for Physical Sciences at the Microscale and Centres for Biomedical Engineering (B.A.N.), University of Science and Technology of China, Hefei, 230026, China; EasySignal Group, Department of Automation (P.M.), Tsinghua University, Beijing 100084, China; Department of Biomedical Engineering (E.O.O.), Shenzhen University, Shenzhen, 518060, China; Jinhua Laboratory (X.Y.), Foshan, 528000, China
| | - Ebenezer Obaloluwa Olaniyi
- School of Biomedical Engineering (Suzhou) (T.E.K.,Y.C., H.S.), Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China; Department of Medical Imaging (T.E.K.,Y.C., H.S., J.Z., X.Y.), Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, 215163, China; Hefei National Lab for Physical Sciences at the Microscale and Centres for Biomedical Engineering (B.A.N.), University of Science and Technology of China, Hefei, 230026, China; EasySignal Group, Department of Automation (P.M.), Tsinghua University, Beijing 100084, China; Department of Biomedical Engineering (E.O.O.), Shenzhen University, Shenzhen, 518060, China; Jinhua Laboratory (X.Y.), Foshan, 528000, China
| | - Haotian Sun
- School of Biomedical Engineering (Suzhou) (T.E.K.,Y.C., H.S.), Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China; Department of Medical Imaging (T.E.K.,Y.C., H.S., J.Z., X.Y.), Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, 215163, China; Hefei National Lab for Physical Sciences at the Microscale and Centres for Biomedical Engineering (B.A.N.), University of Science and Technology of China, Hefei, 230026, China; EasySignal Group, Department of Automation (P.M.), Tsinghua University, Beijing 100084, China; Department of Biomedical Engineering (E.O.O.), Shenzhen University, Shenzhen, 518060, China; Jinhua Laboratory (X.Y.), Foshan, 528000, China
| | - Jian Zheng
- School of Biomedical Engineering (Suzhou) (T.E.K.,Y.C., H.S.), Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China; Department of Medical Imaging (T.E.K.,Y.C., H.S., J.Z., X.Y.), Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, 215163, China; Hefei National Lab for Physical Sciences at the Microscale and Centres for Biomedical Engineering (B.A.N.), University of Science and Technology of China, Hefei, 230026, China; EasySignal Group, Department of Automation (P.M.), Tsinghua University, Beijing 100084, China; Department of Biomedical Engineering (E.O.O.), Shenzhen University, Shenzhen, 518060, China; Jinhua Laboratory (X.Y.), Foshan, 528000, China
| | - Xiaodong Yang
- School of Biomedical Engineering (Suzhou) (T.E.K.,Y.C., H.S.), Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, China; Department of Medical Imaging (T.E.K.,Y.C., H.S., J.Z., X.Y.), Suzhou Institute of Biomedical Engineering and Technology, Chinese Academy of Sciences, Suzhou, 215163, China; Hefei National Lab for Physical Sciences at the Microscale and Centres for Biomedical Engineering (B.A.N.), University of Science and Technology of China, Hefei, 230026, China; EasySignal Group, Department of Automation (P.M.), Tsinghua University, Beijing 100084, China; Department of Biomedical Engineering (E.O.O.), Shenzhen University, Shenzhen, 518060, China; Jinhua Laboratory (X.Y.), Foshan, 528000, China.
| |
Collapse
|