1
|
Zhang Y, Gao H. Development and nursing application of kidney disease prediction models based on machine learning. Comput Methods Biomech Biomed Engin 2025:1-12. [PMID: 40125897 DOI: 10.1080/10255842.2025.2479856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Revised: 02/13/2025] [Accepted: 03/10/2025] [Indexed: 03/25/2025]
Abstract
Kidney diseases complicate treatment prediction and progression. This study introduces a Metaheuristic Red Fox-Optimized Agile Support Vector Machine (MRFO-ASVM) for early detection and prognosis of kidney diseases. Nurses' involvement in data collection and analysis enhances model effectiveness. Pre-processing with Min-Max normalization and feature extraction using Principal Component Analysis (PCA) improves data quality. The MRFO-ASVM obtained enhanced parameter performance of the model including high accuracy (0.92), F1-score (0.67), sensitivity (0.89), precision (0.63), and ROC-AUC (0.99). Integrating this technology into nursing practice enhances early detection and personalized care, advancing patient-centred healthcare solutions.
Collapse
Affiliation(s)
- Yan Zhang
- Department of Blood Purification Center, Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Tongji Shanxi Hospital, Third Hospital of Shanxi Medical University, Taiyuan, China
| | - Hui Gao
- Department of Blood Purification Center, Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Tongji Shanxi Hospital, Third Hospital of Shanxi Medical University, Taiyuan, China
| |
Collapse
|
2
|
Yoshizaki Y, Kato K, Fujihara K, Sone H, Akazawa K. Development of a machine learning tool to predict the risk of incident chronic kidney disease using health examination data. Front Public Health 2024; 12:1495054. [PMID: 39555038 PMCID: PMC11566449 DOI: 10.3389/fpubh.2024.1495054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Accepted: 10/17/2024] [Indexed: 11/19/2024] Open
Abstract
Background Chronic kidney disease (CKD) is characterized by a decreased glomerular filtration rate or renal injury (especially proteinuria) for at least 3 months. The early detection and treatment of CKD, a major global public health concern, before the onset of symptoms is important. This study aimed to develop machine learning models to predict the risk of developing CKD within 1 and 5 years using health examination data. Methods Data were collected from patients who underwent annual health examinations between 2017 and 2022. Among the 30,273 participants included in the study, 1,372 had CKD. Demographic characteristics, body mass index, blood pressure, blood and urine test results, and questionnaire responses were used to predict the risk of CKD development at 1 and 5 years. This study examined three outcomes: incident estimated glomerular filtration rate (eGFR) <60 mL/min/1.73 m2, the development of proteinuria, and incident eGFR <60 mL/min/1.73 m2 or the development of proteinuria. Logistic regression (LR), conditional logistic regression, neural network, and recurrent neural network were used to develop the prediction models. Results All models had predictive values, sensitivities, and specificities >0.8 for predicting the onset of CKD in 1 year when the outcome was eGFR <60 mL/min/1.73 m2. The area under the receiver operating characteristic curve (AUROC) was >0.9. With LR and a neural network, the specificities were 0.749 and 0.739 and AUROCs were 0.889 and 0.890, respectively, for predicting onset within 5 years. The AUROCs of most models were approximately 0.65 when the outcome was eGFR <60 mL/min/1.73 m2 or proteinuria. The predictive performance of all models exhibited a significant decrease when eGFR was not included as an explanatory variable (AUROCs: 0.498-0.732). Conclusion Machine learning models can predict the risk of CKD, and eGFR plays a crucial role in predicting the onset of CKD. However, it is difficult to predict the onset of proteinuria based solely on health examination data. Further studies must be conducted to predict the decline in eGFR and increase in urine protein levels.
Collapse
Affiliation(s)
- Yuki Yoshizaki
- Department of Medical Informatics and Statistics, Niigata University Graduate School of Medical and Dental Sciences, Niigata, Japan
| | - Kiminori Kato
- Department of Prevention of Noncommunicable Diseases and Promotion of Health Checkup, Niigata University Graduate School of Medical and Dental Sciences, Niigata, Japan
| | - Kazuya Fujihara
- Department of Hematology, Endocrinology and Metabolism, Niigata University Graduate School of Medical and Dental Sciences, Niigata, Japan
| | - Hirohito Sone
- Department of Hematology, Endocrinology and Metabolism, Niigata University Graduate School of Medical and Dental Sciences, Niigata, Japan
| | - Kohei Akazawa
- Department of Medical Informatics, Niigata University Medical and Dental Hospital, Niigata, Japan
| |
Collapse
|
3
|
Chang CY, Peng CH, Chen FY, Huang LY, Kuo CH, Chu TW, Liang YJ. The risk factors determined by four machine learning methods for the change of difference of bone mineral density in post-menopausal women after three years follow-up. Sci Rep 2024; 14:23234. [PMID: 39369003 PMCID: PMC11455928 DOI: 10.1038/s41598-024-73799-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 09/20/2024] [Indexed: 10/07/2024] Open
Abstract
The prevalence of osteoporosis has drastically increased recently. It is not only the most frequent but is also a major global public health problem due to its high morbidity. There are many risk factors associated with osteoporosis were identified. However, most studies have used the traditional multiple linear regression (MLR) to explore their relationships. Recently, machine learning (Mach-L) has become a new modality for data analysis because it enables machine to learn from past data or experiences without being explicitly programmed and could capture nonlinear relationships better. These methods have the potential to outperform conventional MLR in disease prediction. In the present study, we enrolled a Chinese post-menopause cohort followed up for 4 years. The difference of T-score (δ-T score) was the dependent variable. Information such as demographic, biochemistry and life styles were the independent variables. Our goals were: (1) Compare the prediction accuracy between Mach-L and traditional MLR for δ-T score. (2) Rank the importance of risk factors (independent variables) for prediction of δ T-score. Totally, there were 1698 postmenopausal women were enrolled from MJ Health Database. Four different Mach-L methods namely, Random forest (RF), eXtreme Gradient Boosting (XGBoost), Naïve Bayes (NB), and stochastic gradient boosting (SGB), to construct predictive models for predicting δ-BMD after four years follow-up. The dataset was then randomly divided into an 80% training dataset for model building and a 20% testing dataset for model testing. A 10-fold cross-validation technique for hyperparameter tuning was used. The model with the lowest root mean square error for the validation dataset was viewed as the best model for each ML method. The averaged metrics of the RF, SGB, NB, and XGBoost models were used to compare the model performance of the benchmark MLR model that used the same training and testing dataset as the Mach-L methods. We defined that the priority demonstrated in each model ranked 1 as the most critical risk factor and 22 as the last selected risk factor. For Pearson correlation, age, education, BMI, HDL-C, and TSH were positively and plasma calcium level, and baseline T-score were negatively correlated with δ-T score. All four Mach-L methods yielded lower prediction errors than the MLR method and were all convincing Mach-L models. From our results, it could be noted that education level is the most important factor for δ-T Score, followed by DBP, smoking, SBP, UA, age, and LDL-C. All four Mach-L outperformed traditional MLR. By using Mach-L, the most important six risk factors were selected which are, from the most important to the least: DBP, SBP, UA, education level, TG and sleeping hour. δ T score was positively related to SBP, education level, UA and TG and negatively related to DBP and sleeping hour in postmenopausal Chinese women.
Collapse
Affiliation(s)
- Ching-Yao Chang
- Graduate Institute of Applied Science and Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, ROC
| | - Chung-Hsin Peng
- Department of Urology, Cardinal Tien Hospital, School of Medicine, Fu-Jen Catholic University, New Taipei City, Taiwan, ROC
| | - Fang-Yu Chen
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Fu Jen Catholic University Hospital, New Taipei City, Taiwan, ROC
| | - Li-Ying Huang
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Department of Medical Education, Fu Jen Catholic University Hospital, School of Medicine, College of Medicine, Fu Jen Catholic University, New Taipei City, Taiwan, ROC
| | - Chun-Heng Kuo
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Fu Jen Catholic University Hospital, School of Medicine, College of Medicine, Fu Jen Catholic University, New Taipei City, Taiwan, ROC
| | - Ta-Wei Chu
- Department of Obstetrics and Gynecology, Tri-Service General Hospital, National Defense Medical Center, Chief Executive Officer's Office, MJ Health Research Foundation, Taipei, 114, Taiwan
| | - Yao-Jen Liang
- Graduate Institute of Applied Science and Engineering, Fu Jen Catholic University, New Taipei City, Taiwan, ROC.
- Department and Institute of Life Science, Fu Jen Catholic University, New Taipei City, Taiwan, ROC.
| |
Collapse
|
4
|
Liu P, Liu Y, Liu H, Xiong L, Mei C, Yuan L. A Random Forest Algorithm for Assessing Risk Factors Associated With Chronic Kidney Disease: Observational Study. Asian Pac Isl Nurs J 2024; 8:e48378. [PMID: 38830204 PMCID: PMC11184270 DOI: 10.2196/48378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 02/02/2024] [Accepted: 04/16/2024] [Indexed: 06/05/2024] Open
Abstract
BACKGROUND The prevalence and mortality rate of chronic kidney disease (CKD) are increasing year by year, and it has become a global public health issue. The economic burden caused by CKD is increasing at a rate of 1% per year. CKD is highly prevalent and its treatment cost is high but unfortunately remains unknown. Therefore, early detection and intervention are vital means to mitigate the treatment burden on patients and decrease disease progression. OBJECTIVE In this study, we investigated the advantages of using the random forest (RF) algorithm for assessing risk factors associated with CKD. METHODS We included 40,686 people with complete screening records who underwent screening between January 1, 2015, and December 22, 2020, in Jing'an District, Shanghai, China. We grouped the participants into those with and those without CKD by staging based on the glomerular filtration rate staging and grouping based on albuminuria. Using a logistic regression model, we determined the relationship between CKD and risk factors. The RF machine learning algorithm was used to score the predictive variables and rank them based on their importance to construct a prediction model. RESULTS The logistic regression model revealed that gender, older age, obesity, abnormal index estimated glomerular filtration rate, retirement status, and participation in urban employee medical insurance were significantly associated with the risk of CKD. On RF algorithm-based screening, the top 4 factors influencing CKD were age, albuminuria, working status, and urinary albumin-creatinine ratio. The RF model predicted an area under the receiver operating characteristic curve of 93.15%. CONCLUSIONS Our findings reveal that the RF algorithm has significant predictive value for assessing risk factors associated with CKD and allows the screening of individuals with risk factors. This has crucial implications for early intervention and prevention of CKD.
Collapse
Affiliation(s)
- Pei Liu
- Department of Mathematics and Physics, Second Military Medical University, Shanghai, China
| | - Yijun Liu
- Department of Health Management, Second Military Medical University, Shanghai, China
| | - Hao Liu
- Faculty of Health Service, Second Military Medical University, Shanghai, China
| | - Linping Xiong
- Department of Health Management, Second Military Medical University, Shanghai, China
| | - Changlin Mei
- Nephrology Department, Shanghai Changzheng Hospital, Shanghai, China
| | - Lei Yuan
- Department of Health Management, Second Military Medical University, Shanghai, China
| |
Collapse
|
5
|
Chen MS, Liu TC, Jhou MJ, Yang CT, Lu CJ. Analyzing Longitudinal Health Screening Data with Feature Ensemble and Machine Learning Techniques: Investigating Diagnostic Risk Factors of Metabolic Syndrome for Chronic Kidney Disease Stages 3a to 3b. Diagnostics (Basel) 2024; 14:825. [PMID: 38667472 PMCID: PMC11048899 DOI: 10.3390/diagnostics14080825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 04/12/2024] [Accepted: 04/13/2024] [Indexed: 04/28/2024] Open
Abstract
Longitudinal data, while often limited, contain valuable insights into features impacting clinical outcomes. To predict the progression of chronic kidney disease (CKD) in patients with metabolic syndrome, particularly those transitioning from stage 3a to 3b, where data are scarce, utilizing feature ensemble techniques can be advantageous. It can effectively identify crucial risk factors, influencing CKD progression, thereby enhancing model performance. Machine learning (ML) methods have gained popularity due to their ability to perform feature selection and handle complex feature interactions more effectively than traditional approaches. However, different ML methods yield varying feature importance information. This study proposes a multiphase hybrid risk factor evaluation scheme to consider the diverse feature information generated by ML methods. The scheme incorporates variable ensemble rules (VERs) to combine feature importance information, thereby aiding in the identification of important features influencing CKD progression and supporting clinical decision making. In the proposed scheme, we employ six ML models-Lasso, RF, MARS, LightGBM, XGBoost, and CatBoost-each renowned for its distinct feature selection mechanisms and widespread usage in clinical studies. By implementing our proposed scheme, thirteen features affecting CKD progression are identified, and a promising AUC score of 0.883 can be achieved when constructing a model with them.
Collapse
Affiliation(s)
- Ming-Shu Chen
- Department of Healthcare Administration, College of Healthcare & Management, Asia Eastern University of Science and Technology, New Taipei City 220, Taiwan
| | - Tzu-Chi Liu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242, Taiwan
| | - Mao-Jhen Jhou
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242, Taiwan
| | - Chih-Te Yang
- Department of Business Administration, Tamkang University, New Taipei City 251, Taiwan
| | - Chi-Jie Lu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242, Taiwan
- Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City 242, Taiwan
- Department of Information Management, Fu Jen Catholic University, New Taipei City 242, Taiwan
| |
Collapse
|
6
|
Zheng J, Guo AH, Xue BW, Wu SY, Wang XX, Jing YJ, Zhai LJ, Liu R. Exploring patient delay in people with chronic kidney disease: A cross-sectional study. Medicine (Baltimore) 2024; 103:e37077. [PMID: 38363926 PMCID: PMC10869059 DOI: 10.1097/md.0000000000037077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 01/05/2024] [Indexed: 02/18/2024] Open
Abstract
To examine the factors that contribute to patient delays among individuals with chronic kidney disease (CKD) and offer insights to help develop specific risk management strategies. Conducted as a cross-sectional study between September 2021 and April 2022, this study used a convenient sampling technique to select 245 individuals diagnosed with CKD from a Grade 3 Class A hospital located in Shanxi Province. These individuals were chosen as the subjects of the study. The research participants underwent an investigation using several assessment tools, including socio-demographic information questionnaire, medical behavior, the social support rating scale, the simplified coping style questionnaire, and the General Self-efficacy Scale. The study revealed that 35.4% of individuals with CKD experienced patient delay (the interval between the initial onset and the time of seeking medical attention being longer than or equal to 3 months). Through a multifactorial logistic regression analysis, it was determined that various factors independently influenced patient delay in patients with CKD. These factors included the level of knowledge about CKD, educational level, frequency of attending physical examinations, severity of initial symptoms, social support, self-efficacy, positive coping, and negative coping. Numerous factors contribute to the Patient Delay. To effectively enhance awareness and coping abilities regarding CKD in high-risk groups, it is essential to implement focused and continuous interventions throughout the medical seeking process.
Collapse
Affiliation(s)
- Jie Zheng
- Nursing College of Shanxi Medical University, Shanxi, China
| | - Ao-Han Guo
- Nursing College of Shanxi Medical University, Shanxi, China
| | - Bo-Wen Xue
- Nursing College, Hangzhou Normal University, Hangzhou, China
| | - Shu-Yan Wu
- Nursing College of Shanxi Medical University, Shanxi, China
| | | | - Yue-Juan Jing
- The Second Hospital of Shanxi Medicine University, Shanxi, China
| | - Lin-Jun Zhai
- Nursing College of Shanxi Medical University, Shanxi, China
| | - Rong Liu
- Nursing College of Shanxi Medical University, Shanxi, China
| |
Collapse
|
7
|
Zheng JX, Li X, Zhu J, Guan SY, Zhang SX, Wang WM. Interpretable machine learning for predicting chronic kidney disease progression risk. Digit Health 2024; 10:20552076231224225. [PMID: 38235416 PMCID: PMC10793198 DOI: 10.1177/20552076231224225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/15/2023] [Indexed: 01/19/2024] Open
Abstract
Objective Chronic kidney disease (CKD) poses a major global health burden. Early CKD risk prediction enables timely interventions, but conventional models have limited accuracy. Machine learning (ML) enhances prediction, but interpretability is needed to support clinical usage with both in diagnostic and decision-making. Methods A cohort of 491 patients with clinical data was collected for this study. The dataset was randomly split into an 80% training set and a 20% testing set. To achieve the first objective, we developed four ML algorithms (logistic regression, random forests, neural networks, and eXtreme Gradient Boosting (XGBoost)) to classify patients into two classes-those who progressed to CKD stages 3-5 during follow-up (positive class) and those who did not (negative class). For the classification task, the area under the receiver operating characteristic curve (AUC-ROC) was used to evaluate model performance in discriminating between the two classes. For survival analysis, Cox proportional hazards regression (COX) and random survival forests (RSFs) were employed to predict CKD progression, and the concordance index (C-index) and integrated Brier score were used for model evaluation. Furthermore, variable importance, partial dependence plots, and restrict cubic splines were used to interpret the models' results. Results XGBOOST demonstrated the best predictive performance for CKD progression in the classification task, with an AUC-ROC of 0.867 (95% confidence interval (CI): 0.728-0.100), outperforming the other ML algorithms. In survival analysis, RSF showed slightly better discrimination and calibration on the test set compared to COX, indicating better generalization to new data. Variable importance analysis identified estimated glomerular filtration rate, age, and creatinine as the most important predictors for CKD survival analysis. Further analysis revealed non-linear associations between age and CKD progression, suggesting higher risks in patients aged 52-55 and 65-66 years. The association between cholesterol levels and CKD progression was also non-linear, with lower risks observed when cholesterol levels were in the range of 5.8-6.4 mmol/L. Conclusions Our study demonstrated the effectiveness of interpretable ML models for predicting CKD progression. The comparison between COX and RSF highlighted the advantages of ML in survival analysis, particularly in handling non-linearity and high-dimensional data. By leveraging interpretable ML for unraveling risk factor relationships, contrasting predictive techniques, and exposing non-linear associations, this study significantly advances CKD risk prediction to enable enhanced clinical decision-making.
Collapse
Affiliation(s)
- Jin-Xin Zheng
- Department of Nephrology, Ruijin Hospital, Institute of Nephrology, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Xin Li
- Department of Nephrology, Ruijin Hospital, Institute of Nephrology, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Jiang Zhu
- Liver Transplantation Center, West China Hospital, Sichuan University, Chengdu, China
| | - Shi-Yang Guan
- Department of Statistics, Second Affiliated Hospital of Anhui Medical University, Hefei, Anhui, China
| | - Shun-Xian Zhang
- School of Global Health, Chinese Center for Tropical Diseases Research – Shanghai Jiao Tong University School of Medicine, Shanghai, China
- Clinical Research Center, Longhua Hospital, Shanghai University of Traditional Chinese Medicine, Shanghai, China
| | - Wei-Ming Wang
- Department of Nephrology, Ruijin Hospital, Institute of Nephrology, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| |
Collapse
|
8
|
Chang CC, Yeh JH, Chiu HC, Liu TC, Chen YM, Jhou MJ, Lu CJ. Assessing the length of hospital stay for patients with myasthenia gravis based on the data mining MARS approach. Front Neurol 2023; 14:1283214. [PMID: 38156090 PMCID: PMC10752965 DOI: 10.3389/fneur.2023.1283214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 11/27/2023] [Indexed: 12/30/2023] Open
Abstract
Predicting the length of hospital stay for myasthenia gravis (MG) patients is challenging due to the complex pathogenesis, high clinical variability, and non-linear relationships between variables. Considering the management of MG during hospitalization, it is important to conduct a risk assessment to predict the length of hospital stay. The present study aimed to successfully predict the length of hospital stay for MG based on an expandable data mining technique, multivariate adaptive regression splines (MARS). Data from 196 MG patients' hospitalization were analyzed, and the MARS model was compared with classical multiple linear regression (MLR) and three other machine learning (ML) algorithms. The average hospital stay duration was 12.3 days. The MARS model, leveraging its ability to capture non-linearity, identified four significant factors: disease duration, age at admission, MGFA clinical classification, and daily prednisolone dose. Cut-off points and correlation curves were determined for these risk factors. The MARS model outperformed the MLR and the other ML methods (including least absolute shrinkage and selection operator MLR, classification and regression tree, and random forest) in assessing hospital stay length. This is the first study to utilize data mining methods to explore factors influencing hospital stay in patients with MG. The results highlight the effectiveness of the MARS model in identifying the cut-off points and correlation for risk factors associated with MG hospitalization. Furthermore, a MARS-based formula was developed as a practical tool to assist in the measurement of hospital stay, which can be feasibly supported as an extension of clinical risk assessment.
Collapse
Affiliation(s)
- Che-Cheng Chang
- Department of Neurology, Fu Jen Catholic University Hospital, Fu Jen Catholic University, New Taipei City, Taiwan
- PhD Program in Nutrition and Food Science, Fu Jen Catholic University, New Taipei City, Taiwan
| | - Jiann-Horng Yeh
- School of Medicine, Fu Jen Catholic University, New Taipei City, Taiwan
- Department of Neurology, Shin Kong Wu Ho-Su Memorial Hospital, Taipei City, Taiwan
- Department of Neurology, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Hou-Chang Chiu
- School of Medicine, Fu Jen Catholic University, New Taipei City, Taiwan
- Department of Neurology, Taipei Medical University, Shuang-Ho Hospital, New Taipei City, Taiwan
| | - Tzu-Chi Liu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City, Taiwan
| | - Yen-Ming Chen
- Department of Neurology, Fu Jen Catholic University Hospital, Fu Jen Catholic University, New Taipei City, Taiwan
| | - Mao-Jhen Jhou
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City, Taiwan
| | - Chi-Jie Lu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City, Taiwan
- Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City, Taiwan
- Department of Information Management, Fu Jen Catholic University, New Taipei City, Taiwan
| |
Collapse
|
9
|
Wang CK, Chang CY, Chu TW, Liang YJ. Using Machine Learning to Identify the Relationships between Demographic, Biochemical, and Lifestyle Parameters and Plasma Vitamin D Concentration in Healthy Premenopausal Chinese Women. Life (Basel) 2023; 13:2257. [PMID: 38137858 PMCID: PMC10744461 DOI: 10.3390/life13122257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 11/15/2023] [Accepted: 11/22/2023] [Indexed: 12/24/2023] Open
Abstract
INTRODUCTION Vitamin D plays a vital role in maintaining homeostasis and enhancing the absorption of calcium, an essential component for strengthening bones and preventing osteoporosis. There are many factors known to relate to plasma vitamin D concentration (PVDC). However, most of these studies were performed with traditional statistical methods. Nowadays, machine learning methods (Mach-L) have become new tools in medical research. In the present study, we used four Mach-L methods to explore the relationships between PVDC and demographic, biochemical, and lifestyle factors in a group of healthy premenopausal Chinese women. Our goals were as follows: (1) to evaluate and compare the predictive accuracy of Mach-L and MLR, and (2) to establish a hierarchy of the significance of the aforementioned factors related to PVDC. METHODS Five hundred ninety-three healthy Chinese women were enrolled. In total, there were 35 variables recorded, including demographic, biochemical, and lifestyle information. The dependent variable was 25-OH vitamin D (PVDC), and all other variables were the independent variables. Multiple linear regression (MLR) was regarded as the benchmark for comparison. Four Mach-L methods were applied (random forest (RF), stochastic gradient boosting (SGB), extreme gradient boosting (XGBoost), and elastic net). Each method would produce several estimation errors. The smaller these errors were, the better the model was. RESULTS Pearson's correlation, age, glycated hemoglobin, HDL-cholesterol, LDL-cholesterol, and hemoglobin were positively correlated to PVDC, whereas eGFR was negatively correlated to PVDC. The Mach-L methods yielded smaller estimation errors for all five parameters, which indicated that they were better methods than the MLR model. After averaging the importance percentage from the four Mach-L methods, a rank of importance could be obtained. Age was the most important factor, followed by plasma insulin level, TSH, spouse status, LDH, and ALP. CONCLUSIONS In a healthy Chinese premenopausal cohort using four different Mach-L methods, age was found to be the most important factor related to PVDC, followed by plasma insulin level, TSH, spouse status, LDH, and ALP.
Collapse
Affiliation(s)
- Chun-Kai Wang
- Department of Obstetrics and Gynecology, Zuoying Branch of Kaohsiung Armed Forces General Hospital, Kaohsiung 813, Taiwan;
| | - Ching-Yao Chang
- Graduate Institute of Applied Science and Engineering, Fu Jen Catholic University, New Taipei City 242, Taiwan;
| | - Ta-Wei Chu
- Department of Obstetrics and Gynecology, Tri-Service General Hospital, National Defense Medical Center, Chief Executive Officer’s Office, MJ Health Research Foundation, Taipei 114, Taiwan;
| | - Yao-Jen Liang
- Graduate Institute of Applied Science and Engineering, Fu Jen Catholic University, New Taipei City 242, Taiwan;
| |
Collapse
|
10
|
Yang CC, Peng CH, Huang LY, Chen FY, Kuo CH, Wu CZ, Hsia TL, Lin CY. Comparison between multiple logistic regression and machine learning methods in prediction of abnormal thallium scans in type 2 diabetes. World J Clin Cases 2023; 11:7951-7964. [DOI: 10.12998/wjcc.v11.i33.7951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 10/23/2023] [Accepted: 11/13/2023] [Indexed: 11/24/2023] Open
Abstract
BACKGROUND The prevalence of type 2 diabetes (T2D) has been increasing dramatically in recent decades, and 47.5% of T2D patients will die of cardiovascular disease. Thallium-201 myocardial perfusion scan (MPS) is a precise and non-invasive method to detect coronary artery disease (CAD). Most previous studies used traditional logistic regression (LGR) to evaluate the risks for abnormal CAD. Rapidly developing machine learning (Mach-L) techniques could potentially outperform LGR in capturing non-linear relationships.
AIM To aims were: (1) Compare the accuracy of Mach-L methods and LGR; and (2) Found the most important factors for abnormal TMPS.
METHODS 556 T2D were enrolled in the study (287 men and 269 women). Demographic and biochemistry data were used as independent variables and the sum of stressed score derived from MPS scan was the dependent variable. Subjects with a MPS score ≥ 9 were defined as abnormal. In addition to traditional LGR, classification and regression tree (CART), random forest, Naïve Bayes, and eXtreme gradient boosting were also applied. Sensitivity, specificity, accuracy and area under the receiver operation curve were used to evaluate the respective accuracy of LGR and Mach-L methods.
RESULTS Except for CART, the other Mach-L methods outperformed LGR, with gender, body mass index, age, low-density lipoprotein cholesterol, glycated hemoglobin and smoking emerging as the most important factors to predict abnormal MPS.
CONCLUSION Four Mach-L methods are found to outperform LGR in predicting abnormal TMPS in Chinese T2D, with the most important risk factors being gender, body mass index, age, low-density lipoprotein cholesterol, glycated hemoglobin and smoking.
Collapse
Affiliation(s)
- Chung-Chi Yang
- Division of Cardiovascular Medicine, Taoyuan Armed Forces General Hospital, Taoyuan City 32551, Taiwan
- Division of Cardiovascular, Tri-service General Hospital, Taipei City 114202, Taiwan
| | - Chung-Hsin Peng
- Department of Urology, Cardinal Tien Hospital, New Taipei City 23148, Taiwan
- School of Medicine, Fu-Jen Catholic University, New Taipei City 242062, Taiwan
| | - Li-Ying Huang
- Department of Internal Medicine, Department of Medical Education, School of Medicine, Fu Jen Catholic University Hospital, New Taipei City 243, Taiwan
- School of Medicine, College of Medicine, Fu Jen Catholic University, New Taipei City 243, Taiwan
| | - Fang Yu Chen
- Department of Endocrinology, Fu Jen Catholic University Hospital, New Taipei City 243, Taiwan
| | - Chun-Heng Kuo
- School of Medicine, College of Medicine, Fu Jen Catholic University, New Taipei City 243, Taiwan
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Fu Jen Catholic University Hospital, New Taipei City 243, Taiwan
| | - Chung-Ze Wu
- Division of Endocrinology, Shuang Ho Hospital, New Taipei City 23561, Taiwan
- School of Medicine, Taipei Medical University, Taipei City 11031, Taiwan
| | - Te-Lin Hsia
- Department of Internal Medicine, Cardinal Tien Hospital, New Taipei City 23148, Taiwan
| | - Chung-Yu Lin
- Department of Cardiology, Fu Jen Catholic University Hospital, New Taipei City 24352, Taiwan
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, Taiwan
| |
Collapse
|
11
|
Huang HH, Lu CJ, Jhou MJ, Liu TC, Yang CT, Hsieh SJ, Yang WJ, Chang HC, Chen MS. Using a Decision Tree Algorithm Predictive Model for Sperm Count Assessment and Risk Factors in Health Screening Population. Risk Manag Healthc Policy 2023; 16:2469-2478. [PMID: 38024496 PMCID: PMC10658962 DOI: 10.2147/rmhp.s433193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 11/01/2023] [Indexed: 12/01/2023] Open
Abstract
Purpose Approximately 20% of couples face infertility challenges and struggle to conceive naturally. Despite advances in artificial reproduction, its success hinges on sperm quality. Our previous study used five machine learning (ML) algorithms, random forest, stochastic gradient boosting, least absolute shrinkage and selection operator regression, ridge regression, and extreme gradient boosting, to model health data from 1375 Taiwanese males and identified ten risk factors affecting sperm count. Methods We employed the CART algorithm to generate decision trees using identified risk factors to predict healthy sperm counts. Four error metrics, SMAPE, RAE, RRSE, and RMSE, were used to evaluate the decision trees. We identified the top five decision trees based on their low errors and discussed in detail the tree with the least error. Results The decision tree featuring the least error, comprising BMI, UA, ST, T-Cho/HDL-C ratio, and BUN, corroborated the negative impacts of metabolic syndrome, particularly high BMI, on sperm count, while emphasizing the link between good sleep and male fertility. Our study also sheds light on the potentially significant influence of high BUN on spermatogenesis. Two novel risk factors, T-Cho/HDL-C and UA, warrant further investigation. Conclusion The ML algorithm established a predictive model for healthcare personnel to assess low sperm counts. Refinement of the model using additional data is crucial for improved precision. The risk factors identified offer avenues for future investigations.
Collapse
Affiliation(s)
- Hung-Hsiang Huang
- Department of Urology, Surgery, Far Eastern Memorial Hospital, New Taipei City, 220, Taiwan
| | - Chi-Jie Lu
- Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City, 242, Taiwan
- Department of Information Management, Fu Jen Catholic University, New Taipei City, 242, Taiwan
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City, 242, Taiwan
| | - Mao-Jhen Jhou
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City, 242, Taiwan
| | - Tzu-Chi Liu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City, 242, Taiwan
| | - Chih-Te Yang
- Department of Business Administration, Tamkang University, New Taipei City, 251, Taiwan
| | - Shang-Ju Hsieh
- Department of Urology, Surgery, Far Eastern Memorial Hospital, New Taipei City, 220, Taiwan
| | - Wen-Jen Yang
- Health Screening Center, Chi Hsin Clinic, Taipei City, 104, Taiwan
| | - Hsiao-Chun Chang
- Department of Urology, Surgery, Far Eastern Memorial Hospital, New Taipei City, 220, Taiwan
| | - Ming-Shu Chen
- Department of Healthcare Administration, Asia Eastern University of Science and Technology, New Taipei City, 220, Taiwan
| |
Collapse
|
12
|
Tzou SJ, Peng CH, Huang LY, Chen FY, Kuo CH, Wu CZ, Chu TW. Comparison between linear regression and four different machine learning methods in selecting risk factors for osteoporosis in a Chinese female aged cohort. J Chin Med Assoc 2023; 86:1028-1036. [PMID: 37729604 DOI: 10.1097/jcma.0000000000000999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 09/22/2023] Open
Abstract
BACKGROUND Population aging is emerging as an increasingly acute challenge for countries around the world. One particular manifestation of this phenomenon is the impact of osteoporosis on individuals and national health systems. Previous studies of risk factors for osteoporosis were conducted using traditional statistical methods, but more recent efforts have turned to machine learning approaches. Most such efforts, however, treat the target variable (bone mineral density [BMD] or fracture rate) as a categorical one, which provides no quantitative information. The present study uses five different machine learning methods to analyze the risk factors for T-score of BMD, seeking to (1) compare the prediction accuracy between different machine learning methods and traditional multiple linear regression (MLR) and (2) rank the importance of 25 different risk factors. METHODS The study sample includes 24 412 women older than 55 years with 25 related variables, applying traditional MLR and five different machine learning methods: classification and regression tree, Naïve Bayes, random forest, stochastic gradient boosting, and eXtreme gradient boosting. The metrics used for model performance comparisons are the symmetric mean absolute percentage error, relative absolute error, root relative squared error, and root mean squared error. RESULTS Machine learning approaches outperformed MLR for all four prediction errors. The average importance ranking of each factor generated by the machine learning methods indicates that age is the most important factor determining T-score, followed by estimated glomerular filtration rate (eGFR), body mass index (BMI), uric acid (UA), and education level. CONCLUSION In a group of women older than 55 years, we demonstrated that machine learning methods provide superior performance in estimating T-Score, with age being the most important impact factor, followed by eGFR, BMI, UA, and education level.
Collapse
Affiliation(s)
- Shiow-Jyu Tzou
- Teaching and Researching Center, Kaohsiung Armed Forces General Hospital, Kaohsiung, Taiwan, ROC
- Institute of Medical Science and Technology, National Sun Yat-sen University, Kaohsiung, Taiwan, ROC
| | - Chung-Hsin Peng
- Department of Urology, Cardinal Tien Hospital, School of Medicine, Fu Jen Catholic University, New Taipei City, Taiwan, ROC
| | - Li-Ying Huang
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Fu Jen Catholic University Hospital, New Taipei, Taiwan
| | - Fang-Yu Chen
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Fu Jen Catholic University Hospital, New Taipei, Taiwan
| | - Chun-Heng Kuo
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Fu Jen Catholic University Hospital, New Taipei, Taiwan
| | - Chung-Ze Wu
- Department of Internal Medicine, Shuang Ho Hospital, New Taipei City, Division of Endocrinology and Metabolism, School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan, ROC
| | - Ta-Wei Chu
- Department of Obstetrics and Gynecology, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan, ROC
- MJ Health Research Foundation, Taipei, Taiwan, ROC
| |
Collapse
|
13
|
Chen CH, Wang CK, Wang CY, Chang CF, Chu TW. Roles of biochemistry data, lifestyle, and inflammation in identifying abnormal renal function in old Chinese. World J Clin Cases 2023; 11:7004-7016. [PMID: 37946770 PMCID: PMC10631406 DOI: 10.12998/wjcc.v11.i29.7004] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 08/01/2023] [Accepted: 09/11/2023] [Indexed: 10/13/2023] Open
Abstract
BACKGROUND The incidence of chronic kidney disease (CKD) has dramatically increased in recent years, with significant impacts on patient mortality rates. Previous studies have identified multiple risk factors for CKD, but they mostly relied on the use of traditional statistical methods such as logistic regression and only focused on a few risk factors. AIM To determine factors that can be used to identify subjects with a low estimated glomerular filtration rate (L-eGFR < 60 mL/min per 1.73 m2) in a cohort of 1236 Chinese people aged over 65. METHODS Twenty risk factors were divided into three models. Model 1 consisted of demographic and biochemistry data. Model 2 added lifestyle data to Model 1, and Model 3 added inflammatory markers to Model 2. Five machine learning methods were used: Multivariate adaptive regression splines, eXtreme Gradient Boosting, stochastic gradient boosting, Light Gradient Boosting Machine, and Categorical Features + Gradient Boosting. Evaluation criteria included accuracy, sensitivity, specificity, area under the receiver operating characteristic curve (AUC), F-1 score, and balanced accuracy. RESULTS A trend of increasing AUC of each was observed from Model 1 to Model 3 and reached statistical significance. Model 3 selected uric acid as the most important risk factor, followed by age, hemoglobin (Hb), body mass index (BMI), sport hours, and systolic blood pressure (SBP). CONCLUSION Among all the risk factors including demographic, biochemistry, and lifestyle risk factors, along with inflammation markers, UA is the most important risk factor to identify L-eGFR, followed by age, Hb, BMI, sport hours, and SBP in a cohort of elderly Chinese people.
Collapse
Affiliation(s)
- Chao-Hung Chen
- Division of Urology, Department of Surgery, Kaohsiung Armed Forces General Hospital, Kaohsiung 802, Taiwan
- Division of Urology, Department of Surgery, Chang Gung Memorial Hospital, Keelung 204, Taiwan
| | - Chun-Kai Wang
- Department of Obstetrics and Gynecology, Zuoying Branch of Kaohsiung Armed Forces General Hospital, Kaohsiung 813, Taiwan
| | - Chen-Yu Wang
- Department of Obstetrics and Gynecology, Tri-Service General Hospital, National Defense Medical Center, Taipei 114, Taiwan
| | - Chun-Feng Chang
- Division of Urology, Department of Surgery, Kaohsiung Armed Forces General Hospital, Kaohsiung 802, Taiwan
- Division of Urology, Department of Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei 114, Taiwan
| | - Ta-Wei Chu
- Department of Obstetrics and Gynecology, Tri-Service General Hospital, National Defense Medical Center, Taipei 114, Taiwan
- Chief Executive Officer's Office, MJ Health Research Foundation, Taipei 114, Taiwan
| |
Collapse
|
14
|
Tsai MH, Jhou MJ, Liu TC, Fang YW, Lu CJ. An integrated machine learning predictive scheme for longitudinal laboratory data to evaluate the factors determining renal function changes in patients with different chronic kidney disease stages. Front Med (Lausanne) 2023; 10:1155426. [PMID: 37859858 PMCID: PMC10582636 DOI: 10.3389/fmed.2023.1155426] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 09/19/2023] [Indexed: 10/21/2023] Open
Abstract
Background and objectives Chronic kidney disease (CKD) is a global health concern. This study aims to identify key factors associated with renal function changes using the proposed machine learning and important variable selection (ML&IVS) scheme on longitudinal laboratory data. The goal is to predict changes in the estimated glomerular filtration rate (eGFR) in a cohort of patients with CKD stages 3-5. Design A retrospective cohort study. Setting and participants A total of 710 outpatients who presented with stable nondialysis-dependent CKD stages 3-5 at the Shin-Kong Wu Ho-Su Memorial Hospital Medical Center from 2016 to 2021. Methods This study analyzed trimonthly laboratory data including 47 indicators. The proposed scheme used stochastic gradient boosting, multivariate adaptive regression splines, random forest, eXtreme gradient boosting, and light gradient boosting machine algorithms to evaluate the important factors for predicting the results of the fourth eGFR examination, especially in patients with CKD stage 3 and those with CKD stages 4-5, with or without diabetes mellitus (DM). Main outcome measurement Subsequent eGFR level after three consecutive laboratory data assessments. Results Our ML&IVS scheme demonstrated superior predictive capabilities and identified significant factors contributing to renal function changes in various CKD groups. The latest levels of eGFR, blood urea nitrogen (BUN), proteinuria, sodium, and systolic blood pressure as well as mean levels of eGFR, BUN, proteinuria, and triglyceride were the top 10 significantly important factors for predicting the subsequent eGFR level in patients with CKD stages 3-5. In individuals with DM, the latest levels of BUN and proteinuria, mean levels of phosphate and proteinuria, and variations in diastolic blood pressure levels emerged as important factors for predicting the decline of renal function. In individuals without DM, all phosphate patterns and latest albumin levels were found to be key factors in the advanced CKD group. Moreover, proteinuria was identified as an important factor in the CKD stage 3 group without DM and CKD stages 4-5 group with DM. Conclusion The proposed scheme highlighted factors associated with renal function changes in different CKD conditions, offering valuable insights to physicians for raising awareness about renal function changes.
Collapse
Affiliation(s)
- Ming-Hsien Tsai
- Division of Nephrology, Department of Medicine, Shin Kong Wu Ho-Su Memorial Hospital, Taipei, Taiwan
- Department of Medicine, School of Medicine, Fu Jen Catholic University, New Taipei City, Taiwan
| | - Mao-Jhen Jhou
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City, Taiwan
| | - Tzu-Chi Liu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City, Taiwan
| | - Yu-Wei Fang
- Division of Nephrology, Department of Medicine, Shin Kong Wu Ho-Su Memorial Hospital, Taipei, Taiwan
- Department of Medicine, School of Medicine, Fu Jen Catholic University, New Taipei City, Taiwan
| | - Chi-Jie Lu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City, Taiwan
- Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City, Taiwan
- Department of Information Management, Fu Jen Catholic University, New Taipei City, Taiwan
| |
Collapse
|
15
|
Jeong K, Mallard AR, Coombe L, Ward J. Artificial intelligence and prediction of cardiometabolic disease: Systematic review of model performance and potential benefits in indigenous populations. Artif Intell Med 2023; 139:102534. [PMID: 37100506 DOI: 10.1016/j.artmed.2023.102534] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Revised: 11/20/2022] [Accepted: 03/21/2023] [Indexed: 03/30/2023]
Abstract
BACKGROUND Indigenous peoples often have higher rates of morbidity and mortality associated with cardiometabolic disease (CMD) than non-Indigenous people and this may be even more so in urban areas. The use of electronic health records and expansion of computing power has led to mainstream use of artificial intelligence (AI) to predict the onset of disease in primary health care (PHC) settings. However, it is unknown if AI and in particular machine learning is used for risk prediction of CMD in Indigenous peoples. METHODS We searched peer-reviewed literature using terms associated with AI machine learning, PHC, CMD, and Indigenous peoples. RESULTS We identified 13 suitable studies for inclusion in this review. Median total number of participants was 19,270 (range 911-2,994,837). The most common algorithms used in machine learning in this setting were support vector machine, random forest, and decision tree learning. Twelve studies used the area under the receiver operating characteristic curve (AUC) to measure performance. Two studies reported an AUC of >0.9. Six studies had an AUC score between 0.9 and 0.8, 4 studies had an AUC score between 0.8 and 0.7. 1 study reported an AUC score between 0.7 and 0.6. Risk of bias was observed in 10 (77 %) studies. CONCLUSION AI machine learning and risk prediction models show moderate to excellent discriminatory ability over traditional statistical models in predicting CMD. This technology could help address the needs of urban Indigenous peoples by predicting CMD early and more rapidly than conventional methods.
Collapse
|
16
|
Predict, diagnose, and treat chronic kidney disease with machine learning: a systematic literature review. J Nephrol 2023; 36:1101-1117. [PMID: 36786976 PMCID: PMC10227138 DOI: 10.1007/s40620-023-01573-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Accepted: 01/01/2023] [Indexed: 02/15/2023]
Abstract
OBJECTIVES In this systematic review we aimed at assessing how artificial intelligence (AI), including machine learning (ML) techniques have been deployed to predict, diagnose, and treat chronic kidney disease (CKD). We systematically reviewed the available evidence on these innovative techniques to improve CKD diagnosis and patient management. METHODS We included English language studies retrieved from PubMed. The review is therefore to be classified as a "rapid review", since it includes one database only, and has language restrictions; the novelty and importance of the issue make missing relevant papers unlikely. We extracted 16 variables, including: main aim, studied population, data source, sample size, problem type (regression, classification), predictors used, and performance metrics. We followed the Preferred Reporting Items for Systematic Reviews (PRISMA) approach; all main steps were done in duplicate. RESULTS From a total of 648 studies initially retrieved, 68 articles met the inclusion criteria. Models, as reported by authors, performed well, but the reported metrics were not homogeneous across articles and therefore direct comparison was not feasible. The most common aim was prediction of prognosis, followed by diagnosis of CKD. Algorithm generalizability, and testing on diverse populations was rarely taken into account. Furthermore, the clinical evaluation and validation of the models/algorithms was perused; only a fraction of the included studies, 6 out of 68, were performed in a clinical context. CONCLUSIONS Machine learning is a promising tool for the prediction of risk, diagnosis, and therapy management for CKD patients. Nonetheless, future work is needed to address the interpretability, generalizability, and fairness of the models to ensure the safe application of such technologies in routine clinical practice.
Collapse
|
17
|
Huang YC, Cheng YC, Jhou MJ, Chen M, Lu CJ. Integrated Machine Learning Decision Tree Model for Risk Evaluation in Patients with Non-Valvular Atrial Fibrillation When Taking Different Doses of Dabigatran. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:2359. [PMID: 36767726 PMCID: PMC9915180 DOI: 10.3390/ijerph20032359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 01/24/2023] [Accepted: 01/24/2023] [Indexed: 06/18/2023]
Abstract
The new generation of nonvitamin K antagonists are broadly applied for stroke prevention due to their notable efficacy and safety. Our study aimed to develop a suggestive utilization of dabigatran through an integrated machine learning (ML) decision-tree model. Participants taking different doses of dabigatran in the Randomized Evaluation of Long-Term Anticoagulant Therapy trial were included in our analysis and defined as the 110 mg and 150 mg groups. The proposed scheme integrated ML methods, namely naive Bayes, random forest (RF), classification and regression tree (CART), and extreme gradient boosting (XGBoost), which were used to identify the essential variables for predicting vascular events in the 110 mg group and bleeding in the 150 mg group. RF (0.764 for 110 mg; 0.747 for 150 mg) and XGBoost (0.708 for 110 mg; 0.761 for 150 mg) had better area under the receiver operating characteristic curve (AUC) values than logistic regression (benchmark model; 0.683 for 110 mg; 0.739 for 150 mg). We then selected the top ten important variables as internal nodes of the CART decision tree. The two best CART models with ten important variables output tree-shaped rules for predicting vascular events in the 110 mg group and bleeding in the 150 mg group. Our model can be used to provide more visualized and interpretable suggestive rules to clinicians managing NVAF patients who are taking dabigatran.
Collapse
Affiliation(s)
- Yung-Chuan Huang
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, Taiwan
- Department of Neurology, Fu Jen Catholic University Hospital, Fu Jen Catholic University, New Taipei City 24352, Taiwan
| | - Yu-Chen Cheng
- Department of Neurology, Fu Jen Catholic University Hospital, Fu Jen Catholic University, New Taipei City 24352, Taiwan
| | - Mao-Jhen Jhou
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, Taiwan
| | - Mingchih Chen
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, Taiwan
- Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City 242062, Taiwan
| | - Chi-Jie Lu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, Taiwan
- Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City 242062, Taiwan
- Department of Information Management, Fu Jen Catholic University, New Taipei City 242062, Taiwan
| |
Collapse
|
18
|
Stedman M, Heald A, Robinson A, Davies M, Harnett P. Associations and mitigations: an analysis of the changing risk factor landscape for chronic kidney disease in primary care using national general practice level data. BMJ Open 2022; 12:e064723. [PMID: 36549719 PMCID: PMC9791436 DOI: 10.1136/bmjopen-2022-064723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
OBJECTIVES Early recognition of chronic kidney disease (CKD) should be achieved by every modern healthcare system. The objective of this study was to investigate CKD risk factor trends in England using general practice level data. DESIGN Observational analysis of data at practice level for all general practices in England. Practice characteristics identified as potential CKD risk factors included comorbidities and local demography. Data were analysed using both univariate and multivariate analysis to identify significant factors that were associated with CKD diagnosis for the period 1 April 2019 to 31 March 2020. SETTING Publicly available data from UK primary care sources including Primary Care Quality and Outcomes Framework database, practice-level prescribing data from the British National Formulary and Public Health England health outcome data. PARTICIPANTS All data submitted from 6471 medium to large practices in England were included (over 46 million patients). RISK FACTOR ANALYSIS Potential risk factors were grouped into four classes based on existing literature: demographic factors, comorbidities, service and practice outcome factors, and prescribing data effects. RESULTS The original model's prediction of CKD improved from r2 0.38 to an r2 of 0.66 when updated factors were included. Positive associations included known risk factors with higher relative risk such as hypertension and diabetes, along with less recognised factors such as depression and use of opiates. Negative associations included NSAIDs which are traditionally associated with increased CKD risk, and prescribing of antibiotics, along with more northerly locations. CONCLUSIONS CKD is a preventable disease with high costs and consequences. These data and novel analysis give clearer relative risk values for different patient characteristics with some unexpected findings such as potential harmful association between CKD and opiates, and a more benign association with NSAIDs. A deeper understanding of CKD risk factors is important to update and implement local and national management strategies. Further research is required to establish the causal nature of these associations and to refine location appropriate actions to minimise harm from CKD on regional and local levels.
Collapse
Affiliation(s)
| | - Adrian Heald
- University of Manchester, Manchester, UK
- Department of Diabetes and Endocrinology, Salford Royal Hospital, Salford, UK
| | | | | | - Patrick Harnett
- Department of Renal Medicine, Princess Elizabeth Hospital, Saint Andrews, Guernsey
| |
Collapse
|
19
|
A Hybrid Risk Factor Evaluation Scheme for Metabolic Syndrome and Stage 3 Chronic Kidney Disease Based on Multiple Machine Learning Techniques. Healthcare (Basel) 2022; 10:healthcare10122496. [PMID: 36554020 PMCID: PMC9778302 DOI: 10.3390/healthcare10122496] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 11/28/2022] [Accepted: 12/08/2022] [Indexed: 12/14/2022] Open
Abstract
With the rapid development of medicine and technology, machine learning (ML) techniques are extensively applied to medical informatics and the suboptimal health field to identify critical predictor variables and risk factors. Metabolic syndrome (MetS) and chronic kidney disease (CKD) are important risk factors for many comorbidities and complications. Existing studies that utilize different statistical or ML algorithms to perform CKD data analysis mostly analyze the early-stage subjects directly, but few studies have discussed the predictive models and important risk factors for the stage-III CKD high-risk health screening population. The middle stages 3a and 3b of CKD indicate moderate renal failure. This study aims to construct an effective hybrid important risk factor evaluation scheme for subjects with MetS and CKD stages III based on ML predictive models. The six well-known ML techniques, namely random forest (RF), logistic regression (LGR), multivariate adaptive regression splines (MARS), extreme gradient boosting (XGBoost), gradient boosting with categorical features support (CatBoost), and a light gradient boosting machine (LightGBM), were used in the proposed scheme. The data were sourced from the Taiwan health examination indicators and the questionnaire responses of 71,108 members between 2005 and 2017. In total, 375 stage 3a CKD and 50 CKD stage 3b CKD patients were enrolled, and 33 different variables were used to evaluate potential risk factors. Based on the results, the top five important variables, namely BUN, SBP, Right Intraocular Pressure (R-IOP), RBCs, and T-Cho/HDL-C (C/H), were identified as significant variables for evaluating the subjects with MetS and CKD stage 3a or 3b.
Collapse
|
20
|
Schena FP, Anelli VW, Abbrescia DI, Di Noia T. Prediction of chronic kidney disease and its progression by artificial intelligence algorithms. J Nephrol 2022; 35:1953-1971. [PMID: 35543912 DOI: 10.1007/s40620-022-01302-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Accepted: 03/04/2022] [Indexed: 12/14/2022]
Abstract
BACKGROUND AND OBJECTIVE Aim of nephrologists is to delay the outcome and reduce the number of patients undergoing renal failure (RF) by applying prevention protocols and accurately monitoring chronic kidney disease (CKD) patients. General practitioners and nephrologists are involved in the first and in the late stages of the disease, respectively. Early diagnosis of CKD is an important step in preventing the progression of kidney damage. Our aim was to review publications on machine learning algorithms (MLAs) that can predict early CKD and its progression. METHODS We conducted a systematic review and selected 55 articles on the application of MLAs in CKD. PubMed, Medline, Scopus, Web of Science and IEEE Xplore Digital Library of the Institute of Electrical and Electronics Engineers were searched. The search terms were chronic kidney disease, artificial intelligence, data mining and machine learning algorithms. RESULTS MLAs use enormous numbers of predictors combining them in non-linear and highly interactive ways. This ability increases when new data is added. We observed some limitations in the publications: (i) databases were not accurately reviewed by physicians; (ii) databases did not report the ethnicity of the patients; (iii) some databases collected variables that were not important for the diagnosis and progression of CKD; (iv) no information was presented on the native kidney disease causing CKD; (v) no validation of the results in external independent cohorts was provided; and (vi) no insights were given on the MLAs that were used. Overall, there was limited collaboration among experts in electronics, computer science and physicians. CONCLUSIONS The application of MLAs in kidney diseases may enhance the ability of clinicians to predict CKD and RF, thus improving diagnostic assistance and providing suitable therapeutic decisions. However, it is necessary to improve the development process of MLA tools.
Collapse
Affiliation(s)
- Francesco Paolo Schena
- Department of Emergency and Organ Transplants, University of Bari, Bari, Italy.
- Department of Electrical and Information Engineering, Polytechnic of Bari, Bari, Italy.
| | - Vito Walter Anelli
- Department of Electrical and Information Engineering, Polytechnic of Bari, Bari, Italy
| | | | - Tommaso Di Noia
- Department of Electrical and Information Engineering, Polytechnic of Bari, Bari, Italy
| |
Collapse
|
21
|
Su D, Zhang X, He K, Chen Y, Wu N. Individualized prediction of chronic kidney disease for the elderly in longevity areas in China: Machine learning approaches. Front Public Health 2022; 10:998549. [PMID: 36339144 PMCID: PMC9634246 DOI: 10.3389/fpubh.2022.998549] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 09/20/2022] [Indexed: 01/26/2023] Open
Abstract
Background Chronic kidney disease (CKD) has become a major public health problem worldwide and has caused a huge social and economic burden, especially in developing countries. No previous study has used machine learning (ML) methods combined with longitudinal data to predict the risk of CKD development in 2 years amongst the elderly in China. Methods This study was based on the panel data of 925 elderly individuals in the 2012 baseline survey and 2014 follow-up survey of the Healthy Aging and Biomarkers Cohort Study (HABCS) database. Six ML models, logistic regression (LR), lasso regression, random forests (RF), gradient-boosted decision tree (GBDT), support vector machine (SVM), and deep neural network (DNN), were developed to predict the probability of CKD amongst the elderly in 2 years (the year of 2014). The decision curve analysis (DCA) provided a range of threshold probability of the outcome and the net benefit of each ML model. Results Amongst the 925 elderly in the HABCS 2014 survey, 289 (18.8%) had CKD. Compared with the other models, LR, lasso regression, RF, GBDT, and DNN had no statistical significance of the area under the receiver operating curve (AUC) value (>0.7), and SVM exhibited the lowest predictive performance (AUC = 0.633, p-value = 0.057). DNN had the highest positive predictive value (PPV) (0.328), whereas LR had the lowest (0.287). DCA results indicated that within the threshold ranges of ~0-0.03 and 0.37-0.40, the net benefit of GBDT was the largest. Within the threshold ranges of ~0.03-0.10 and 0.26-0.30, the net benefit of RF was the largest. Age was the most important predictor variable in the RF and GBDT models. Blood urea nitrogen, serum albumin, uric acid, body mass index (BMI), marital status, activities of daily living (ADL)/instrumental activities of daily living (IADL) and gender were crucial in predicting CKD in the elderly. Conclusion The ML model could successfully capture the linear and nonlinear relationships of risk factors for CKD in the elderly. The decision support system based on the predictive model in this research can help medical staff detect and intervene in the health of the elderly early.
Collapse
Affiliation(s)
- Dai Su
- Department of Health Management and Policy, School of Public Health, Capital Medical University, Beijing, China
| | - Xingyu Zhang
- Department of Systems, Populations, and Leadership, University of Michigan School of Nursing, Ann Arbor, MI, United States,Thomas E. Starzl Transplantation Institute, University of Pittsburgh Medical Center, Pittsburgh, PA, United States
| | - Kevin He
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, United States
| | - Yingchun Chen
- Department of Health Management, School of Medicine and Health Management, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China,Research Center for Rural Health Services, Hubei Province Key Research Institute of Humanities and Social Sciences, Wuhan, China
| | - Nina Wu
- Department of Health Management and Policy, School of Public Health, Capital Medical University, Beijing, China,*Correspondence: Nina Wu
| |
Collapse
|
22
|
Big Data, Decision Models, and Public Health. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph19148543. [PMID: 35886394 PMCID: PMC9324609 DOI: 10.3390/ijerph19148543] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 07/08/2022] [Indexed: 12/10/2022]
|
23
|
Comparison between Machine Learning and Multiple Linear Regression to Identify Abnormal Thallium Myocardial Perfusion Scan in Chinese Type 2 Diabetes. Diagnostics (Basel) 2022; 12:diagnostics12071619. [PMID: 35885524 PMCID: PMC9324130 DOI: 10.3390/diagnostics12071619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Revised: 06/27/2022] [Accepted: 06/30/2022] [Indexed: 11/17/2022] Open
Abstract
Type 2 diabetes mellitus (T2DM) patients have a high risk of coronary artery disease (CAD). Thallium-201 myocardial perfusion scan (Th-201 scan) is a non-invasive and extensively used tool in recognizing CAD in clinical settings. In this study, we attempted to compare the predictive accuracy of evaluating abnormal Th-201 scans using traditional multiple linear regression (MLR) with four machine learning (ML) methods. From the study, we can determine whether ML surpasses traditional MLR and rank the clinical variables and compare them with previous reports.In total, 796 T2DM, including 368 men and 528 women, were enrolled. In addition to traditional MLR, classification and regression tree (CART), random forest (RF), stochastic gradient boosting (SGB) and eXtreme gradient boosting (XGBoost) were also used to analyze abnormal Th-201 scans. Stress sum score was used as the endpoint (dependent variable). Our findings show that all four root mean square errors of ML are smaller than with MLR, which implies that ML is more precise than MLR in determining abnormal Th-201 scans by using clinical parameters. The first seven factors, from the most important to the least are:body mass index, hemoglobin, age, glycated hemoglobin, Creatinine, systolic and diastolic blood pressure. In conclusion, ML is not inferior to traditional MLR in predicting abnormal Th-201 scans, and the most important factors are body mass index, hemoglobin, age, glycated hemoglobin, creatinine, systolic and diastolic blood pressure. ML methods are superior in these kinds of studies.
Collapse
|
24
|
Huang LY, Chen FY, Jhou MJ, Kuo CH, Wu CZ, Lu CH, Chen YL, Pei D, Cheng YF, Lu CJ. Comparing Multiple Linear Regression and Machine Learning in Predicting Diabetic Urine Albumin-Creatinine Ratio in a 4-Year Follow-Up Study. J Clin Med 2022; 11:3661. [PMID: 35806944 PMCID: PMC9267784 DOI: 10.3390/jcm11133661] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 06/19/2022] [Accepted: 06/22/2022] [Indexed: 02/07/2023] Open
Abstract
The urine albumin-creatinine ratio (uACR) is a warning for the deterioration of renal function in type 2 diabetes (T2D). The early detection of ACR has become an important issue. Multiple linear regression (MLR) has traditionally been used to explore the relationships between risk factors and endpoints. Recently, machine learning (ML) methods have been widely applied in medicine. In the present study, four ML methods were used to predict the uACR in a T2D cohort. We hypothesized that (1) ML outperforms traditional MLR and (2) different ranks of the importance of the risk factors will be obtained. A total of 1147 patients with T2D were followed up for four years. MLR, classification and regression tree, random forest, stochastic gradient boosting, and eXtreme gradient boosting methods were used. Our findings show that the prediction errors of the ML methods are smaller than those of MLR, which indicates that ML is more accurate. The first six most important factors were baseline creatinine level, systolic and diastolic blood pressure, glycated hemoglobin, and fasting plasma glucose. In conclusion, ML might be more accurate in predicting uACR in a T2D cohort than the traditional MLR, and the baseline creatinine level is the most important predictor, which is followed by systolic and diastolic blood pressure, glycated hemoglobin, and fasting plasma glucose in Chinese patients with T2D.
Collapse
Affiliation(s)
- Li-Ying Huang
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Department of Medical Education, Fu Jen Catholic University Hospital, School of Medicine, College of Medicine, Fu Jen Catholic University, New Taipei City 24352, Taiwan; (L.-Y.H.); (F.-Y.C.); (C.-H.K.); (D.P.)
| | - Fang-Yu Chen
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Department of Medical Education, Fu Jen Catholic University Hospital, School of Medicine, College of Medicine, Fu Jen Catholic University, New Taipei City 24352, Taiwan; (L.-Y.H.); (F.-Y.C.); (C.-H.K.); (D.P.)
| | - Mao-Jhen Jhou
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, Taiwan;
| | - Chun-Heng Kuo
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Department of Medical Education, Fu Jen Catholic University Hospital, School of Medicine, College of Medicine, Fu Jen Catholic University, New Taipei City 24352, Taiwan; (L.-Y.H.); (F.-Y.C.); (C.-H.K.); (D.P.)
| | - Chung-Ze Wu
- Division of Endocrinology, Department of Internal Medicine, Shuang Ho Hospital, New Taipei City 23561, Taiwan;
- Division of Endocrinology and Metabolism, Department of Internal Medicine, School of Medicine, College of Medicine, Taipei Medical University, Taipei 11031, Taiwan
| | - Chieh-Hua Lu
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Tri-Service General Hospital, School of Medicine, National Defense Medical Center, Taipei 11490, Taiwan;
| | - Yen-Lin Chen
- Department of Pathology, Tri-Service General Hospital, National Defense Medical Center, Taipei 11490, Taiwan;
| | - Dee Pei
- Division of Endocrinology and Metabolism, Department of Internal Medicine, Department of Medical Education, Fu Jen Catholic University Hospital, School of Medicine, College of Medicine, Fu Jen Catholic University, New Taipei City 24352, Taiwan; (L.-Y.H.); (F.-Y.C.); (C.-H.K.); (D.P.)
| | - Yu-Fang Cheng
- Department of Endocrinology and Metabolism, Changhua Christian Hospital, Changhua 50051, Taiwan;
| | - Chi-Jie Lu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, Taiwan;
- Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City 242062, Taiwan
- Department of Information Management, Fu Jen Catholic University, New Taipei City 242062, Taiwan
| |
Collapse
|
25
|
Cao X, Lin Y, Yang B, Li Y, Zhou J. Comparison Between Statistical Model and Machine Learning Methods for Predicting the Risk of Renal Function Decline Using Routine Clinical Data in Health Screening. Risk Manag Healthc Policy 2022; 15:817-826. [PMID: 35502445 PMCID: PMC9056070 DOI: 10.2147/rmhp.s346856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 04/16/2022] [Indexed: 11/23/2022] Open
Abstract
Purpose Using machine learning method to predict and judge unknown data offers opportunity to improve accuracy by exploring complex interactions between risk factors. Therefore, we evaluate the performance of machine learning (ML) algorithms and to compare them with logistic regression for predicting the risk of renal function decline (RFD) using routine clinical data. Patients and Methods This retrospective cohort study includes datasets from 2166 subjects, aged 35–74 years old, provided by an adult health screening follow-up program between 2010 and 2020. Seven different ML models were considered – random forest, gradient boosting, multilayer perceptron, support vector machine, K-nearest neighbors, adaptive boosting, and decision tree - and were compared with standard logistic regression. There were 24 independent variables, and the baseline estimate glomerular filtration rate (eGFR) was used as the predictive variable. Results A total of 2166 participants (mean age 49.2±11.2 years old, 63.3% males) were enrolled and randomly divided into a training set (n=1732) and a test set (n=434). The area under receiver operating characteristic curve (AUROC) for detecting RFD corresponding to the different models were above 0.85 during the training phase. The gradient boosting algorithms exhibited the best average prediction accuracy (AUROC: 0.914) among all algorithms validated in this study. Based on AUROC, the ML algorithms improved the RFD prediction performance, compared to logistic regression model (AUROC:0.882), except the K-nearest neighbors and decision tree algorithms (AUROC:0.854 and 0.824, respectively). However, the improvement differences with logistic regression were small (less than 4%) and nonsignificant. Conclusion Our results indicate that the proposed health screening dataset-based RFD prediction model using ML algorithms is readily applicable, produces validated results. But logistic regression yields as good performance as ML models to predict the risk of RFD with simple clinical predictors.
Collapse
Affiliation(s)
- Xia Cao
- Health Management Center, The Third Xiangya Hospital, Central South University, Changsha, Hunan, People’s Republic of China
- Health Management Research Center, Central South University, Changsha, Hunan, People’s Republic of China
- Hunan Chronic Disease Health Management Medical Research Center, Central South University, Changsha, Hunan, People’s Republic of China
| | - Yanhui Lin
- Health Management Center, The Third Xiangya Hospital, Central South University, Changsha, Hunan, People’s Republic of China
- Health Management Research Center, Central South University, Changsha, Hunan, People’s Republic of China
- Hunan Chronic Disease Health Management Medical Research Center, Central South University, Changsha, Hunan, People’s Republic of China
| | - Binfang Yang
- Health Management Center, The Third Xiangya Hospital, Central South University, Changsha, Hunan, People’s Republic of China
- Health Management Research Center, Central South University, Changsha, Hunan, People’s Republic of China
- Hunan Chronic Disease Health Management Medical Research Center, Central South University, Changsha, Hunan, People’s Republic of China
| | - Ying Li
- Health Management Center, The Third Xiangya Hospital, Central South University, Changsha, Hunan, People’s Republic of China
- Health Management Research Center, Central South University, Changsha, Hunan, People’s Republic of China
- Hunan Chronic Disease Health Management Medical Research Center, Central South University, Changsha, Hunan, People’s Republic of China
| | - Jiansong Zhou
- National Clinical Research Center for Mental Disorders, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan, People’s Republic of China
- Correspondence: Jiansong Zhou, National Clinical Research Center for Mental Disorders, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, 410011, People’s Republic of China, Tel/Fax +86 073188618573, Email
| |
Collapse
|
26
|
Lin M, Heizhati M, Gan L, Yao L, Yang W, Li M, Hong J, Wu Z, Wang H, Li N. Development and Validation of a Prediction Model for 5-Year Risk of Kidney Dysfunction in Patients with Hypertension and Glucose Metabolism Disorder. Risk Manag Healthc Policy 2022; 15:289-298. [PMID: 35221736 PMCID: PMC8880707 DOI: 10.2147/rmhp.s345059] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 02/01/2022] [Indexed: 12/15/2022] Open
Abstract
Purpose Patients with hypertension and glucose metabolism disorder (GMD) are at high risk of developing kidney dysfunction (KD). Therefore, we aimed to develop a nomogram for predicting individuals’ 5-year risk of KD in hypertensives with GMD. Patients and Methods In total, 1961 hypertensives with GMD were consecutively included. Baseline data were extracted from medical electronic system, and follow-up data were obtained using annual health check-ups or hospital readmission. KD was defined as estimated glomerular filtration rate (eGFR) <60 mL/min/1.73m2. Subjects were randomly divided into training and validation sets with a ratio of 7 to 3. Least absolute shrinkage and selection operator method was used to identify potential predictors. Cox proportional hazard model was applied to build a nomogram for predicting KD risk. The discriminative ability, calibration and usefulness of the model were evaluated. The prediction model was verified by internal validation. Results During the follow-up of 5351 person-years with a median follow-up of 32 (range: 3–91) months, 130 patients developed KD. Age, sex, ethnicity, hemoglobin A1c, uric acid, and baseline eGFR were identified as significant predictors for incident KD and used for establishing nomogram. The prediction model displayed good discrimination with C-index of 0.770 (95% CI: 0.712–0.828) and 0.763 (95% CI: 0.704–0.823) in training and validation sets, respectively. Calibration curve indicated good agreement between the predicted and actual probabilities. The decision curve analysis demonstrated that the model was clinically useful. Conclusion The prediction nomogram, including six common easy-to-obtain factors, shows good performance for predicting 5-year risk of KD in hypertensives with GMD. This quantitative tool could help clinicians, and even primary care providers, recognize potential KD patients early and make strategy for prevention and management.
Collapse
Affiliation(s)
- Mengyue Lin
- Hypertension Center of People’s Hospital of Xinjiang Uygur Autonomous Region; Xinjiang Hypertension Institute; National Health Committee Key Laboratory of Hypertension Clinical Research; Key Laboratory of Xinjiang Uygur Autonomous Region “Hypertension Research Laboratory”; Xinjiang Clinical Medical Research Center for Hypertension (Cardio-Cerebrovascular) Diseases, Urumqi, People’s Republic of China
- Xinjiang Medical University, Urumqi, Xinjiang, People’s Republic of China
| | - Mulalibieke Heizhati
- Hypertension Center of People’s Hospital of Xinjiang Uygur Autonomous Region; Xinjiang Hypertension Institute; National Health Committee Key Laboratory of Hypertension Clinical Research; Key Laboratory of Xinjiang Uygur Autonomous Region “Hypertension Research Laboratory”; Xinjiang Clinical Medical Research Center for Hypertension (Cardio-Cerebrovascular) Diseases, Urumqi, People’s Republic of China
| | - Lin Gan
- Hypertension Center of People’s Hospital of Xinjiang Uygur Autonomous Region; Xinjiang Hypertension Institute; National Health Committee Key Laboratory of Hypertension Clinical Research; Key Laboratory of Xinjiang Uygur Autonomous Region “Hypertension Research Laboratory”; Xinjiang Clinical Medical Research Center for Hypertension (Cardio-Cerebrovascular) Diseases, Urumqi, People’s Republic of China
- Xinjiang Medical University, Urumqi, Xinjiang, People’s Republic of China
| | - Ling Yao
- Hypertension Center of People’s Hospital of Xinjiang Uygur Autonomous Region; Xinjiang Hypertension Institute; National Health Committee Key Laboratory of Hypertension Clinical Research; Key Laboratory of Xinjiang Uygur Autonomous Region “Hypertension Research Laboratory”; Xinjiang Clinical Medical Research Center for Hypertension (Cardio-Cerebrovascular) Diseases, Urumqi, People’s Republic of China
| | - Wenbo Yang
- Hypertension Center of People’s Hospital of Xinjiang Uygur Autonomous Region; Xinjiang Hypertension Institute; National Health Committee Key Laboratory of Hypertension Clinical Research; Key Laboratory of Xinjiang Uygur Autonomous Region “Hypertension Research Laboratory”; Xinjiang Clinical Medical Research Center for Hypertension (Cardio-Cerebrovascular) Diseases, Urumqi, People’s Republic of China
| | - Mei Li
- Hypertension Center of People’s Hospital of Xinjiang Uygur Autonomous Region; Xinjiang Hypertension Institute; National Health Committee Key Laboratory of Hypertension Clinical Research; Key Laboratory of Xinjiang Uygur Autonomous Region “Hypertension Research Laboratory”; Xinjiang Clinical Medical Research Center for Hypertension (Cardio-Cerebrovascular) Diseases, Urumqi, People’s Republic of China
| | - Jing Hong
- Hypertension Center of People’s Hospital of Xinjiang Uygur Autonomous Region; Xinjiang Hypertension Institute; National Health Committee Key Laboratory of Hypertension Clinical Research; Key Laboratory of Xinjiang Uygur Autonomous Region “Hypertension Research Laboratory”; Xinjiang Clinical Medical Research Center for Hypertension (Cardio-Cerebrovascular) Diseases, Urumqi, People’s Republic of China
| | - Zihao Wu
- Hypertension Center of People’s Hospital of Xinjiang Uygur Autonomous Region; Xinjiang Hypertension Institute; National Health Committee Key Laboratory of Hypertension Clinical Research; Key Laboratory of Xinjiang Uygur Autonomous Region “Hypertension Research Laboratory”; Xinjiang Clinical Medical Research Center for Hypertension (Cardio-Cerebrovascular) Diseases, Urumqi, People’s Republic of China
| | - Hui Wang
- Hypertension Center of People’s Hospital of Xinjiang Uygur Autonomous Region; Xinjiang Hypertension Institute; National Health Committee Key Laboratory of Hypertension Clinical Research; Key Laboratory of Xinjiang Uygur Autonomous Region “Hypertension Research Laboratory”; Xinjiang Clinical Medical Research Center for Hypertension (Cardio-Cerebrovascular) Diseases, Urumqi, People’s Republic of China
- Xinjiang Medical University, Urumqi, Xinjiang, People’s Republic of China
| | - Nanfang Li
- Hypertension Center of People’s Hospital of Xinjiang Uygur Autonomous Region; Xinjiang Hypertension Institute; National Health Committee Key Laboratory of Hypertension Clinical Research; Key Laboratory of Xinjiang Uygur Autonomous Region “Hypertension Research Laboratory”; Xinjiang Clinical Medical Research Center for Hypertension (Cardio-Cerebrovascular) Diseases, Urumqi, People’s Republic of China
- Correspondence: Nanfang Li, Email
| |
Collapse
|
27
|
Chang CC, Yeh JH, Chiu HC, Chen YM, Jhou MJ, Liu TC, Lu CJ. Utilization of Decision Tree Algorithms for Supporting the Prediction of Intensive Care Unit Admission of Myasthenia Gravis: A Machine Learning-Based Approach. J Pers Med 2022; 12:32. [PMID: 35055347 PMCID: PMC8778268 DOI: 10.3390/jpm12010032] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 12/24/2021] [Accepted: 12/28/2021] [Indexed: 12/23/2022] Open
Abstract
Myasthenia gravis (MG), an acquired autoimmune-related neuromuscular disorder that causes muscle weakness, presents with varying severity, including myasthenic crisis (MC). Although MC can cause significant morbidity and mortality, specialized neuro-intensive care can produce a good long-term prognosis. Considering the outcomes of MG during hospitalization, it is critical to conduct risk assessments to predict the need for intensive care. Evidence and valid tools for the screening of critical patients with MG are lacking. We used three machine learning-based decision tree algorithms, including a classification and regression tree, C4.5, and C5.0, for predicting intensive care unit (ICU) admission of patients with MG. We included 228 MG patients admitted between 2015 and 2018. Among them, 88.2% were anti-acetylcholine receptors antibody positive and 4.7% were anti-muscle-specific kinase antibody positive. Twenty clinical variables were used as predictive variables. The C5.0 decision tree outperformed the other two decision tree and logistic regression models. The decision rules constructed by the best C5.0 model showed that the Myasthenia Gravis Foundation of America clinical classification at admission, thymoma history, azathioprine treatment history, disease duration, sex, and onset age were significant risk factors for the development of decision rules for ICU admission prediction. The developed machine learning-based decision tree can be a supportive tool for alerting clinicians regarding patients with MG who require intensive care, thereby improving the quality of care.
Collapse
Affiliation(s)
- Che-Cheng Chang
- Department of Neurology, Fu Jen Catholic University Hospital, Fu Jen Catholic University, New Taipei City 24352, Taiwan; (C.-C.C.); (Y.-M.C.)
- Ph.D. Program in Nutrition and Food Sciences, Human Ecology College, Fu Jen Catholic University, New Taipei City 242062, Taiwan
| | - Jiann-Horng Yeh
- School of Medicine, Fu Jen Catholic University, New Taipei City 24205, Taiwan; (J.-H.Y.); (H.-C.C.)
- Department of Neurology, Shin Kong Wu Ho-Su Memorial Hospital, Taipei 11101, Taiwan
- Department of Neurology, Kaohsiung Medical University, Kaohsiung 80708, Taiwan
| | - Hou-Chang Chiu
- School of Medicine, Fu Jen Catholic University, New Taipei City 24205, Taiwan; (J.-H.Y.); (H.-C.C.)
- Department of Neurology, Shuang-Ho Hospital, Taipei Medical University, New Taipei City 23561, Taiwan
| | - Yen-Ming Chen
- Department of Neurology, Fu Jen Catholic University Hospital, Fu Jen Catholic University, New Taipei City 24352, Taiwan; (C.-C.C.); (Y.-M.C.)
| | - Mao-Jhen Jhou
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, Taiwan;
| | - Tzu-Chi Liu
- Department of Business Administration, Fu Jen Catholic University, New Taipei City, 242062, Taiwan;
| | - Chi-Jie Lu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei City 242062, Taiwan;
- Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City 242062, Taiwan
- Department of Information Management, Fu Jen Catholic University, New Taipei City 242062, Taiwan
| |
Collapse
|
28
|
Wang Y, Sun Y, Lu N, Feng X, Gao M, Zhang L, Dou Y, Meng F, Zhang K. Diagnosis and Treatment Rules of Chronic Kidney Disease and Nursing Intervention Models of Related Mental Diseases Using Electronic Medical Records and Data Mining. JOURNAL OF HEALTHCARE ENGINEERING 2021; 2021:5187837. [PMID: 34925735 PMCID: PMC8683225 DOI: 10.1155/2021/5187837] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Accepted: 10/20/2021] [Indexed: 11/22/2022]
Abstract
Objective On the basis of electronic medical records, the data mining technology was adopted to explore the law of chronic kidney disease (CKD) and the intervention mode of mental health of patients. Methods Based on the electronic medical records, the corresponding data extraction, database establishment, and data cleaning of CKD were performed. After that, the related data analysis, frequency analysis, cluster analysis, and nonparametric analysis were used to explore the laws of CKD diagnosis and treatment and nursing intervention mode of mental illness. The most common causes of CKD were chronic glomerulonephritis (43.76%), aristolochic acid nephritis (16.34%), diabetic nephritis (12.87%), and hypertensive nephritis (11.58%). The major treatment method for end-stage patients was alternative therapies, accounting for 46%. Compared with the depression score before intervention, that of the patients after the mindfulness therapy (50.99 ± 9.77 vs. 47.01 ± 9.33, P=0.024 < 0.5) and target behaviour nursing intervention (52.21 ± 8.12 vs. 48.01 ± 9.33, P=0.032 < 0.05) was obviously decreased. Conclusion The data mining technology based on electronic records showed a good application prospect in the analysis of the diagnosis and treatment of CKD; and target behaviour nursing and mindfulness intervention were effective psychological intervention models.
Collapse
Affiliation(s)
- Yanli Wang
- Department of Mental Health, The First Hospital of Hebei Medical University, Shijiazhuang, China
| | - Yueyao Sun
- Department of Hepatobiliary Surgery, The First Hospital of Hebei Medical University, Shijiazhuang, China
| | - Na Lu
- Department of Emergency, The First Hospital of Hebei Medical University, Shijiazhuang, China
| | - Xuan Feng
- Department of Nephrology, The First Hospital of Hebei Medical University, Shijiazhuang, China
| | - Minglong Gao
- Department of Mental Health, The First Hospital of Hebei Medical University, Shijiazhuang, China
| | - Lihong Zhang
- Department of Nephrology, The First Hospital of Hebei Medical University, Shijiazhuang, China
| | - Yaping Dou
- Department of Respiratory Medicine, First Hospital of Hebei Medical University, Shijiazhuang, China
| | - Fulei Meng
- Department of Nephrology, The First Hospital of Hebei Medical University, Shijiazhuang, China
| | - Kaidi Zhang
- Department of Nephrology, The First Hospital of Hebei Medical University, Shijiazhuang, China
| |
Collapse
|
29
|
Shih CC, Chen SH, Chen GD, Chang CC, Shih YL. Development of a Longitudinal Diagnosis and Prognosis in Patients with Chronic Kidney Disease: Intelligent Clinical Decision-Making Scheme. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph182312807. [PMID: 34886533 PMCID: PMC8657318 DOI: 10.3390/ijerph182312807] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 12/01/2021] [Accepted: 12/02/2021] [Indexed: 01/08/2023]
Abstract
Previous studies on CKD patients have mostly been retrospective, cross-sectional studies. Few studies have assessed the longitudinal assessment of patients over an extended period. In consideration of the heterogeneity of CKD progression. It’s critical to develop a longitudinal diagnosis and prognosis for CKD patients. We proposed an auto Machine Learning (ML) scheme in this study. It consists of four main parts: classification pipeline, cross-validation (CV), Taguchi method and improve strategies. This study includes datasets from 50,174 patients, data were collected from 32 chain clinics and three special physical examination centers, between 2015 and 2019. The proposed auto-ML scheme can auto-select the level of each strategy to associate with a classifier which finally shows an acceptable testing accuracy of 86.17%, balanced accuracy of 84.08%, sensitivity of 90.90% and specificity of 77.26%, precision of 88.27%, and F1 score of 89.57%. In addition, the experimental results showed that age, creatinine, high blood pressure, smoking are important risk factors, and has been proven in previous studies. Our auto-ML scheme light on the possibility of evaluation for the effectiveness of one or a combination of those risk factors. This methodology may provide essential information and longitudinal change for personalized treatment in the future.
Collapse
Affiliation(s)
- Chin-Chuan Shih
- Dean of the Lian-An Clinic, Taipei 24200, Taiwan;
- Deputy Chairman, Taiwan Association of Family Medicine, Taipei 24200, Taiwan
| | - Ssu-Han Chen
- Department of Industrial Engineering and Management, Ming Chi University of Technology, New Taipei City 243303, Taiwan;
- Center for Artificial Intelligence & Data Science, Ming Chi University of Technology, New Taipei City 243303, Taiwan
| | - Gin-Den Chen
- Institute of Medicine, Chung Shan Medical University, Taichung 40201, Taiwan;
| | - Chi-Chang Chang
- Department of Medical Informatics, Chung Shan Medical University & IT Office, Chung Shan Medical University Hospital, Taichung 40201, Taiwan
- Department of Information Management, Ming Chuan University, Taoyuan 33300, Taiwan
- Correspondence: ; Tel.: +886-4-24730022
| | - Yu-Lin Shih
- Department of Otolaryngology-Head and Neck Surgery, Chang-Gung Memorial Hospital, Linkou Branch, Taoyuan City 33305, Taiwan;
| |
Collapse
|
30
|
Chiu YL, Jhou MJ, Lee TS, Lu CJ, Chen MS. Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease. Risk Manag Healthc Policy 2021; 14:4401-4412. [PMID: 34737657 PMCID: PMC8558038 DOI: 10.2147/rmhp.s319405] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 09/30/2021] [Indexed: 01/02/2023] Open
Abstract
PURPOSE As global aging progresses, the health management of chronic diseases has become an important issue of concern to governments. Influenced by the aging of its population and improvements in the medical system and healthcare in general, Taiwan's population of patients with chronic kidney disease (CKD) has tended to grow year by year, including the incidence of high-risk cases that pose major health hazards to the elderly and middle-aged populations. METHODS This study analyzed the annual health screening data for 65,394 people from 2010 to 2015 sourced from the MJ Group - a major health screening center in Taiwan - including data for 18 risk indicators. We used five prediction model analysis methods, namely, logistic regression (LR) analysis, C5.0 decision tree (C5.0) analysis, stochastic gradient boosting (SGB) analysis, multivariate adaptive regression splines (MARS), and eXtreme gradient boosting (XGboost), with estimated glomerular filtration rate (e-GFR) data to determine G3a, G3b & G4 stage CKD risk factors. RESULTS The LR analysis (AUC=0.848), SGB analysis (AUC=0.855), and XGboost (AUC=0.858) generated similar classification performance levels and all outperformed the C5.0 and MARS methods. The study results showed that in terms of CKD risk factors, blood urea nitrogen (BUN) and uric acid (UA) were identified as the first and second most important indicators in the models of all five analysis methods, and they were also clinically recognized as the major risk factors. The results for systolic blood pressure (SBP), SGPT, SGOT, and LDL were similar to those of a related study. Interestingly, however, socioeconomic status-related education was found to be the third important indicator in all three of the better performing analysis methods, indicating that it is more important than the other risk indicators of this study, which had different levels of importance according to the different methods. CONCLUSION The five prediction model methods can provide high and similar classification performance in this study. Based on the results of this study, it is recommended that education as the socioeconomic status should be an important factor for CKD, as high educational level showed a negative and highly significant correlation with CKD. The findings of this study should also be of value for further discussions and follow-up research.
Collapse
Affiliation(s)
- Yen-Ling Chiu
- Graduate Institue of Medicine and Graduate School of Biomedical Informatics, Yuan Ze University, Taoyuan, 32003, Taiwan, Republic of China
- Graduate Institute of Clinical Medicine, National Taiwan University College of Medicine, Taipei, 10002, Taiwan, Republic of China
- Department of Medical Research, Department of Medicine,Far Eastern Memorial Hospital, New Taipei, 22056, Taiwan, Republic of China
| | - Mao-Jhen Jhou
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei, 242062, Taiwan, Republic of China
| | - Tian-Shyug Lee
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei, 242062, Taiwan, Republic of China
- Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City, 242062, Taiwan, Republic of China
| | - Chi-Jie Lu
- Graduate Institute of Business Administration, Fu Jen Catholic University, New Taipei, 242062, Taiwan, Republic of China
- Artificial Intelligence Development Center, Fu Jen Catholic University, New Taipei City, 242062, Taiwan, Republic of China
- Department of Information Management, Fu Jen Catholic University, New Taipei City, 242062, Taiwan, Republic of China
| | - Ming-Shu Chen
- Department of Healthcare Administration,College of Healthcare and Management, Asia Eastern University of Science and Technology, New Taipei, 22061, Taiwan, Republic of China
| |
Collapse
|
31
|
Comparison of Different Machine Learning Classifiers for Glaucoma Diagnosis Based on Spectralis OCT. Diagnostics (Basel) 2021; 11:diagnostics11091718. [PMID: 34574059 PMCID: PMC8471622 DOI: 10.3390/diagnostics11091718] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 09/17/2021] [Accepted: 09/18/2021] [Indexed: 11/17/2022] Open
Abstract
Early detection is important in glaucoma management. By using optical coherence tomography (OCT), the subtle structural changes caused by glaucoma can be detected. Though OCT provided abundant parameters for comprehensive information, clinicians may be confused once the results conflict. Machine learning classifiers (MLCs) are good tools for considering numerous parameters and generating reliable diagnoses in glaucoma practice. Here we aim to compare different MLCs based on Spectralis OCT parameters, including circumpapillary retinal nerve fiber layer (cRNFL) thickness, Bruch’s membrane opening-minimum rim width (BMO-MRW), Early Treatment Diabetes Retinopathy Study (ETDRS) macular thickness, and posterior pole asymmetry analysis (PPAA), in discriminating normal from glaucomatous eyes. Five MLCs were proposed, namely conditional inference trees (CIT), logistic model tree (LMT), C5.0 decision tree, random forest (RF), and extreme gradient boosting (XGBoost). Logistic regression (LGR) was used as a benchmark for comparison. RF was shown to be the best model. Ganglion cell layer measurements were the most important predictors in early glaucoma detection and cRNFL measurements were more important as the glaucoma severity increased. The global, temporal, inferior, superotemporal, and inferotemporal sites were relatively influential locations among all parameters. Clinicians should cautiously integrate the Spectralis OCT results into the entire clinical picture when diagnosing glaucoma.
Collapse
|
32
|
Risk Prediction of Second Primary Endometrial Cancer in Obese Women: A Hospital-Based Cancer Registry Study. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18178997. [PMID: 34501584 PMCID: PMC8431143 DOI: 10.3390/ijerph18178997] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 08/06/2021] [Accepted: 08/16/2021] [Indexed: 12/15/2022]
Abstract
Due to the high effectiveness of cancer screening and therapies, the diagnosis of second primary cancers (SPCs) has increased in women with endometrial cancer (EC). However, previous studies providing adequate evidence to support screening for SPCs in endometrial cancer are lacking. This study aimed to develop effective risk prediction models of second primary endometrial cancer (SPEC) in women with obesity (body mass index (BMI) > 25) and included datasets on the incidence of SPEC and the other risks of SPEC in 4480 primary cancer survivors from a hospital-based cancer registry database. We found that obesity plays a key role in SPEC. We used 10 independent variables as predicting variables, which correlated to obesity, and so should be monitored for the early detection of SPEC in endometrial cancer. Our proposed scheme is promising for SPEC prediction and demonstrates the important influence of obesity and clinical data representation in all cases following primary treatments. Our results suggest that obesity is still a crucial risk factor for SPEC in endometrial cancer.
Collapse
|
33
|
Chan CL, Chang CC. Big Data, Decision Models, and Public Health. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17186723. [PMID: 32942728 PMCID: PMC7558933 DOI: 10.3390/ijerph17186723] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 09/14/2020] [Indexed: 12/23/2022]
Abstract
Unlike most daily decisions, medical decision making often has substantial consequences and trade-offs. Recently, big data analytics techniques such as statistical analysis, data mining, machine learning and deep learning can be applied to construct innovative decision models. With complex decision making, it can be difficult to comprehend and compare the benefits and risks of all available options to make a decision. For these reasons, this Special Issue focuses on the use of big data analytics and forms of public health decision making based on the decision model, spanning from theory to practice. A total of 64 submissions were carefully blind peer reviewed by at least two referees and, finally, 23 papers were selected for this Special Issue.
Collapse
Affiliation(s)
- Chien-Lung Chan
- Department of Information Management, Yuan Ze University, Taoyuan 320, Taiwan;
- Innovation Center for Big Data and Digital Convergence, Yuan Ze University, Taoyuan 320, Taiwan
| | - Chi-Chang Chang
- School of Medical Informatics, Chung Shan Medical University & IT Office, Chung Shan Medical University Hospital, Taichung 40201, Taiwan
- Correspondence: ; Tel.: +886-4-24730022
| |
Collapse
|