Predicting major adverse cardiac events in diabetes and chronic kidney disease: a machine learning study from the Silesia Diabetes-Heart Project

预测糖尿病合并慢性肾病患者的主要不良心脏事件:来自西里西亚糖尿病-心脏项目的机器学习研究

阅读:1

Abstract

BACKGROUND: People living with diabetes mellitus (DM) and chronic kidney disease (CKD) are at significantly high risk of cardiovascular events (CVEs), however the predictive performance of traditional risk prediction methods are limited. METHODS: We utilised machine learning (ML) model to predict CVEs in persons with DM and CKD from the Silesia Diabetes-Heart Project, a routine standard of care dataset. CVEs were defined as composite of nonfatal myocardial infarction, new onset heart failure, nonfatal stroke, incident atrial fibrillation, undergoing percutaneous coronary intervention or coronary artery bypass grafting, hospitalisation or death due to cardiovascular disease. Five ML models (Logistic regression [LR], Random forest [RF], Support vector classification [SVC], Light gradient boosting machine [LGBM], and eXtreme gradient boosting machine [XGBM]) were constructed. The predictive performance of the five ML models was compared and the model interpretability were evaluated by Shapley Additive exPlanations (SHAP). RESULTS: A total of 1,116 people with DM and CKD out of 3,056 with DM were included (median age 67 [IQR 57-76] years; 57% men). The incidence of CVEs was 14.1% (157/1,116) during a median of 3.1 years follow-up period. Ten important features were identified through univariate Logistic regression, Boruta, and Least Absolute Shrinkage and Selection Operator [LASSO] regression. Among the five ML models based on these features, LGBM had the highest area under curve [AUC] (AUC = 0.740, 95% Confidence Interval [CI] 0.738-0.743), followed by LR (AUC = 0.621, 95% CI 0.618-0.623), RF (AUC = 0.707, 95% CI 0.704-0.709), SVC (AUC = 0.707, 95% CI 0.704-0.710), and XGBM (AUC = 0.710, 95% CI 0.707-0.713). Meanwhile, LGBM had relatively higher Recall (0.739), F1-score (0.820), and G-mean (0.826). The SHAP plot of LGBM revealed that estimated glomerular filtration rate (eGFR), age, and triglyceride glucose index were the three most important features for predicting CVEs. CONCLUSION: Ten features-based ML models, especially the LGBM model, had acceptable performance in predicting CVEs in persons with DM and CKD. A decrease in eGFR, aging, and elevated inflammatory markers significantly enhanced the predictive capability of the model. Future external validation of our model is required prior to implementation in a clinical environment.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。