Explainable machine learning model of coronary artery disease combined with diabetes: development and validation study

冠状动脉疾病合并糖尿病的可解释机器学习模型:开发与验证研究

阅读:4

Abstract

BACKGROUND: Coronary artery disease (CAD) demonstrates a strong bidirectional association with diabetes mellitus, which not only elevates cardiovascular disease risk but also correlates with poorer clinical prognosis. Prognostication in patients with comorbid CAD and diabetes remains a critical clinical challenge, significantly influencing therapeutic decision-making. Leveraging readily available clinical parameters for predicting adverse outcomes in this population offers substantial clinical value. This investigation employs machine learning algorithms to develop predictive models for prognostic assessment in CAD patients with diabetes comorbidity. METHOD: We conducted a retrospective cohort study of 389 patients with comorbid coronary artery disease (CAD) and diabetes mellitus. The cohort was randomly allocated into a training set (n = 273) and an internal validation set (n = 116). Feature selection utilized LASSO regression followed by backward stepwise Cox regression analysis. A nomogram incorporating independent predictors was developed for clinical application. Model performance was assessed through discrimination metrics, calibration plots, and decision curve analysis (DCA). Random survival forest analysis validated the clinical significance of selected variables. RESULT: Our modeling approach employed a systematic methodology: LASSO regression for initial feature selection followed by backward stepwise Cox regression analysis, yielding eight independent predictors.The final model incorporated hemoglobin, INR, albumin, NT-proBNP, age, fibrinogen, diuretic use, and digitalis therapy. The integrated model demonstrated strong discriminative performance for mortality prediction across both training (AUC = 0.846, 0.838, 0.82) and validation cohorts (AUC = 0.824, 0.813, 0.798) at 3-, 5-, and 8-year intervals. Calibration plots and decision curve analysis confirmed model reliability and clinical utility over time. A nomogram was developed to facilitate individualized risk stratification. Kaplan-Meier analysis showed significant survival stratification by individual predictors, and restricted cubic spline analysis identified non-linear associations between continuous variables and mortality. Random survival forest analysis prioritized five key predictors (hemoglobin, INR, albumin, NT-proBNP, age). Comparative evaluation against the 9-variable model confirmed superior performance of the comprehensive model across all timepoints. CONCLUSION: Our multimodal prognostic model demonstrated robust performance in predicting all-cause mortality among patients with coronary artery disease and diabetes comorbidity. The nomogram's capacity for personalized risk estimation offers potential utility in clinical decision-making and patient stratification.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。