Explainable machine learning for predicting coronary heart disease risk in patients with carotid atherosclerosis: A retrospective study with SHAP and decision curve analysis

利用可解释机器学习预测颈动脉粥样硬化患者冠心病风险:一项基于SHAP和决策曲线分析的回顾性研究

阅读:2

Abstract

BACKGROUND: Carotid atherosclerosis is associated with increased coronary heart disease (CHD) risk, yet current risk models lack specificity and interpretability for this population. This study aimed to develop explainable machine learning (ML) models to predict CHD in these patients. METHODS: We retrospectively analyzed 487 patients with carotid atherosclerosis (191 CHD, 296 non-CHD) from January 2022 to July 2025. Thirty-eight variables were collected, including demographic, clinical, and biochemical indicators. LASSO regression identified six key predictors. Seven ML models were trained and evaluated using area under receiver operating characteristic curve (AUC), PRC-AUC, calibration curves, and decision curve analysis (DCA). SHAP was applied to interpret the best-performing model. RESULTS: Logistic regression model achieved the highest test-set performance (AUC = 0.827; PRC-AUC = 0.752), with strong generalizability and calibration. SHAP analysis identified age and diastolic blood pressure as the most influential features, aligning with model coefficients. DCA demonstrated superior clinical net benefit of the logistic regression model across probability thresholds. CONCLUSION: A six-variable logistic model provides accurate and interpretable CHD risk prediction in patients with carotid atherosclerosis. Its transparency and clinical utility support its integration into personalized risk management.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。