Explainable SHAP-XGBoost models for identifying important social factors associated with the atherosclerotic cardiovascular disease risk score using the LASSO feature selection technique

利用 LASSO 特征选择技术,构建可解释的 SHAP-XGBoost 模型,以识别与动脉粥样硬化性心血管疾病风险评分相关的重要社会因素。

阅读:1

Abstract

OBJECTIVES: Extensive evidence indicates that social factors play an essential role in explaining atherosclerotic cardiovascular disease (ASCVD). This study aimed to examine which social factors are associated with the estimated 10-year ASCVD risk score among male and female adults, incorporating both multifaceted social network components and conventional risk factors. METHODS: Using data from 4,368 middle-aged Korean adults, we explored factors most likely to explain ASCVD risk with interpretable machine learning algorithms. The ASCVD risk was determined using the 10-year ASCVD risk score, as calculated using pooled cohort equations. Social network components were assessed through the name generator module. A total of 52 variables were included in the model. RESULTS: For male participants (area under the receiver operating characteristic curve [AUC], 0.65), the average years known for network members contributed most to ASCVD risk prediction (mean Shapley additive explanations value, 0.31), followed by spouse's education level (0.22), medical history with diagnosis (0.18), and snoring frequency (0.14). By contrast, for female participants (AUC, 0.60), medical history with diagnosis was the strongest predictor (0.47), followed by logged income (0.21), education level (0.19), and the average number of years known in network members (0.17). CONCLUSIONS: Several important social factors were associated with the ASCVD risk score in both male and female adults. However, longitudinal research is needed to determine whether these factors predict future ASCVD events.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。