A Multicohort Machine Learning Framework to Predict Mortality in Elderly Patients With Heart Disease: Insights From HARLS, SHARE, and HRS

基于多队列机器学习框架预测老年心脏病患者死亡率:来自 HARLS、SHARE 和 HRS 的启示

阅读:2

Abstract

BACKGROUND: Elderly patients with heart disease face elevated mortality risk, yet predictive models specifically tailored for this population across different global regions remain limited. Current mortality prediction tools often lack cross-cultural validation and interpretability, hindering their clinical application in diverse healthcare settings. METHODS: We developed and validated machine learning models for predicting mortality in elderly heart disease patients using data from three major aging cohorts: the China Health and Retirement Longitudinal Study (CHARLS, n = 2130), the Survey of Health, Ageing and Retirement in Europe (SHARE, n = 10,928), and the Health and Retirement Study (HRS) from the United States (n = 4835). Boruta feature selection identified 27 common predictors across cohorts. Eleven machine learning algorithms were trained on the SHARE cohort (70% training and 30% testing) and externally validated on CHARLS and HRS cohorts. Model performance was assessed using area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and calibration metrics. SHapley Additive exPlanations (SHAP) analysis was employed to interpret model predictions. RESULTS: XGBoost demonstrated superior performance with the highest average AUC (0.798) across all datasets, showing excellent generalizability from the SHARE training set (AUC: 0.805) to internal validation (AUC: 0.799) and external validation in HRS (AUC: 0.821) and CHARLS (AUC: 0.770) cohorts. Age consistently emerged as the most influential predictor across all cohorts (SHAP values: 0.056-0.102), followed by gender, moderate physical activity, and self-rated health, though their relative importance varied by cohort. Feature dependence analysis revealed important nonlinear relationships, including U-shaped associations between grip strength and mortality risk. CONCLUSION: Our multicohort machine learning approach successfully developed a robust, interpretable model for predicting mortality in elderly heart disease patients across diverse global populations. The model's strong performance in external validation demonstrates its potential for cross-cultural clinical application, while SHAP analysis provides valuable insights into population-specific risk factors that could guide targeted interventions.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。