Prediction of cardiovascular disease based on multiple feature selection and improved PSO-XGBoost model

基于多特征选择和改进的PSO-XGBoost模型的心血管疾病预测

阅读:1

Abstract

Cardiovascular disease is a common disease that threatens human health. In order to predict it more accurately, this paper proposes a cardiovascular disease prediction model that combines multiple feature selection, improved particle swarm optimization algorithm, and extreme gradient boosting tree. Firstly, the dataset is preprocessed, and an XGBoost cardiovascular disease prediction model is constructed for model training and compare it with other algorithms. Then, combined with two factor Pearson correlation analysis and feature importance ranking, multiple feature selection is performed, with the optimal feature subset as the feature input. Finally, the improved particle swarm optimization algorithm is used to adjust the hyperparameters of the extreme gradient boosting tree algorithm, and selecting the optimal hyperparameter combination to construct the MFS-DLPSO-XGBoost model. The recall, precision, accuracy, F1 score, and area under the ROC curve (AUC) of the MFS-DLPSO-XGBoost model reached 71.4%, 76.3%, 74.7%, 73.6%, and 80.8%, respectively, which increased by 3.6%, 3.2%, 2.7%, 3.2%, and 2.3% compared to XGBoost. The results indicate that the model proposed in this article has good classification performance and can provide assistance for doctors and patients in predicting and preventing heart disease.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。