Machine learning model development and validation using SHAP: predicting 28-day mortality risk in pulmonary fibrosis patients

利用SHAP进行机器学习模型开发和验证:预测肺纤维化患者的28天死亡风险

阅读:1

Abstract

BACKGROUND: Early prediction of mortality risk within 28 days of admission is crucial for personalized treatment in patients with pulmonary fibrosis (PF). This study aims to develop a predictive model for 28-day mortality risk in PF patients using interpretable machine learning (ML) methods. METHODS: Data from patients with pulmonary fibrosis were extracted from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database. The study endpoint was mortality within 28 days of admission. Feature selection was performed using logistic regression and LASSO algorithms. Six machine learning algorithms-decision tree, k-nearest neighbors (KNN), LightGBM, single-hidden-layer neural network, support vector machine (SVM), and extreme gradient boosting (XGBoost)-were employed to construct risk prediction models. Additionally, SHapley Additive exPlanations (SHAP) were utilized to interpret the predictive models. RESULTS: Among the six evaluated machine learning models, the LightGBM model demonstrated robust predictive performance, with an area under the receiver operating characteristic curve (AUC) of 0.819. SHAP analysis revealed that length of ICU stay, respiratory rate, and white blood cell count were the three most important features for predicting 28-day mortality risk in PF patients, with ICU stay duration having the most significant impact. CONCLUSION: This study indicates that machine learning methods hold potential for early prediction of mortality risk within 28 days of admission in patients with pulmonary fibrosis. Moreover, SHAP analysis enhanced the interpretability of the LightGBM model, thereby guiding clinical decision-making.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。