Comprehensive Sepsis Risk Prediction in Leukemia Using a Random Forest Model and Restricted Cubic Spline Analysis

Abstract

BACKGROUND: Sepsis is a severe complication in leukemia patients and contributes to high mortality. Identifying early predictors of sepsis is crucial for timely intervention. This study aimed to develop and validate a machine learning model that predicts sepsis risk in leukemia patients.

METHODS: This retrospective study included 4310 leukemia patients admitted to the Affiliated Hospital of Guangdong Medical University from 2005 to 2024, with 70% assigned to the training cohort and 30% to the validation cohort. Feature selection was performed using univariate logistic regression, LASSO, and the Boruta algorithm, followed by multivariate logistic regression analysis. Seven machine learning models were constructed and evaluated using receiver operating characteristic (ROC) curves and decision curve analysis (DCA). Shapley additive explanations (SHAP) were applied to interpret the results, and restricted cubic spline (RCS) regression was used to explore nonlinear relationships between the variables and sepsis risk. Interactions among the predictors were also examined to better understand how they relate to one another.

RESULTS: The random forest (RF) model outperformed all others, achieving an AUC of 0.765 in the training cohort and 0.700 in the validation cohort. Key predictors of sepsis identified by SHAP analysis included C-reactive protein (CRP), procalcitonin (PCT), neutrophil count (Neut), lymphocyte count (Lymph), thrombin time (TT), red blood cell count (RBC), total bile acid (TBA), and systolic blood pressure (SBP). RCS analysis revealed significant nonlinear associations between sepsis risk and CRP, PCT, Neut, Lymph, TT, RBC, and SBP. Pairwise correlation analysis further revealed interactions among these variables.

CONCLUSION: The RF model exhibited robust predictive power for sepsis in leukemia patients, providing clinicians with a valuable tool for early risk assessment and the optimization of treatment strategies.
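The restricted cubic spline step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the software, the number of knots, or where the knots were placed, so everything below is an assumption: the function name `rcs_basis` is hypothetical, the basis follows the commonly used truncated-power parametrization (linear beyond the outer knots, scaled by the squared knot range), and the CRP knot values are made-up illustration values, not study data.

```python
def rcs_basis(x, knots):
    """Return the k-1 restricted cubic spline basis terms for a value x.

    Assumed parametrization: truncated-power basis constrained to be
    linear beyond the first and last knot, with the nonlinear terms
    divided by (last knot - first knot)^2 to keep them on the scale of x.
    """
    t = sorted(knots)
    k = len(t)
    if k < 3:
        raise ValueError("a restricted cubic spline needs at least 3 knots")
    if len(set(t)) != k:
        raise ValueError("knots must be distinct")

    def cube_plus(u):
        # Truncated cubic: u^3 when u is positive, otherwise 0.
        return u ** 3 if u > 0 else 0.0

    scale = (t[-1] - t[0]) ** 2
    gap = t[-1] - t[-2]

    terms = [float(x)]  # the first basis term is x itself (linear part)
    for j in range(k - 2):
        a = (t[-1] - t[j]) / gap
        b = (t[-2] - t[j]) / gap
        value = (
            cube_plus(x - t[j])
            - a * cube_plus(x - t[-2])
            + b * cube_plus(x - t[-1])
        )
        terms.append(value / scale)
    return terms


# Hypothetical knot locations for CRP in mg/L (illustration only).
crp_knots = [5.0, 20.0, 60.0, 150.0]
for crp in (2.0, 30.0, 200.0):
    print(crp, rcs_basis(crp, crp_knots))
```

In a regression model, these basis terms would replace the raw predictor, and a joint test of the coefficients on the nonlinear terms is one common way to assess whether the association departs from linearity. With four knots the predictor contributes three columns: one linear and two nonlinear.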
