Development of a machine learning-based early prediction model for disease severity in acute pancreatitis

开发基于机器学习的急性胰腺炎疾病严重程度早期预测模型

阅读:1

Abstract

BACKGROUND: Severe acute pancreatitis (SAP) represents the most severe form of acute pancreatitis (AP) and is associated with considerable morbidity, mortality, and socioeconomic burden. Early identification of critically ill patients is crucial for disease management in AP. However, commonly used scoring systems are often complex, time-consuming, and lack interpretability. Therefore, the purpose of this study is to develop a machine learning-based model for early prediction of severity in AP and to perform interpretability analysis. METHODS: A total of 528 AP patients meeting the inclusion criteria were screened via the hospital electronic medical record system and randomly divided into training and testing sets at a 7:3 ratio. Least Absolute Shrinkage and Selection Operator (LASSO) regression was employed for variable selection. Four machine learning algorithms, Logistic Regression, extreme Gradient Boosting (XGBoost), Random Forest, and Support Vector Machine (SVM), were utilized to construct prediction models. The optimal model was identified by comparing the following metrics between the training and testing sets: area under the receiver operating characteristic curve (AUC), Kappa value, accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and F1 score. In addition, the calibration curve and Decision Curve Analysis (DCA) within the testing set were evaluated. SHapley Additive exPlanations (SHAP) analysis was applied to interpret the selected model. RESULTS: Comparative evaluation of the machine learning models revealed that the SVM model demonstrated the best overall performance. The SVM model achieved an AUC = 0.819, Kappa = 0.509, accuracy = 0.861, sensitivity = 0.615, specificity = 0.909, PPV = 0.571, NPV = 0.923, and F1 score = 0.593 on the testing set, respectively. SHAP analysis indicated that AP severity increased with higher SHAP values. Elevated levels of the Simple Prognostic Score (SPS), respiratory rate, and High density lipoprotein cholesterol (HDL-C), along with the presence of peritoneal effusion, were associated with the occurrence of SAP. CONCLUSIONS: The developed SVM model shows promising potential for distinguishing AP severity levels. It holds significant clinical utility, particularly for its high effectiveness in excluding non-SAP patients.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。