Development and validation of an early predictive model for hemiplegic shoulder pain: a comparative study of logistic regression, support vector machine, and random forest

偏瘫肩痛早期预测模型的开发与验证:逻辑回归、支持向量机和随机森林的比较研究

阅读:1

Abstract

OBJECTIVE: In this study, we aim to identify the predictive variables for hemiplegic shoulder pain (HSP) through machine learning algorithms, select the optimal model and predict the occurrence of HSP. METHODS: Data of 332 stroke patients admitted to a tertiary hospital in Zhejiang Province from January 2022 to January 2023 were collected. After screening predictive variables by LASSO regression, three predictive models selected using the LazyPredict package, namely logistic regression (LR), support vector machine (SVM) and random forest (RF), were established respectively. The performance parameters (accuracy, precision, recall, and F1 score) of the models were calculated, the receiver operating characteristic curve (ROC) and the decision curve analysis (DCA) were plotted to compare the performance of the three models. An explainability analysis (SHAP) was conducted on the optimal model. RESULTS: The RF model performed the best, with accuracy: 0.90, precision: 0.89, recall: 0.88, F1 score: 0.86, AUC-ROC: 0.94, and the range of the threshold probability in DCA: 7%-99%. Based on the SHAP analysis of the explainability of the RF model, the contribution degrees of the early HSP predictive variables from high to low are as follows: multiple injuries, shoulder joint flexion (p), biceps tendon effusion, sensory disorder, supraspinatus tendinopathy, subluxation, diabetes, and age. CONCLUSION: The RF prediction model has a good predictive effect on HSP and has good clinical explainability. It can provide objective references for the early warning and stratified management of HSP.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。