Development of an ensemble prediction model for acute graft-versus-host disease in allogeneic transplantation based on machine learning

基于机器学习的同种异体移植急性移植物抗宿主病集成预测模型的开发

阅读:1

Abstract

BACKGROUND: Acute graft-versus-host disease (aGVHD) is a major post-transplantation complication and one of the most significant causes of non-relapse-related death. However, the massive and complex clinical data make aGVHD difficult to predict. Machine learning (ML), a branch of artificial intelligence, has since been introduced in medicine due to its ability to process complex, high-dimensional variables quickly and capture nonlinear relationships. However, the effects of immunosuppressants exposure was not considered in previous ML models. Thus, the purpose of this study was to develop and optimize models by Cox regression and machine learning algorithms to predict the risk of aGVHD in which cyclosporin A exposure and common clinical factors were included as variables. METHODS: The data was preprocessed in the first step, and was randomly allocated at an 8:2 ratio. Cox regression model was constructed on the training set. Meanwhile, correlation analysis and recursive feature elimination were used for feature screening before machine learning model development. Then fifteen algorithms were used to establish models, and an ensemble model was established through soft voting based on the top five performance algorithms. Area under curve (AUC) was the main metric used to evaluate the model performance in the validation set, while nomogram and SHAP were applied to interpret the variables. RESULT: A total of 479 patients and 47 variables were included in the study. The incidence of grade II-IV aGVHD was 33.61%. The AUC of Cox regression model in the validation set was 0.625. In contrast, the new ensemble model has a better prediction ability (AUC = 0.776, Accuracy = 0.729, Precision = 0.667, Recall = 0.375, F1-score = 0.480). Except for the variables which were identified by previous studies, some rarely reported risk factors were found, such as quinolone, blood urea nitrogen and alkaline phosphatase. CONCLUSIONS: In summary, a new ensemble model with promising accuracy was established to predict grade II-IV classic aGVHD in allo-HSCT patients. It will help identify high-risk patients at an early stage and thus reduce the incidence of aGVHD. CLINICAL TRIAL NUMBER: Not applicable.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。