Development and validation of an explainable machine learning-based prediction model for primary Kawasaki disease complicated with coronary artery aneurysms

开发和验证基于可解释机器学习的川崎病合并冠状动脉瘤预测模型

阅读:1

Abstract

BACKGROUND: Kawasaki disease (KD) can lead to coronary artery aneurysms (CAA) in approximately 1 in 5 untreated children despite intravenous immunoglobulin (IVIG) therapy in the acute phase. The aim of this study is to develop and validate an explainable machine learning (ML)-based prediction model for CAA in KD. METHODS: This study retrospectively analyzed the clinical data of children diagnosed with primary KD at Nanchong Central Hospital, Sichuan Province between 2015 and 2023. Six models, including support vector machine (SVM), K-nearest neighbors (KNN), least absolute shrinkage and selection operator (Lasso), extreme gradient boosting (XGBoost), random forest (RF), and multilayer perceptron (MLP), based on ML algorithms were developed. The model with optimal performance was validated and the explainable SHapley Additive exPlanations (SHAP) analysis was used. RESULTS: A total of 327 children diagnosed with KD were included in the training set and validation set. Receiver operator characteristic curve analysis showed that XGBoost based model exhibited an optimal performance among the six models. Moreover, for a given CAA positive sample, the sum of the SHAP values of all variables of XGBoost represented the individual deviation from the mean predicted from the entire dataset. CONCLUSIONS: The XGBoost algorithm-based explainable model might be used to predict the occurrence of CAA in children with KD.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。