Machine learning-based prediction algorithm of spontaneous preterm birth using multi-source data

基于机器学习的多源数据自发性早产预测算法

阅读:2

Abstract

BACKGROUND: Spontaneous preterm birth (sPTB) is a complex condition with unclear etiology, associated with increased neonatal risks. Early prediction of sPTB enables timely interventions to improve outcomes. Our study aimed to construct machine learning (ML) models to predict sPTB using multi-source data, including electronic health records (EHR) and environmental factors. METHODS: This retrospective cohort study included 54132 singleton pregnancies from Wuhan Children's Hospital (Wuhan Maternal and Child Healthcare Hospital) between December 2012 and December 2022. We collected multi-source predictors including demographics, routine prenatal tests, air pollution exposure, meteorological factors, and greenness exposure, resulting in a total of 82 predictors. Extreme gradient boosting (XGBoost), random forest (RF), support vector machine (SVM), and logistic regression (LR) models were used to construct predictive models of sPTB. Screening performance was assessed via the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC). Shapley additive explanation (SHAP) value was computed to assess the importance of each feature contributing to the prediction. RESULTS: The XGBoost model yielded the best performance in the test set with an AUROC of 0.926 and an AUPRC of 0.502. Eosinophils percentage, albumin, uric acid, amniotic fluid pocket, and sulfur dioxide exposure during late pregnancy were identified as the most important predictors of sPTB. CONCLUSIONS: Our results demonstrate that combining EHR data, environmental factors, and ML methods enables highly accurate and moderately precise predictions of sPTB. While the model shows promising discriminatory power, its precision requires improvement before clinical application.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。