Establishment and Validation of a Machine Learning-Based Prediction Model for Termination of Pregnancy via Cesarean Section

建立和验证基于机器学习的剖宫产终止妊娠预测模型

阅读:3

Abstract

OBJECTIVE: This study aimed to investigate the risk factors of cesarean section and establish a prediction model for cesarean section based on the characteristics of pregnant women. METHODS: The clinical characteristics of 2552 singleton pregnant women who delivered a live baby between January 2020 and December 2021 were retrospectively reviewed. They were divided into vaginal delivery group (n = 1850) and cesarean section group (n = 702). These subjects were divided into training set (2020.1-2021.6) and validation set (2021.7-2021.12). In the training set, univariate analysis, Lasso regression, and Boruta were used to screen independent risk factors for cesarean section. Four models, including Logistic Regression (LR), K-Nearest Neighbor (KNN), Classification and Regression Tree (CART), and Random forest (RF), were established in the training set using K-fold cross validation, hyperparameter optimization, and random oversampling techniques. The best model was screened, and Sort graph of feature variables, univariate partial dependency profile, and Break Down profile were delineated. In the validation set, the confusion matrix parameters were calculated, and receiver operating characteristic curve (ROC), precision recall curve (PRC), calibration curve, and clinical decision curve analysis (DCA) were delineated. RESULTS: The risk factors of cesarean section included age and height of women, weight at delivery, weight gain, para, assisted reproduction, abnormal blood glucose during pregnancy, pregnancy hypertension, scarred uterus, premature rupture of membrane (PROM), placenta previa, fetal malposition, thrombocytopenia, floating fetal head, and labor analgesia. RF had the best performance among the four models, and the accuracy of confusion matrix parameters was 0.8956357. The Matthews correlation coefficient (MCC) was 0.753012. The area under ROC (AUC-ROC) was 0.9790787, and the area under PRC (AUC-PRC) was 0.957888. CONCLUSION: RF prediction model for caesarean section has high discrimination performance, accuracy and consistency, and outstanding generalization ability.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。