A multidimensional clinical prediction model for early screening of recurrent spontaneous abortion: integrating coagulation, immune, and endocrine markers

用于早期筛查复发性自然流产的多维度临床预测模型:整合凝血、免疫和内分泌标志物

阅读:1

Abstract

OBJECTIVE: Recurrent spontaneous abortion (RSA) affects 0.5%-2.5% of fertile couples and arises from complex, interacting thrombotic, immune, coagulation, endocrine-metabolic, and demographic factors. However, current early risk stratification in routine practice remains insufficient for population-level screening. We aimed to develop an accurate, low-cost, and clinically feasible early screening model for identifying women at high risk of RSA using routinely available clinical biomarkers. METHODS: This retrospective study enrolled women attending Guangdong Reproductive Hospital between 1 January 2020 and 31 December 2024. Among 1226 screened individuals, 285 met eligibility criteria and were included (181 RSA patients and 104 healthy controls). Demographic and laboratory variables were extracted from electronic medical records and structured follow-up. Ten classical machine-learning algorithms and a Transformer-based tabular model (TabPFN) were trained and compared. Class imbalance was handled using the synthetic minority oversampling technique (SMOTE). Model robustness was evaluated using 5-fold cross-validation. Biological-domain contributions were quantified through ablation analysis. Feature selection was optimized using recursive feature elimination with random forest (RFE-RF), and interpretability was assessed via SHAP. RESULTS: The TabPFN Multidimensional model integrating features across six clinical domains achieved the best discriminative performance for RSA risk prediction (ROC-AUC = 0.927, 95% CI 0.891-0.947), outperforming all comparator algorithms. Domain ablation showed that removing any single biological category reduced performance, supporting the complementary value of multidimensional clinical integration. Acquired thrombophilia markers provided the strongest predictive contribution, followed by hereditary thrombophilia, immune indices, coagulation parameters, endocrine-metabolic variables, and demographic factors. A parsimonious six-biomarker model-anti-phosphatidylserine/prothrombin antibodies (aPS/PT), protein C (PC), antinuclear antibodies (ANA), antithrombin III (AT-III), thrombin time (TT), and body mass index (BMI)-retained high discrimination (AUC = 0.925) with 83% accuracy, supporting a pragmatic and cost-effective screening strategy. SHAP analysis identified elevated aPS/PT, ANA positivity, reduced AT-III activity, and prolonged TT as the most influential predictors, implicating thrombo-immune dysregulation as a central mechanism associated with RSA. CONCLUSION: A Transformer-based tabular model using six routinely measured, low-cost biomarkers enable accurate, interpretable, and scalable early screening for RSA risk, with potential utility in resource-limited settings to facilitate timely referral and targeted preventive management.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。