Abstract
BACKGROUND: Preterm birth (PTB) is a major cause of neonatal morbidity and mortality. Inflammation and metabolic disruption are involved in its pathology. This study aimed to assess maternal serum inflammatory and lipid markers as predictors of PTB using various machine learning models. METHODS: Women who were pregnant and attending antenatal clinics were recruited for this study. A group of 186 females who had their births before 37 weeks was marked PTB. One hundred forty women who delivered at term and met the same eligibility criteria were selected from the same source population during the study period. T-tests were used to evaluate variations in baseline and clinical parameters. Pearson correlations were visualized via a heatmap. We built models for random forests (RF), logistic regression (LR), XGBoost, and support vector machine (SVM) using a 70/30 train/test split and 5-fold cross-validation. Models performance was measured using accuracy and AUC. RESULTS: CRP (r ≈ 0.45), IL-6 (r ≈ 0.40), C3 (r ≈ 0.31), BMI, and lipids correlated positively with PTB, whereas HDL correlated inversely (r ≈ - 0.13). Multivariable logistic regression identified age, BMI, IL-6, C3, and CRP as independent predictors. All ML models showed good discrimination (test AUC ≥ 0.819); LR performed best (accuracy 78.57%, AUC 0.849) with cross-validated AUCs around 0.86-0.87 across models. SHAP analysis confirmed that IL-6, BMI, CRP, age, and C3 were dominant contributors to PTB risk. CONCLUSIONS: Maternal inflammation and high BMI are important risk factors for PTB in this cohort. The LR model combining clinical and serum measures is as good a predictor as a complex ML algorithm. It is an interpretable model that can help with risk assessment at an early stage in similar settings.