Abstract
Ovarian cancer's high mortality is primarily due to late-stage diagnosis, underscoring the critical need for improved early detection tools. This study develops and validates explainable artificial intelligence (XAI) models to discriminate malignant from benign ovarian masses using readily available demographic and laboratory data. A dataset of 309 patients (140 malignant, 169 benign) with 47 clinical parameters was analyzed. The Boruta algorithm selected 19 significant features, including tumor markers (CA125, HE4, CEA, CA19-9, AFP), hematological indices, liver function tests, and electrolytes. Five ensemble machine learning algorithms were optimized and evaluated using repeated stratified 5-fold cross-validation. The Gradient Boosting model achieved the highest performance with 88.99% (±3.2%) accuracy, 0.934 AUC-ROC, and 0.782 Matthews correlation coefficient. SHAP analysis identified HE4, CEA, globulin, CA125, and age as the most globally important features. Unlike black-box approaches, our XAI framework provides clinically interpretable decision pathways through LIME and SHAP visualizations, revealing how feature values push predictions toward malignancy or benignity. Partial dependence plots illustrated non-linear risk relationships, such as a sharp increase in malignancy probability with CA125 > 35 U/mL. This explainable approach demonstrates that ensemble models can achieve high diagnostic accuracy using routine lab data alone, performing comparably to established clinical indices while ensuring transparency and clinical plausibility. The integration of state-of-the-art XAI techniques highlights established biomarkers and reveals potential novel contributors like inflammatory and hepatic indices, offering a pragmatic, scalable triage tool to augment existing diagnostic pathways, particularly in resource-constrained settings.