Abstract
Background and Objective: Changes in female sex hormone levels are closely linked to the development and progression of uterine fibroids (UFs). Clinical approaches to fibroid management vary according to guidelines and depend on patient symptoms, fibroid size, and clinician judgment. Despite available diagnostic tools, surgical decisions remain largely subjective. With the advancement of artificial intelligence (AI) and clinical decision support technologies, clinical experience can now be transferred into data-driven computational models trained with hormone-based parameters. To develop a clinical decision support algorithm that predicts surgical necessity for uterine fibroids by integrating fibroid characteristics and female sex hormone levels. Methods: This multicenter study included 618 women with UFs who presented to three hospitals; 238 underwent surgery. Statistical analyses and artificial intelligence-based modeling were performed to compare surgical and non-surgical groups. Training was conducted with each hormone-follicle-stimulating hormone (FSH), luteinizing hormone (LH), estrogen (E2), prolactin (PRL), and anti-Müllerian hormone (AMH)-and with 126 input combinations including hormonal and morphological variables. Five supervised learning algorithms-support vector machine, decision tree, random forest, and k-nearest neighbors-were applied, resulting in 630 trained models. In addition to this retrospective development phase, a prospective validation was conducted in which 20 independent clinical cases were evaluated in real time by a gynecologist blinded to both the model predictions and the surgical outcomes. Agreement between the clinician's assessments and the model outputs was measured. Results: FSH, LH, and PRL levels were significantly lower in the surgery group (p < 0.001, 0.009, and <0.001, respectively), while E2 and AMH were higher (p = 0.012 and 0.001). Fibroid volume was also greater among surgical cases (90.8 cc vs. 73.1 cc, p < 0.001). The random forest model using LH, FSH, E2, and AMH achieved the highest accuracy of 91 percent. In the external validation phase, the model's predictions matched the blinded gynecologist's decisions in 18 of 20 cases, corresponding to a 90% concordance rate. The two discordant cases were later identified as borderline scenarios with clinically ambiguous surgical indications. Conclusions: The decision support algorithm integrating hormonal and fibroid parameters offers an objective and data-driven approach to predicting surgical necessity in women with UFs. Beyond its strong internal performance metrics, the model demonstrated a high level of clinical concordance during external validation, achieving a 90% agreement rate with an independent, blinded gynecologist. This alignment underscores the model's practical reliability and its potential to reduce subjective variability in surgical decision-making. By providing a reproducible and clinically consistent framework, the proposed AI-based system represents a meaningful advancement toward the validated integration of computational decision tools into routine gynecological practice.