Abstract
AIMS: This study aims to enhance the preoperative diagnosis of non-mass breast lesions (NMLs) by validating radiomics-based machine learning models and assessing their performance alone and in combination with clinical ultrasound features to distinguish benign from malignant lesions. METHODS: A total of 123 NMLs from 119 patients with confirmed pathology were analyzed. Patients were split into a training cohort (n = 98) and a validation cohort (n = 25). From each ultrasound image, 1558 radiomics features were extracted. After dimensionality reduction and feature selection, 10 key features were retained. Predictive models were developed using logistic regression (LR), linear regression, support vector machine (SVM), random forests, Extremely Randomized Trees (Extra Trees), and Light Gradient Boosting Machine (LightGBM). A clinical model was built using LR based on ultrasound findings such as calcification, high resistance index, and axillary lymph node enlargement. A combined model incorporated both radiomics and clinical features. Model performance was evaluated using receiver operating characteristic (ROC) curves and decision curve analysis (DCA). RESULTS: The LightGBM model achieved the highest radiomics-only performance (AUC: 0.932 training; 0.867 validation). The clinical model achieved AUCs of 0.837 (training) and 0.790 (validation). The combined model outperformed both, with AUCs of 0.973 (training) and 0.933 (validation), and showed superior clinical benefit in DCA. CONCLUSIONS: Combining radiomics with clinical ultrasound data significantly improves diagnostic accuracy for NMLs, supporting better differentiation between benign and malignant lesions and aiding clinical decision-making.