Binary and Ternary Classification Prediction for Breast Cancer and Breast Sclerosing Adenosis With Interpretable Artificial Intelligence From Clinical and Imaging Features: A Retrospective, Diagnostic Accuracy Cohort Study

基于临床和影像特征的可解释人工智能在乳腺癌和乳腺硬化性腺病二元和三元分类预测中的应用:一项回顾性诊断准确性队列研究

阅读:2

Abstract

BACKGROUND: Sclerosing adenosis (SA) and breast cancer (BC) often exhibit overlapping clinical, imaging, and pathological characteristics, making them difficult to differentiate. SA may also coexist with BC (SA + BC), including ductal carcinoma in situ (SA-DCIS) and invasive breast cancer (SA-IBC), which complicates diagnosis even when core-needle biopsy (CNB) suggests SA. This study aimed to develop interpretable AI-based binary and ternary classification models that leverage clinical and imaging features to distinguish SA-only from SA + BC and to further differentiate among SA-only, SA-DCIS, and SA-IBC. METHODS: We retrospectively analyzed a cohort of 726 patients with SA (January 2006 to December 2021), comprising 537 SA-only and 189 SA + BC cases (90 SA-DCIS, 99 SA-IBC). Multiple machine learning algorithms-logistic regression, support vector machine, decision tree, XGBoost, and random forest-were compared using AUC, accuracy, F1-score, and C-index. Model interpretability was assessed with SHAP to elucidate feature contributions and identify key predictors. Additionally, we incorporated an independent external validation cohort consisting of 113 patients to verify the model's effectiveness. RESULTS: XGBoost consistently outperformed other algorithms in both tasks. Eight features emerged as most informative: age, ultrasound BI-RADS category, maximum and minimum ultrasound diameters, ultrasound margin characteristics, biopsy procedure, mammographic density, and microcalcifications. For binary classification (SA-only vs. SA + BC), XGBoost achieved an AUC of 0.925, accuracy of 0.883, and C-index of 0.844. For ternary classification (SA-only, SA-DCIS, SA-IBC), the model achieved an AUC of 0.888, accuracy of 0.811, and C-index of 0.813. Age, ultrasound BI-RADS, and minimum lesion diameter were consistently top predictors. We further proposed a three-tier interpretability framework (global, cohort-level; local, subgroup-level; and individual, case-level) to facilitate clinical translation. CONCLUSION: Given the substantial risk of coexisting of SA with DCIS or IBC, and the potential for CNB to underestimate disease due to limited sampling, lesions diagnosed as SA on CNB should be evaluated with additional modalities before determining the need for surgical excision. The proposed interpretable AI model enhances discrimination between SA-only and SA with concomitant breast cancer (SA + BC), thereby supporting more informed clinical decision-making in breast disease management.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。