Explainable and uncertainty-aware ensemble framework with causal analysis for breast cancer detection

一种可解释且考虑不确定性的集成框架，结合因果分析用于乳腺癌检测

阅读：2

作者：Zaheer Sajid,Muhammad,Fareed Hamid,Muhammad,Qureshi,Imran

期刊：	Frontiers in Oncology	影响因子：	3.300
时间：	2025	起止号：	2025;15:1751090
doi：	10.3389/fonc.2025.1751090	研究方向：	肿瘤
疾病类型：	乳腺癌

Abstract

Breast cancer is one of the main causes of cancer deaths around the world and is known for its aggressive growth and ability to spread. While machine learning has shown good results for diagnosis, most existing methods do not handle uncertainty or explain their predictions clearly. In this study, we present an integrated framework that combines uncertainty-aware ensemble learning with causal feature analysis and multimodal explainability for breast cancer prediction. The framework uses a mix of Light Gradient Boosting Machine (LightGBM), random forest, and gradient boosting classifiers that include uncertainty estimation so that the model can mark predictions that are less confident. It also applies causal analysis to detect possible clinical confounders and uses SHAP (Shapley Additive Explanations), permutation importance, and feature attribution for interpretation. Tests on two public datasets showed strong and consistent performance. On the UCTH Clinical Dataset, the model reached an area under the curve (AUC) of 0.97%, an accuracy of 0.95%, and an F1 score of 0.94%, with 100% precision for high confidence cases and no false positives. On the Breast Cancer Wisconsin dataset, it achieved an AUC of 0.99, an accuracy of 0.94%, and an F1 score of 0.92%, which increased to 0.98% accuracy and 0.98% F1 score when only certain predictions were considered. Causal analysis pointed out important clinical confounders like lymph node involvement, tumor size, and metastasis, while fairness tests showed balanced results across demographic groups. Overall, the framework combines uncertainty estimation and causal interpretability to give predictions that are both accurate and trustworthy. It provides clinicians with clear confidence levels for every prediction and supports transparent decision-making that can reduce diagnostic errors and improve reliability in clinical use.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。