A SHAP-interpretable XGBoost model: MRI-based intratumoral perfusion heterogeneity predicts HER2-zero, -low, and -positive ternary expression status in breast cancer

基于SHAP算法的XGBoost模型:MRI肿瘤内灌注异质性预测乳腺癌HER2零表达、低表达和阳性三元表达状态

阅读:2

Abstract

OBJECTIVES: This study aimed to predict HER2 status (HER2-zero, -low, and -positive) in breast cancer using MRI perfusion heterogeneity. The SHapley Additive exPlanations (SHAP) method was employed to interpret the outputs of machine learning models, which is crucial for guiding treatment with novel antibody-drug conjugates (ADCs). MATERIALS AND METHODS: The retrospective study included 912 women from three centers (Center A [n = 570] as the training cohort, and Centers B [n = 173] and C [n = 169] as external test cohorts) who underwent MRI between April 2018 and March 2024. Voxel vectors from MRI perfusion parameters (wash-in, wash-out, wash-out ratio) were clustered into subregions using k-means clustering. Radiomics features were extracted, and an XGBoost model incorporating these features was used to build the Habitat model. SHAP was applied to evaluate feature contributions and their importance. RESULTS: Four sub-regions of tumor perfusion patterns were identified, containing 8, 8, 8, and 10 radiomics features, respectively. The Habitat model achieved AUCs of 0.902 for HER2-zero, 0.877 for HER2-low, and 0.880 for HER2-positive in Center A. In the external test cohorts, AUCs were 0.873, 0.845, and 0.865 for Center B and 0.865, 0.844, and 0.878 for Center C, respectively. SHAP analysis revealed the radiomic features that most strongly contributed to distinguishing HER2-zero, -low, and -positive tumors across the four perfusion-derived subregions. The global SHAP results identified subregion-specific features with the highest influence on model decisions, while the local SHAP explanations clarified how individual feature patterns drove prediction outcomes for specific patients. CONCLUSION: The Habitat model accurately predicts HER2-zero, HER2-low, and HER2-positive expression status, while SHAP clarifies the contribution of subregion-derived radiomic features and enhances the overall interpretability and clinical transparency of the prediction framework. CLINICAL TRIAL NUMBER: Not applicable. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40644-026-01000-4.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。