Skull-stripping induces shortcut learning in MRI-based Alzheimer's disease classification

颅骨剥离术可诱导基于磁共振成像的阿尔茨海默病分类中的捷径学习

阅读:1

Abstract

OBJECTIVES: High classification accuracy of Alzheimer's disease (AD) from structural MRI has been achieved using deep neural networks, yet the specific image features contributing to these decisions remain unclear. In this study, the contributions of T1-weighted (T1w) gray-white matter texture, volumetric information, and preprocessing-particularly skull-stripping-were systematically assessed. MATERIALS AND METHODS: A dataset of 990 matched T1w MRIs from AD patients and cognitively normal controls from the ADNI database was used. Preprocessing was varied through skull-stripping and intensity binarization to isolate texture and shape contributions. A 3D convolutional neural network was trained on each configuration, and classification performance was compared using exact McNemar tests with discrete Bonferroni-Holm correction. Feature relevance was analyzed using Layer-wise Relevance Propagation, image similarity metrics, and spectral clustering of relevance maps. RESULTS: Despite substantial differences in image content, classification accuracy, sensitivity, and specificity remained stable across preprocessing conditions. Models trained on binarized images preserved performance, indicating minimal reliance on gray-white matter texture. Instead, volumetric features-particularly brain contours introduced through skull-stripping-were consistently used by the models. CONCLUSION: This behavior reflects a shortcut learning phenomenon, where preprocessing artifacts act as potentially unintended cues. The resulting Clever Hans effect emphasizes the critical importance of interpretability tools to reveal hidden biases and to ensure robust and trustworthy deep learning in medical imaging. CRITICAL RELEVANCE STATEMENT: We investigated the mechanisms underlying deep learning-based disease classification using a widely utilized Alzheimer's disease dataset, and our findings reveal a reliance on features induced through skull-stripping, highlighting the need for careful preprocessing to ensure clinically relevant and interpretable models. KEY POINTS: Shortcut learning is induced by skull-stripping applied to T1-weighted MRIs. Explainable deep learning and spectral clustering estimate the bias. Highlights the importance of understanding the dataset, image preprocessing and deep learning model, for interpretation and validation.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。