Bridging the semantic gap in medical image segmentation via multi-scale dependency and attention-guided enhancement

通过多尺度依赖性和注意力引导增强来弥合医学图像分割中的语义鸿沟

阅读:1

Abstract

The encoder-decoder paradigm has emerged as the prevailing framework in medical image segmentation, and recent studies within this paradigm have demonstrated its remarkable effectiveness for lesion delineation. However, because the encoder compresses high-dimensional inputs and the decoder must reconstruct the target from the encoder's limited latent representation, a fixed encoder-decoder pipeline inevitably introduces a semantic gap between the two stages. To bridge this gap, we present MAFormer, a novel U-shaped network tailored for medical image segmentation. Specifically, we design a Multi-scale Dependency Feature Construction (MDFC) module that refines the skip-connection pathway to fuse semantic information across hierarchical levels. In addition, we propose an Attention Representation Reinforcement Module (ARRM) that strengthens encoder-decoder semantic alignment via bidimensional similarity computation and a hierarchical masking strategy. Extensive experiments on GlaS, Synapse and ISIC2018 datasets confirm that MAFormer consistently surpasses state-of-the-art encoder-decoder methods on both large and small scale datasets. In particular, it achieves higher Dice scores, underscoring the effectiveness of MAFormer in improving overall segmentation accuracy.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。