DMSCA: dynamic multi-scale channel-spatial attention for enhanced feature representation in convolutional neural networks

DMSCA:动态多尺度通道空间注意力机制,用于增强卷积神经网络中的特征表示

阅读:3

Abstract

While attention mechanisms significantly enhance feature representation in Convolutional Neural Networks (CNNs), existing approaches often suffer from limited receptive fields, insufficient directional modeling, and static fusion strategies that treat channel and spatial domains in isolation. To address these challenges, we propose the Dynamic Multi-Scale Channel-Spatial Attention (DMSCA) mechanism. This plug-and-play module synergistically integrates six cohesive components to achieve deep feature coupling. Specifically, DMSCA introduces Temperature-controlled Channel Attention (TCA) to dynamically regulate the sharpness of attention distributions via a learnable temperature parameter, and a Direction-aware Multi-scale Spatial Context Encoder (MSCE) that captures granular details across varying kernel sizes while preserving precise positional cues through orthogonal interaction. Crucially, unlike fixed-structure methods such as CBAM, our Dynamic Feature Fusion (DFF) employs a learnable gating mechanism to adaptively weight and fuse channel-spatial information based on pixel-wise input content. Extensive experiments on CIFAR-10/100 and ImageNet demonstrate that DMSCA consistently outperforms state-of-the-art attention mechanisms. Notably, it achieves a 1.52% Top-1 accuracy gain on ImageNet with a ResNet-50 backbone. Detailed analysis confirms that DMSCA offers superior robustness against image degradation and generalization capabilities with a modest computational trade-off (11.3% parameter and 2.4% FLOPs increase).

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。