MSRS-DETR: End-to-End Object Detection for Multi-Scale Remote Sensing

MSRS-DETR:面向多尺度遥感的端到端目标检测

阅读:1

Abstract

Remote sensing imagery (RSI) object detection is critical to many applications, yet mainstream detectors analyse only spatial features and, because of spectral bias, fail to learn high-frequency information adequately, resulting in performance bottlenecks under cluttered backgrounds, distractors, and multi-scale targets, especially small ones. To break these limitations, we propose MSRS-DETR, an end-to-end framework that deeply fuses spatial and frequency cues. The approach introduces three key innovations: (1) C2f(FAT)NET, a frequency-attention-enhanced lightweight residual backbone that provides richer dual-domain features with fewer parameters; (2) an Entanglement Transformer Block (ETB) in the encoder that refines deep semantics via cross-domain frequency-spatial interaction and suppresses background interference; and (3) S2-CCFF, a shallow-feature-extended bidirectional fusion path that markedly improves the retention and utilisation of fine details for small objects. Experiments on HRSC2016 and ShipRSImageNet demonstrate the effectiveness and generalisation of this spatial-frequency paradigm: relative to the baseline, MSRS-DETR reduces parameters by 29.1%, boosts inference speed by 12.4% and 8.4%, and raises mAP(50-95) by 1.69% and 2.16%, respectively.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。