A Scale-Adaptive Aggregation and Multi-Domain Feature Fusion Architecture for Small-Target Detection in UAV Aerial Imagery

Abstract

Vision-based unmanned aerial vehicles (UAVs) have been widely studied and applied in aerial monitoring tasks; however, detecting small objects in UAV imagery remains challenging due to limited visual features, significant scale variations, dense distributions, and complex background interference. In real-world UAV scenarios, small objects often occupy only a few pixels and are easily obscured by cluttered backgrounds, which complicates stable and accurate detection. To address these issues, this study proposes MSCM-YOLO, a UAV-oriented lightweight detection framework based on YOLOv11. The framework integrates four key innovations: (1) a dedicated P2 detection head to preserve high-resolution features for extremely small and dense targets; (2) a lightweight backbone enhanced with Mobile Bottleneck Convolution (MBConv) to improve feature extraction for visually weak objects; (3) a Scale-Adaptive Attention Fusion (SAF) mechanism with a Channel-Adaptive Projection (CAP) module to effectively integrate multi-scale spatial and semantic features under large object-size variations; and (4) a Multi-Domain Feature Attention Fusion (MDFAF) module to enhance target-background discrimination in complex UAV scenes. Experiments on the VisDrone2019 dataset show that MSCM-YOLO achieves mAP50 and mAP50:95 scores of 44.41% and 27.13%, respectively, outperforming the YOLOv11 baseline by 10.77 and 7.22 percentage points. Notably, the proposed framework achieves this significant performance improvement while maintaining a balanced computational profile suitable for UAV deployment. Additional validation on the UAVDT, DIOR, and AI-TOD datasets confirms consistent improvements in mAP50, demonstrating the robustness and generalization ability of the proposed method. Overall, MSCM-YOLO provides an effective and practical solution for accurate small object detection in aerial monitoring applications.
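To make the motivation for the P2 head and the reported gains concrete, the short sketch below works through the arithmetic. It is illustrative only and does not reproduce MSCM-YOLO. It relies on the common YOLO convention that pyramid level Pk has stride 2^k, and it assumes a 640x640 input and an 8-pixel-wide object, neither of which is stated in the abstract. The helper names (`grid_size`, `cells_covered`, `implied_baseline`) are hypothetical. The baseline scores are simply the reported scores minus the reported gains.

```python
# Illustrative arithmetic only; this is not the authors' implementation.

def grid_size(input_px: int, level: int) -> int:
    """Side length of the feature map at pyramid level P<level>.

    Assumes the usual convention that level Pk has stride 2**k.
    """
    return input_px // (2 ** level)


def cells_covered(object_px: float, level: int) -> float:
    """Approximate object width measured in feature-map cells at P<level>."""
    return object_px / (2 ** level)


def implied_baseline(score: float, gain: float) -> float:
    """Baseline score implied by a reported score and its reported gain."""
    return round(score - gain, 2)


if __name__ == "__main__":
    INPUT_PX = 640   # assumed input resolution, not given in the abstract
    OBJECT_PX = 8    # hypothetical small-object width in pixels

    for level in (2, 3, 4, 5):
        side = grid_size(INPUT_PX, level)
        width = cells_covered(OBJECT_PX, level)
        print(f"P{level}: stride {2 ** level:>2}, grid {side}x{side}, "
              f"object spans {width:.2f} cells")

    # Baseline values implied by the figures reported on VisDrone2019.
    print("implied YOLOv11 mAP50:   ", implied_baseline(44.41, 10.77))
    print("implied YOLOv11 mAP50:95:", implied_baseline(27.13, 7.22))
```

Under these assumptions, an 8-pixel object spans two cells at P2 but only one cell at P3 and a fraction of a cell at deeper levels, which is consistent with the abstract's rationale for adding a high-resolution P2 head for extremely small targets.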
