Enhanced feature representation for real time UAV image object detection using contextual information and adaptive fusion

利用上下文信息和自适应融合技术增强特征表示,实现实时无人机图像目标检测。

阅读:1

Abstract

In UAV-based downstream tasks, intelligent interpretation of UAV images demands higher real-time performance and accuracy. However, achieving high-precision, real-time object detection in UAV images poses significant challenges due to the prevalence of small objects (e.g., persons and bicycles), uneven target distribution, occlusion, and other factors. Current UAV object detection algorithms lack comprehensive solutions to the multifaceted challenges encountered in real-world deployment scenarios, resulting in suboptimal performance. Moreover, direct application of mainstream real-time detection algorithms like the YOLO series to UAV images lead to a significant performance drop. To address these issues, this paper presents an enhanced real-time object detection network named YOLO-UD, which is built upon the YOLO11 architecture. Our approach aims to achieve superior feature representation through the effective integration of contextual information and adaptive multi-scale fusion. Specifically, we incorporate a novel C3kHR module, which employs dilated convolutions with varying rates to capture contextual information across multiple granularity hierarchy, enabling superior and richer multi-scale feature representation. Additionally, an efficient adaptive feature fusion network (EAFN) is designed to filter and prioritize key information from multi-scale feature layers and flexibly provide the detection head with the information needed for the detection process. A small object detection layer (SMDL) is also introduced to enhance the detection of small objects and provide rich information about small targets. Finally, extensive experiments on the VisDrone2019 and UAVDET datasets demonstrate that YOLO-UD achieves excellent balance between accuracy and inference speed, validating its effectiveness.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。