An efficient reparameterized small object detection transformer for thermal infrared images

一种用于热红外图像的高效重参数化小目标检测变换器

阅读:1

Abstract

Accurate and efficient small object detection in thermal infrared images remains a critical challenge due to inherent issues such as low contrast, limited texture, and deployment constraints on edge platforms like Unmanned Aerial Vehicle (UAV). This paper presents PWL-RTDETR, an efficient Transformer-based framework specifically designed for infrared small object detection. The proposed model incorporates a novel Partial Convolutional Reparameterization Block (PConvRep-Block), which fuses Partial Convolution and reparameterization to support multi-branch training and single-path inference, significantly reducing computation without compromising representation quality. To enhance multi-scale feature aggregation, we introduce WTRCSPNeck, a lightweight neck architecture integrating CNCSPELAN and WTConv modules. CNCSPELAN improves gradient flow and feature representation through structural reparameterization, while WTConv employs multi-level wavelet decomposition to effectively expand the receptive field and capture both global context and fine-grained details. Furthermore, we adopt Layer-Adaptive Magnitude-based Pruning to achieve global sparsification with layer-wise adaptability, enabling further compression while maintaining model accuracy. Comprehensive evaluations on the HIT-UAV and LLVIP infrared datasets confirm that PWL-RTDETR surpasses existing state-of-the-art models in accuracy, while achieving substantial reductions in parameters and FLOPs. The results highlight the model's suitability for real-time deployment in resource-constrained infrared perception scenarios.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。