Exploiting Gaussian based effective receptive fields for object detection



Abstract

The effective receptive field (ERF) is a crucial concept in object detection, as it captures rich semantic information about the target, including its position and class. Existing methods typically tie the ERF statically to the depth, size, and nonlinear operations of the convolutional network, so that the feature map at each layer of the network corresponds to a fixed ERF size. In real images, however, multiple objects with varying scales, shapes, and other characteristics can influence the ERF, and the ERF often follows a Gaussian distribution. In this paper, we propose a dynamic, real-time, region-oriented ERF computation method named GERF (Gaussian-based Effective Receptive Fields). We apply GERF to the BRA (Bi-Level Routing Attention) module of BiFormer and refer to the resulting method as GERF-BRA. Our approach predicts the ERF for each window in the feature map and captures the weighted features of adjacent windows using a Gaussian distribution. We integrate GERF-BRA into the detection heads of YOLOv8n, and experimental results on the COCO 2017 dataset demonstrate its effectiveness, with an improvement of 2.5 AP. Our method also proves effective on proprietary agricultural and medical datasets.
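The idea of weighting the features of adjacent windows with a Gaussian can be illustrated with a minimal sketch. This is not the GERF-BRA implementation: the abstract does not specify how the ERF is predicted or how the weights enter the attention module. The function names, the isotropic Gaussian, the `sigma` parameter (standing in for a predicted per-window ERF spread, measured in windows), and the normalization are all assumptions made for illustration.

```python
import math


def gaussian_window_weights(grid_h, grid_w, center, sigma):
    """Return normalized Gaussian weights over a grid of windows.

    center: (row, col) index of the query window.
    sigma:  hypothetical ERF spread for that window, in units of windows
            (an assumption; the paper predicts the ERF per window).
    """
    cr, cc = center
    weights = [
        [
            math.exp(-((r - cr) ** 2 + (c - cc) ** 2) / (2.0 * sigma ** 2))
            for c in range(grid_w)
        ]
        for r in range(grid_h)
    ]
    # Normalize so that the weights over all windows sum to 1.
    total = sum(sum(row) for row in weights)
    return [[w / total for w in row] for row in weights]


def aggregate(window_feats, center, sigma):
    """Gaussian-weighted sum of per-window feature vectors.

    window_feats[r][c] is the feature vector (list of floats) of window (r, c).
    """
    grid_h, grid_w = len(window_feats), len(window_feats[0])
    weights = gaussian_window_weights(grid_h, grid_w, center, sigma)
    dim = len(window_feats[0][0])
    out = [0.0] * dim
    for r in range(grid_h):
        for c in range(grid_w):
            for d in range(dim):
                out[d] += weights[r][c] * window_feats[r][c][d]
    return out
```

In this sketch a small `sigma` concentrates nearly all weight on the query window, while a larger `sigma` spreads it over neighbouring windows, which mirrors the notion of an ERF whose size varies from region to region rather than being fixed per layer.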
