Effectiveness of traditional augmentation methods for rebar counting using UAV imagery with Faster R-CNN and YOLOv10-based transformer architectures

Abstract

Accurate inspection of reinforced concrete (RC) structures requires precise rebar counting. Although deep-learning object detectors can extract this information from drone imagery, their effectiveness depends on large, diverse, and well-labeled datasets. Image augmentation can increase data variability, yet its impact on Unmanned Aerial Vehicle (UAV)-based rebar counting has been underexplored. This study systematically evaluates ten augmentation methods (brightness, contrast, perspective, rotation, scale, shearing, translation, blurring, a probabilistic sampling policy, and a sum-of-techniques composition) using Faster R-CNN and YOLOv10 across six backbones (ResNet-101, ResNet-152, MobileNetV3; ViT, PVT, Swin Transformer). Performance is reported using AP50, AP50:95, and exact-count accuracy. Results show that augmentation efficacy is both architecture- and metric-dependent. The best test-set configuration is YOLOv10-PVT with shearing, which achieves AP50 = 87.71%, AP50:95 = 68.53%, and rebar-count accuracy = 86.27%, improvements of +5.92, +9.07, and +5.99 percentage points, respectively, over the PVT baseline trained on the original data. A probabilistic sampling policy provides consistent, policy-level gains over the original data and approaches the best single transform (especially with a magnitude ramp), whereas indiscriminately applying the sum-of-techniques composition does not reliably outperform the top single augmentation.
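The abstract does not define exact-count accuracy, so the following is only a minimal Python sketch under a stated assumption: an image counts as correct when the number of detections at or above a confidence threshold equals the ground-truth rebar count. The function names, the 0.5 threshold, and the toy scores are hypothetical and are not taken from the study. The helper `implied_baseline` simply subtracts the reported percentage-point gains from the reported best scores.

```python
from typing import Sequence


def count_detections(scores: Sequence[float], threshold: float = 0.5) -> int:
    """Count detections whose confidence is at or above the threshold.

    The 0.5 default is an illustrative assumption, not a value from the study.
    """
    return sum(1 for s in scores if s >= threshold)


def exact_count_accuracy(
    per_image_scores: Sequence[Sequence[float]],
    true_counts: Sequence[int],
    threshold: float = 0.5,
) -> float:
    """Percentage of images whose predicted count equals the true count.

    Assumed definition: an image is a hit only on an exact match.
    """
    if len(per_image_scores) != len(true_counts):
        raise ValueError("one ground-truth count is required per image")
    if not true_counts:
        raise ValueError("at least one image is required")
    hits = sum(
        1
        for scores, truth in zip(per_image_scores, true_counts)
        if count_detections(scores, threshold) == truth
    )
    return 100.0 * hits / len(true_counts)


def implied_baseline(best: float, gain_pp: float) -> float:
    """Baseline score implied by a best score and a percentage-point gain."""
    return round(best - gain_pp, 2)


# Toy data (hypothetical): three images, each with two rebars in the ground truth.
toy_scores = [[0.9, 0.8, 0.3], [0.7, 0.6], [0.95, 0.4, 0.2, 0.1]]
toy_truth = [2, 2, 2]
toy_accuracy = exact_count_accuracy(toy_scores, toy_truth)  # two of three images match

# Baselines implied by the figures reported in the abstract.
baseline_ap50 = implied_baseline(87.71, 5.92)
baseline_ap50_95 = implied_baseline(68.53, 9.07)
baseline_count_acc = implied_baseline(86.27, 5.99)
```

Under these assumptions, the reported gains imply PVT baseline scores of 81.79% (AP50), 59.46% (AP50:95), and 80.28% (rebar-count accuracy). Because a single spurious or missed bar turns an image into a miss, exact-count accuracy is stricter than AP50 and can rank augmentations differently from it.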
