POLAR-DETR: Polarized occlusion-aware local-global attention real-time detection transformer for total laboratory automation

POLAR-DETR:面向全实验室自动化的偏振遮挡感知局部-全局注意力实时检测转换器

阅读:1

Abstract

The deployment of Total Laboratory Automation (TLA) systems in medical production lines faces challenges including spatial constraints, dense object distributions, and severe occlusions, rendering traditional detection methods inadequate. This paper proposes POLAR-DETR (Polarized Occlusion-aware Local-global Attention Real-time Detection Transformer), an efficient real-time end-to-end detection framework for medical production scenarios. First, we design a Polarized Occlusion-aware Hierarchical Feature Encoder (POHFE) incorporating polar linear attention and dynamic nonlinear feature modulation, enhancing spatial-contextual awareness and detail representation. Second, we introduce a Multi-level Hierarchical Attention Fusion (MHAF) module that strengthens semantic associations between multi-scale features through hypergraph computation. Additionally, we develop a Hierarchical Dual-branch Attention Fusion (HDAF) module for precise discrimination of local details and global information. To optimize deployment efficiency, we devise a Hessian matrix-based pruning strategy reducing network redundancy. Furthermore, we construct the Augmented Medical Production Line (AMPL) dataset, comprising 5040 high-resolution images with 85,797 annotated instances. Experimental results demonstrate that POLAR-DETR achieves 70.0% Average Precision (AP) on AMPL while maintaining 68.4 FPS. Compared to baseline, our approach improves AP by 4.7% while reducing parameters and computational complexity by 20.5% and 22.6% respectively, providing an efficient visual detection solution for medical production automation.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。