Attention based unified architecture for Arabic text detection on traffic panels to advance autonomous navigation in natural scenes

基于注意力机制的统一架构用于交通标志牌上的阿拉伯语文本检测,以提升自然场景下的自主导航能力

阅读:1

Abstract

The increasing reliance on autonomous navigation systems necessitates robust methods for detecting and recognizing textual information in natural scenes, especially in complex scripts like Arabic. This paper presents a novel attention-based unified architecture for Arabic text detection and recognition on traffic panels, addressing the unique challenges posed by Arabic's cursive nature, varying character shapes, and contextual dependencies. Leveraging the ASAYAR dataset, which includes diverse Arabic text samples with precise annotations, the proposed model integrates Convolutional Neural Networks (CNNs) and Bidirectional Long Short-Term Memory (BiLSTM) networks with attention mechanisms to accurately localize and interpret text regions. The architecture demonstrates state-of-the-art performance, achieving a Mean Intersection over Union (IoU) of 0.9505, precision of 0.953, recall of 0.934, F1-score of 0.929, and an overall recognition accuracy of 97%. Visualizations of attention weights and SHAP analyses highlight the model's explainability and focus on relevant features, ensuring reliability in real-world applications. Furthermore, the system's computational efficiency and real-time applicability make it suitable for use in Advanced Driver Assistance Systems (ADAS) and autonomous vehicles, reducing driver distractions and enhancing traffic safety. This study not only advances Arabic text recognition research but also provides insights into developing scalable, multilingual text detection systems for complex real-world scenarios.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。