Pedestrian navigation activity recognition method based on two-stream transformer and contrastive learning

基于双流Transformer和对比学习的行人导航活动识别方法

阅读:4

Abstract

Pedestrian navigation activity recognition (PNAR) serves a pivotal role in in the pedestrian positioning and navigation field, providing strong technical support for various aspects such as pedestrian dead reckoning, and multi-source information fusion positioning. This paper proposes a PNAR method that combines a two-stream convolutional transformer architecture with self-supervised contrastive pretraining to address challenges in learning robust, transferable, and generalizable representations from sensor data. The spatial stream captures multi-modal sensor dependencies, while the temporal stream leverages attention mechanism to excavate temporal relationships. The two-stream design effectively processes multi-modal sensor data and models complex activities. Contrastive pretraining leverages unlabeled data to learn invariant and transferable representations, significantly enhancing generalization across datasets. The proposed method was evaluated on four public datasets, achieving exceptional performance-99.08% accuracy and 99.22% F1-score, outperforming existing PNAR methods, including CNNLSTM + Attention and Transformer-based PNAR models. Furthermore, we conducted cross-dataset experiments on data with different sensor configurations and activity labels to validate the model's superior generalization ability.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。