Robot motion skill learning method based on focused reward transformer

基于聚焦奖励变换器的机器人运动技能学习方法

阅读:1

Abstract

In the field of robotics control, prevailing research is progressively leveraging more sophisticated deep learning networks to enhance learning outcomes in specific domains of robotics motion control. Notably, the Decision Transformer (DT), a prominent model in motion control learning, predominantly focuses on long-term expected returns and often overlooks the critical significance of immediate action values, which are essential for the adaptive behaviors observed in higher animals. Addressing this gap, this paper introduces the Focused Reward Transformer (FRT) . This innovative training network combines guided attention mechanisms with a significant emphasis on reward processing. The FRT aims to optimize robot motion control by dynamically balancing long-term predictions with immediate environmental cues. The effectiveness of the FRT was rigorously evaluated by detailed simulations on the MuJoCo platform, using three different tasks: Hopper, Walker2d, and Halfcheetah. These tasks broadly represent the challenges inherent to robotics dynamics. Our experimental results demonstrate a substantial improvement in the performance of control tasks, the ability to validate FRT, refine robotic control, and skillfully combine in-time context responses with the long-term planning of strategies. This confirms the potential of FRT to advance the field of robotic control by introducing more nuanced approaches to learning and decision-making processes.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。