Multi-objective trajectory optimization method for industrial robots based on improved TD3 algorithm

基于改进TD3算法的工业机器人多目标轨迹优化方法

阅读:1

Abstract

This paper addresses the challenge of simultaneously satisfying collision-free trajectory, path quality optimization, and execution time optimization in trajectory planning for industrial robots in confined spaces. A multi-objective integrated trajectory optimization algorithm is proposed based on the TD3 reinforcement learning framework. Firstly, the motion generation mechanism is improved using a Butterworth filter and a dynamic noise attenuation strategy to enhance trajectory smoothness. Secondly, a genetic algorithm is employed to automatically optimize the hyperparameters of the TD3 algorithm, and a prioritized experience replay mechanism is introduced to improve the utilization efficiency of critical experiences, thereby enhancing the algorithm's convergence speed and stability. Finally, a composite reward function based on time-distance information is designed to effectively guide the industrial robot in optimizing trajectory execution time under collision-free conditions. This paper uses the Fairino Robot5 robotic arm model as the training object and conducts simulation experiments in a PyBullet environment, as well as real physical experiments. Compared to the RRT method, manual teaching method, traditional TD3 method, and SAC method, the actual trajectory execution time was reduced by 57.03% , 22.94% , 26.05%, and 20.5%, respectively.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。