High-dimensional continuous action space control via trust region optimized deep reinforcement learning

Abstract

This study introduces the Adaptive Trust Region Policy Optimization for Action Space Compression (ATRPO-ACS) framework, a deep reinforcement learning approach built on trust region optimization and designed for adaptive control in high-dimensional continuous action spaces. The framework combines distributed KL constraint optimization with manifold projection and residual compensation. Together these improve sampling efficiency and real-time performance while reducing trajectory tracking errors and voltage limit violations. In experimental validation, robotic arm tracking errors stayed within ±0.08 mm and microgrid scheduling costs fell by 28.5%. The framework also shortened production cycles in automotive welding lines. These results provide theoretical and technical support for real-time optimization control in industrial intelligent systems.
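For readers unfamiliar with trust region methods, the sketch below illustrates the general idea of a KL-constrained policy update. It is not the ATRPO-ACS algorithm, whose details the abstract does not give. It assumes a diagonal Gaussian policy with a fixed standard deviation and uses a simple backtracking line search; the names gaussian_kl, trust_region_step, max_kl, and shrink are hypothetical and chosen only for illustration.

```python
import numpy as np


def gaussian_kl(mu_old, std_old, mu_new, std_new):
    """KL(old || new) between diagonal Gaussians, summed over action dimensions."""
    var_old, var_new = std_old ** 2, std_new ** 2
    return float(np.sum(
        np.log(std_new / std_old)
        + (var_old + (mu_old - mu_new) ** 2) / (2.0 * var_new)
        - 0.5
    ))


def trust_region_step(mu_old, std, direction, max_kl=0.01, shrink=0.5, max_backtracks=20):
    """Shrink the step along `direction` until the KL constraint holds.

    Assumption: only the policy mean is updated and `std` stays fixed.
    Returns the accepted mean and the KL divergence of the accepted step.
    """
    step = 1.0
    for _ in range(max_backtracks):
        mu_new = mu_old + step * direction
        kl = gaussian_kl(mu_old, std, mu_new, std)
        if kl <= max_kl:
            return mu_new, kl
        step *= shrink
    # No acceptable step found: keep the old policy mean.
    return mu_old.copy(), 0.0


# Hypothetical 32-dimensional action space.
mu_old = np.zeros(32)
std = np.ones(32)
direction = np.ones(32)  # stand-in for a policy improvement direction
mu_new, kl = trust_region_step(mu_old, std, direction)
print(f"accepted KL = {kl:.6f}")
```

The point of the constraint is that in a high-dimensional action space the per-dimension changes add up, so the same proposed direction yields a smaller accepted step as the number of dimensions grows.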
