DT-aided resource allocation via generative adversarial imitation learning in complex cloud-edge-end scenarios

在复杂的云-边-端场景中,基于生成对抗模仿学习的决策树辅助资源分配

阅读:1

Abstract

Traditional DRL-based resource allocation for cloud-edge-end computing primarily depends on known state parameters and real-time feedback rewards when making decisions. The traditional model, which heavily relies on prior knowledge and real-time feedback of the scene, faces challenges in delivering effective services in complex scenarios. We propose a DT-aided Expert-driven Generative Adversarial Imitation Learning (E-GAIL) model that leverages imitation learning capability to jointly allocate multiple constrained resources. Firstly, we introduce a single-expert trajectory generation algorithm based on Actor-Critic and Noisynet by using the rich historical data provided in DT Networks. This idea can enhance the fidelity of the imitated expert trajectory by utilizing the critic to update the network iteratively. Secondly, we fuse different single-expert trajectories into a multi-expert trajectory to expand the coverage area. We also employ the Nash equilibrium to identify the optimal equilibrium solution and reduce the conflicts among different experts. Finally, the parameters of the generator and discriminator in E-GAIL are updated according to the respective gradients to fit the multi-expert trajectory during the training process. Once the task is uploaded, the E-GAIL Agent in the edge server can rapidly obtain the resource allocation policy even without prior knowledge or real-time reward feedback. The experiment results indicate that E-GAIL can obtain the best-fit expert trajectory in large-scale noisy environments.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。