AsymmetricActor Critic for Image-Based Robot Learning,这篇文章的新颖之处在将simulator(模拟器)的stateimage信息分别输入AC的CriticActor。

AC算法中,Actor负责产生policy,Critic来评价actor产生policy的优劣。整个AC最终只有image输入,输出policy,即是一个end-to-end system,训练好的agent可直接应用于real world。算法思想很巧妙,也得到了不错的效果。

如下图:

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

论文详细过程,大家可以参考我做的如下ppt:

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning

读论文-Asymmetric Actor Critic for Image-Based Robot Learning


相关文章:

  • 2021-09-21
  • 2021-07-06
  • 2022-12-23
  • 2021-07-28
  • 2021-12-17
  • 2021-10-20
  • 2021-12-19
猜你喜欢
  • 2022-12-23
  • 2021-10-19
  • 2021-05-13
  • 2021-09-20
  • 2021-10-28
  • 2021-11-07
  • 2021-08-13
相关资源
相似解决方案