Tianshou/README.md at 4e4a7b74c17cc1ab223482f48a9345bb4f6a2e29 - Tianshou - Gitea: Git with a cup of tea

hongshaorou/Tianshou

Tongzheng Ren 4e4a7b74c1 update the optimizer README

2017-11-06 14:01:29 +08:00

11 lines

112 B

Markdown

Raw Blame History

 # Optimizer for policy gradient methods
 TODO:
 vanilla
 introduce a baseline
 REINFORCE
 TRPO
 PPO
 GAE
 NAF
 DPG
 ACKTR