Tianshou/optimizer/README.md
Tongzheng Ren 6a4adac1a0 Optimizer
2017-11-06 13:39:36 +08:00

# Optimizer for policy gradient methods
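TODO:

- vanilla policy gradient
- introduce a baseline
- REINFORCE
- TRPO (Trust Region Policy Optimization)
- PPO (Proximal Policy Optimization)
- GAE (Generalized Advantage Estimation)
- NAF (Normalized Advantage Functions)
- DPG (Deterministic Policy Gradient)
- ACKTR (Actor Critic using Kronecker-Factored Trust Region)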
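
Several of the items above (vanilla policy gradient, the baseline, REINFORCE, GAE) share one building block: turning an episode's rewards into a per-step learning signal. The sketch below is only an illustration of that step in plain Python. The function names `discounted_returns` and `gae_advantages` are hypothetical and are not part of this module, and the value estimates are assumed to come from some critic that is not shown here.

```python
def discounted_returns(rewards, gamma):
    """Reward-to-go G_t = r_t + gamma * G_{t+1} for one finished episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def gae_advantages(rewards, values, gamma, lam, last_value=0.0):
    """Generalized Advantage Estimation for one trajectory.

    delta_t = r_t + gamma * V(s_{t+1}) - V(s_t)
    A_t     = delta_t + gamma * lam * A_{t+1}

    `values[t]` is an (assumed) critic estimate V(s_t); `last_value` is the
    bootstrap value after the final step (0.0 if the episode terminated).
    """
    advantages = [0.0] * len(rewards)
    running = 0.0
    next_value = last_value
    for t in reversed(range(len(rewards))):
        delta = rewards[t] + gamma * next_value - values[t]
        running = delta + gamma * lam * running
        advantages[t] = running
        next_value = values[t]
    return advantages


if __name__ == "__main__":
    rewards = [1.0, 1.0, 1.0]
    values = [0.5, 0.5, 0.5]  # stand-in baseline, not a trained critic
    print(discounted_returns(rewards, gamma=0.5))
    print(gae_advantages(rewards, values, gamma=0.5, lam=1.0))
```

With `lam=1.0` and a terminated episode, the GAE advantage reduces to the discounted return minus the baseline, which is the "introduce a baseline" variant of REINFORCE. With `lam=0.0` it reduces to the one-step TD error.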