architecture design

Tongzheng Ren 2017-11-06 15:15:44 +08:00
parent 4e4a7b74c1
commit 595e62e111
2 changed files with 35 additions and 2 deletions

tianshou/core/README.md Normal file

@@ -0,0 +1,24 @@
# Core
## Optimizer
TODO:
### policy based:
Vanilla
Baseline
TRPO
PPO
NAF
GAE
DPG
### value based:
TD


@@ -1,11 +1,20 @@
# Optimizer for policy gradient methods
TODO:
vanilla
introduce a baseline
baseline
REINFORCE
TRPO
PPO
GAE
NAF
DPG
ACKTR