architecture design patch

This commit is contained in:
Tongzheng Ren 2017-11-06 15:17:55 +08:00
parent 595e62e111
commit e6cad0bce9
2 changed files with 25 additions and 20 deletions

View File

@ -1,2 +1,27 @@
# tianshou
Tianshou(天授) is a reinforcement learning platform.
## data
TODO:
Replay Memory
Multiple wirter/reader
Importance sampling
## simulator
go(for AlphaGo)
## environment
gym
## core
TODO:
Optimizer
MCTS
## agent (optional)
DQNAgent etc.

View File

@ -1,20 +0,0 @@
# Optimizer for policy gradient methods
TODO:
vanilla
baseline
REINFORCE
TRPO
PPO
GAE
NAF
DPG
ACKTR