add the arch image to readme
Tianshou(天授) is a reinforcement learning platform.
TODO:
Replay Memory
Multiple wirter/reader
Importance sampling
go(for AlphaGo)
gym
Optimizer
MCTS
DQNAgent etc.