sproblvem 674ba4656b Update README.md

Sub-module function of tianshou.

2017-12-04 16:20:45 +08:00

Name          Last commit                      Date
AlphaGo       merge gtp                        2017-12-04 11:01:49 +08:00
docs/figures  upload the architecture image    2017-11-06 15:56:16 +08:00
examples      architecture design patch two    2017-11-06 15:24:34 +08:00
tianshou      merge gtp                        2017-12-04 11:01:49 +08:00
utils         remove .swp                      2017-11-28 15:04:00 +08:00
__init__.py   add __init__.py                  2017-12-01 01:38:11 +08:00
.gitignore    remove .swp                      2017-11-28 15:04:00 +08:00
LICENSE       Initial commit                   2017-11-04 01:38:59 +08:00
README.md     Update README.md                 2017-12-04 16:20:45 +08:00

README.md

tianshou

Tianshou (天授) is a reinforcement learning platform.

agent

Examples, Self-play Framework

core

Model

DQN, Policy-Value Network of AlphaGo Zero, PPO-specific, TRPO-specific

Algorithm

Loss design

Actor-Critic (Variations), DQN (Variations), DDPG, TRPO, PPO

Optimization method

SGD, ADAM, TRPO, natural gradient, etc.

Planning

MCTS
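
As a rough illustration of the selection step in MCTS, the sketch below picks a child node with the plain UCT rule (mean value plus an exploration bonus). The function name `uct_select`, the `(visit_count, total_value)` tuple layout, and the constant `c` are assumptions made for this example; they are not taken from the tianshou code, and an AlphaGo Zero style search would use a prior-weighted variant instead.

```python
import math


def uct_select(children, c=1.4):
    """Return the index of the child with the highest UCT score.

    `children` is a list of (visit_count, total_value) pairs (hypothetical layout).
    Unvisited children are selected first so every child is tried at least once.
    """
    total_visits = sum(n for n, _ in children)
    best_index, best_score = None, -math.inf
    for i, (n, w) in enumerate(children):
        if n == 0:
            return i  # explore unvisited children before comparing scores
        # Mean value (exploitation) plus a bonus that shrinks as the child is visited more.
        score = w / n + c * math.sqrt(math.log(total_visits) / n)
        if score > best_score:
            best_index, best_score = i, score
    return best_index
```

A larger `c` favors rarely visited children; a smaller `c` favors the child with the best average value so far.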

data

Training style - Monte Carlo or Temporal Difference

Reward Reshaping / Advantage Estimation Function

Importance weight

Multithread Read/Write
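
To make the difference between the two training styles listed above concrete, here is a minimal sketch of how the learning targets could be computed from collected data. The function names and argument layout are assumptions for illustration only and do not reflect the actual tianshou data module.

```python
def monte_carlo_returns(rewards, gamma):
    """Discounted return G_t = r_t + gamma * G_{t+1} for one finished episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each return reuses the one computed for the next step.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def td_targets(rewards, next_values, dones, gamma):
    """One-step TD target r_t + gamma * V(s_{t+1}); no bootstrapping on terminal steps."""
    return [
        r + (0.0 if done else gamma * v)
        for r, v, done in zip(rewards, next_values, dones)
    ]
```

Monte Carlo targets need a complete episode but do not depend on a value estimate, while TD targets can be formed after every step at the cost of relying on the current estimate `V(s_{t+1})`. Subtracting a value baseline `V(s_t)` from either target gives a simple advantage estimate.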

environment

Frame repetition as used in DQN (each chosen action is repeated for several frames), etc.
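
A minimal sketch of such an environment wrapper follows. It assumes a hypothetical environment interface with `reset()` and `step(action)` returning `(observation, reward, done)`; the class names `RepeatAction` and `CountingEnv` are invented for this example and are not part of tianshou.

```python
class RepeatAction:
    """Apply each chosen action `repeat` times (assumed >= 1) and sum the rewards."""

    def __init__(self, env, repeat=4):
        self.env = env
        self.repeat = repeat

    def reset(self):
        return self.env.reset()

    def step(self, action):
        total_reward = 0.0
        for _ in range(self.repeat):
            obs, reward, done = self.env.step(action)
            total_reward += reward
            if done:
                break  # stop repeating once the episode has ended
        return obs, total_reward, done


class CountingEnv:
    """Toy environment for the demo: reward 1 per step, ends after `horizon` steps."""

    def __init__(self, horizon=10):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        return self.t, 1.0, self.t >= self.horizon
```

Wrapping `CountingEnv(horizon=10)` with `repeat=4` means the agent makes three decisions per episode instead of ten, and the last decision covers only the two remaining frames.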

simulator

Go, Othello/Reversi, Warzone

TODO

Parallelize search-based methods.