Tianshou/README.md

# tianshou
Tianshou(天授) is a reinforcement learning platform.

![alt text](https://github.com/sproblvem/tianshou/blob/master/docs/figures/tianshou_architecture.png "Architecture of tianshou")

## data
TODO:

Replay Memory

Multiple wirter/reader

Importance sampling

## simulator
go(for AlphaGo)

## environment
gym

## core
TODO:

Optimizer

MCTS

## agent (optional)

DQNAgent etc.

## Pontential Bugs:

0. Wrong calculation of eval value

UCTNode.cpp
```
106     if (to_move == FastBoard::WHITE) {
107         net_eval = 1.0f - net_eval;
108     }

309         if (tomove == FastBoard::WHITE) {
310             score = 1.0f - score;
311         }
```

1. create children only on leaf node

UCTSearch.cpp
```
 60     if (!node->has_children() && m_nodes < MAX_TREE_SIZE) {
 61         float eval;
 62         auto success = node->create_children(m_nodes, currstate, eval);
 63         if (success) {
 64             result = SearchResult(eval);
 65         }
 66     }
```
Initial commit 2017-11-04 01:38:59 +08:00			`# tianshou`
			`Tianshou(天授) is a reinforcement learning platform.`
Update README.md add the arch image to readme 2017-11-06 15:58:21 +08:00
			`![alt text](https://github.com/sproblvem/tianshou/blob/master/docs/figures/tianshou_architecture.png "Architecture of tianshou")`

architecture design patch 2017-11-06 15:17:55 +08:00			`## data`
			`TODO:`

			`Replay Memory`

			`Multiple wirter/reader`

			`Importance sampling`

			`## simulator`
			`go(for AlphaGo)`

			`## environment`
			`gym`

			`## core`
			`TODO:`

			`Optimizer`

			`MCTS`

			`## agent (optional)`

			`DQNAgent etc.`
Update README.md add potential bugs of leela. 2017-11-06 20:35:53 +08:00
Update README.md format modify 2017-11-06 20:39:09 +08:00			`## Pontential Bugs:`

Update README.md add potential bugs of leela. 2017-11-06 20:35:53 +08:00			`0. Wrong calculation of eval value`
Update README.md format modify 2017-11-06 20:39:09 +08:00
Update README.md add potential bugs of leela. 2017-11-06 20:35:53 +08:00			`UCTNode.cpp`
Update README.md format modify 2017-11-06 20:39:09 +08:00			```
Update README.md add potential bugs of leela. 2017-11-06 20:35:53 +08:00			`106 if (to_move == FastBoard::WHITE) {`
			`107 net_eval = 1.0f - net_eval;`
			`108 }`

			`309 if (tomove == FastBoard::WHITE) {`
			`310 score = 1.0f - score;`
			`311 }`
Update README.md format modify 2017-11-06 20:39:09 +08:00			```
Update README.md add potential bugs of leela. 2017-11-06 20:35:53 +08:00
			`1. create children only on leaf node`
Update README.md format modify 2017-11-06 20:39:09 +08:00
Update README.md add potential bugs of leela. 2017-11-06 20:35:53 +08:00			`UCTSearch.cpp`
Update README.md format modify 2017-11-06 20:39:09 +08:00			```
Update README.md add potential bugs of leela. 2017-11-06 20:35:53 +08:00			`60 if (!node->has_children() && m_nodes < MAX_TREE_SIZE) {`
			`61 float eval;`
			`62 auto success = node->create_children(m_nodes, currstate, eval);`
			`63 if (success) {`
			`64 result = SearchResult(eval);`
			`65 }`
			`66 }`
Update README.md format modify 2017-11-06 20:39:09 +08:00			```
Update README.md add potential bugs of leela. 2017-11-06 20:35:53 +08:00