
sproblvem d2e6c517ea Update README.md


2017-11-06 20:35:53 +08:00


tianshou

Tianshou (天授) is a reinforcement learning platform.

data

TODO:

Replay Memory

Multiple writers/readers

Importance sampling

go (for AlphaGo)

gym

TODO:

Optimizer

MCTS

DQNAgent etc.

Potential Bugs:

0. Wrong calculation of the eval value (UCTNode.cpp, lines 106-108 and 309-311):

    if (to_move == FastBoard::WHITE) {
        net_eval = 1.0f - net_eval;
    }

    if (tomove == FastBoard::WHITE) {
        score = 1.0f - score;
    }

1. Children are created only on leaf nodes (UCTSearch.cpp, lines 60-66):

    if (!node->has_children() && m_nodes < MAX_TREE_SIZE) {
        float eval;
        auto success = node->create_children(m_nodes, currstate, eval);
        if (success) {
            result = SearchResult(eval);
        }
    }
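To make the two snippets easier to reason about, here is a minimal Python sketch of the same two patterns: switching a win probability between Black's point of view and the side to move's, and expanding a node only when it has no children yet. It is not a port of the Leela code. The names `switch_perspective`, `Node`, `Search`, and `branching` are hypothetical, and the README does not say which part of the original calculation is wrong, so the sketch only shows the convention that such a flip implements.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional

BLACK, WHITE = 0, 1
MAX_TREE_SIZE = 1000  # hypothetical cap, standing in for MAX_TREE_SIZE above


def switch_perspective(value: float, to_move: int) -> float:
    """Switch a win probability between Black's view and the side to move's view.

    The mapping is its own inverse: applying it twice for WHITE returns the
    original value, so an extra or a missing flip inverts White's evaluation.
    """
    return 1.0 - value if to_move == WHITE else value


@dataclass
class Node:
    children: List["Node"] = field(default_factory=list)

    def has_children(self) -> bool:
        return bool(self.children)


class Search:
    """Toy search that mirrors only the leaf-expansion check (assumption)."""

    def __init__(self, evaluate: Callable[[], float], branching: int = 2):
        self.evaluate = evaluate    # stands in for the network evaluation
        self.branching = branching  # hypothetical fixed number of children
        self.nodes = 1              # the root counts as one node

    def visit(self, node: Node, to_move: int) -> Optional[float]:
        # Expand only a node without children, and only below the size cap.
        if not node.has_children() and self.nodes < MAX_TREE_SIZE:
            node.children = [Node() for _ in range(self.branching)]
            self.nodes += self.branching
            return switch_perspective(self.evaluate(), to_move)
        # Already expanded (or the tree is full): no new evaluation here.
        return None
```

Calling `visit` twice on the same node shows the leaf-only behaviour: the first call expands the node and returns an evaluation, the second returns `None` because the node already has children.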