24 Commits

Author SHA1 Message Date
Dong Yan
afc55ed9c2 refactor code to avoid memory leak 2018-01-11 17:02:36 +08:00
Dong Yan
f425085e0a fix the tf assign error of copy the trained variable from black to white 2018-01-09 21:16:35 +08:00
rtz19970824
c2775df8e6 modify game.py for multi-player 2018-01-09 20:09:48 +08:00
rtz19970824
eb0ce95919 modify model.py for multi-player 2018-01-09 19:50:37 +08:00
rtz19970824
32b7b33ed5 debug: we should estimate our own win rate 2018-01-08 16:19:59 +08:00
Dong Yan
08b6649fea test next_action.next_state in MCTS 2017-12-28 15:52:31 +08:00
Dong Yan
d48982d59e move evaluator from action node to mcts 2017-12-27 20:49:54 +08:00
Dong Yan
9f60984973 remove type_conversion function 2017-12-27 14:08:34 +08:00
Dong Yan
029ab199f4 add softmax for mcts root node 2017-12-26 16:47:24 +08:00
Dong Yan
aa6b5434c6 add debug info for mcts and add softmax for the prior 2017-12-26 14:46:14 +08:00
rtz19970824
74504ceb1d debug for go and reversi 2017-12-24 14:40:50 +08:00
Dong Yan
426251e158 add some code for debug and profiling 2017-12-24 01:07:46 +08:00
Dong Yan
919784e88b bug fix of model.py 2017-12-23 17:43:33 +08:00
rtz19970824
4589fcf521 add random preprocess, modify the uniform sample from training data 2017-12-23 16:27:09 +08:00
rtz19970824
67ba76a04d implement a stochastic sample training method 2017-12-22 17:16:44 +08:00
rtz19970824
ed96268454 Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-22 13:47:38 +08:00
rtz19970824
a8509ba292 faster the loading 2017-12-22 13:42:53 +08:00
rtz19970824
5f296ce009 merge 2017-12-22 13:31:41 +08:00
rtz19970824
6b3efd7fca modify the training config 2017-12-22 13:30:48 +08:00
rtz19970824
d281ecc6e0 no restrict on saving checkpoints 2017-12-22 13:05:01 +08:00
rtz19970824
2b1285143c debug the training process, initialize a nameserver if no nameserver exists 2017-12-22 13:04:02 +08:00
rtz19970824
9ad53de54f implement the training process 2017-12-21 23:30:24 +08:00
rtz19970824
eda7ed07a1 implement data collection and part of training 2017-12-21 21:01:25 +08:00
rtz19970824
7fca90c61b modify the mcts, refactor the network 2017-12-20 16:43:42 +08:00