186 Commits

Author SHA1 Message Date
Dong Yan
e58df65301 fix the async bug between think and do move checking, which was introduced by bobo 2018-01-11 21:00:32 +08:00
Dong Yan
afc55ed9c2 refactor code to avoid memory leak 2018-01-11 17:02:36 +08:00
Dong Yan
5482815de6 replace two isolated player processes by two different sets of variables in the tf graph 2018-01-10 23:27:17 +08:00
Dong Yan
f425085e0a fix the tf assign error when copying the trained variable from black to white 2018-01-09 21:16:35 +08:00
rtz19970824
c2775df8e6 modify game.py for multi-player 2018-01-09 20:09:48 +08:00
rtz19970824
eb0ce95919 modify model.py for multi-player 2018-01-09 19:50:37 +08:00
Tongzheng Ren
891c5b1e47 Merge branch 'master' of https://github.com/sproblvem/tianshou 2018-01-08 21:21:08 +08:00
Tongzheng Ren
f2edc4896e modify play.py for avoiding potential bug 2018-01-08 21:19:17 +08:00
rtz19970824
32b7b33ed5 debug: we should estimate our own win rate 2018-01-08 16:19:59 +08:00
haoshengzou
88648f0c4b Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-31 15:56:19 +08:00
Wenbo Hu
50e8ea36e8 merge 2017-12-29 03:31:57 +08:00
Wenbo Hu
63a0d32b34 use hash table for check_global_isomorphous 2017-12-29 03:30:09 +08:00
rtz19970824
2dfab68efe debug for unit test 2017-12-28 19:28:21 +08:00
rtz19970824
b699258e76 debug for reversi 2017-12-28 15:55:07 +08:00
Dong Yan
08b6649fea test next_action.next_state in MCTS 2017-12-28 15:52:31 +08:00
Dong Yan
47676993fd solve the performance bottleneck by only hashing the last board 2017-12-28 01:16:24 +08:00
Dong Yan
d48982d59e move evaluator from action node to mcts 2017-12-27 20:49:54 +08:00
rtz19970824
f2291efc72 check exists when save data 2017-12-27 19:54:36 +08:00
Dong Yan
9f60984973 remove type_conversion function 2017-12-27 14:08:34 +08:00
Dong Yan
a1f6044cba rewrite selection function of ActionNode for clarity, add and delete some notes 2017-12-27 11:43:04 +08:00
Dong Yan
c788b253fb show the stdout of player.py for debugging 2017-12-27 01:04:09 +08:00
Dong Yan
7f0565a5f6 variable rename and delete redundant code 2017-12-26 22:19:10 +08:00
Dong Yan
0c3ff3bf37 delete unused code 2017-12-26 19:29:35 +08:00
Dong Yan
029ab199f4 add softmax for mcts root node 2017-12-26 16:47:24 +08:00
Dong Yan
8f508c790b add role for mcts debug 2017-12-26 15:07:15 +08:00
Dong Yan
aa6b5434c6 add debug info for mcts and add softmax for the prior 2017-12-26 14:46:14 +08:00
rtz19970824
725fc2c04e pass the checkpoint path to the model 2017-12-26 13:17:46 +08:00
rtz19970824
76f641a0f1 minor fixed 2017-12-25 16:51:44 +08:00
rtz19970824
76f6a0c470 merge conflict 2017-12-25 16:42:08 +08:00
rtz19970824
4379f4c0fd modify play.py for better experience 2017-12-25 16:40:38 +08:00
Dong Yan
fcb160dff6 fix python 2,3 print format error 2017-12-25 16:35:43 +08:00
Dong Yan
64da200e5d move , from inside of () to outside of () 2017-12-25 16:26:51 +08:00
mcgrady00h
4362d76432 Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-25 15:33:48 +08:00
mcgrady00h
0fdbaef1a1 add '()' to support python3 2017-12-25 15:33:17 +08:00
rtz19970824
70824a3612 remove historical file data.py 2017-12-25 15:09:26 +08:00
sproblvem
2b24f0760e Merge branch 'master' into mcts_virtual_loss 2017-12-24 21:27:54 +08:00
Dong Yan
89226b449a replace try catch by isinstance collections.Hashable 2017-12-24 20:57:53 +08:00
Dong Yan
f0074aa7ca fix bug of game config and add profiling functions to mcts 2017-12-24 17:43:45 +08:00
mcgrady00h
8c6f44a015 Merge remote-tracking branch 'origin' into mcts_virtual_loss 2017-12-24 15:49:45 +08:00
mcgrady00h
cf57144ce9 merge master 2017-12-24 15:47:11 +08:00
rtz19970824
2d9aa32758 change all copy to deepcopy 2017-12-24 14:41:40 +08:00
rtz19970824
77e8aa3c28 Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-24 14:40:57 +08:00
rtz19970824
74504ceb1d debug for go and reversi 2017-12-24 14:40:50 +08:00
Wenbo Hu
001263a683 use a simplified version of get_score 2017-12-24 12:07:56 +08:00
Dong Yan
426251e158 add some code for debug and profiling 2017-12-24 01:07:46 +08:00
JialianLee
162aa313b6 A new version of reversi 2017-12-24 00:42:59 +08:00
Dong Yan
dcf293d637 count the winning rate for each player 2017-12-23 22:05:34 +08:00
Dong Yan
919784e88b bug fix of model.py 2017-12-23 17:43:33 +08:00
rtz19970824
4589fcf521 add random preprocess, modify the uniform sample from training data 2017-12-23 16:27:09 +08:00
rtz19970824
a787f73cf6 add random preprocess, modify the uniform sample from training data 2017-12-23 16:27:09 +08:00