195 Commits

Author SHA1 Message Date
Wenbo
0131bcdc85 fix minor 2018-01-17 15:57:41 +08:00
Wenbo
0e4aa44ebb add deepcopy for hash, add some testing 2018-01-17 15:54:46 +08:00
Wenbo Hu
e76ccaee80 add union set for do_move and is_valid 2018-01-16 14:10:56 +08:00
rtz19970824
3b222f5edb add an args to intrigue training 2018-01-13 15:59:57 +08:00
rtz19970824
2e8662889f add multi-thread for end-to-end training 2018-01-13 15:57:41 +08:00
rtz19970824
fcaa571b42 add the interface in engine.py 2018-01-12 21:48:01 +08:00
Dong Yan
68cc63144f fix the hash conflict bug 2018-01-12 21:08:07 +08:00
rtz19970824
90ffdcbb1f check the latest checkpoint while self play 2018-01-12 19:16:44 +08:00
rtz19970824
c217aa165d add some error message for better debugging 2018-01-12 17:17:03 +08:00
Dong Yan
e58df65301 fix the async bug between think and do move checking, which introduced by bobo 2018-01-11 21:00:32 +08:00
Dong Yan
afc55ed9c2 refactor code to avoid memory leak 2018-01-11 17:02:36 +08:00
Dong Yan
5482815de6 replace two isolated player process by two different set of variables in the tf graph 2018-01-10 23:27:17 +08:00
Dong Yan
f425085e0a fix the tf assign error of copy the trained variable from black to white 2018-01-09 21:16:35 +08:00
rtz19970824
c2775df8e6 modify game.py for multi-player 2018-01-09 20:09:48 +08:00
rtz19970824
eb0ce95919 modify model.py for multi-player 2018-01-09 19:50:37 +08:00
Tongzheng Ren
891c5b1e47 Merge branch 'master' of https://github.com/sproblvem/tianshou 2018-01-08 21:21:08 +08:00
Tongzheng Ren
f2edc4896e modify play.py for avoiding potential bug 2018-01-08 21:19:17 +08:00
rtz19970824
32b7b33ed5 debug: we should estimate our own win rate 2018-01-08 16:19:59 +08:00
haoshengzou
88648f0c4b Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-31 15:56:19 +08:00
Wenbo Hu
50e8ea36e8 merge 2017-12-29 03:31:57 +08:00
Wenbo Hu
63a0d32b34 use hash table for check_global_isomorphous 2017-12-29 03:30:09 +08:00
rtz19970824
2dfab68efe debug for unit test 2017-12-28 19:28:21 +08:00
rtz19970824
b699258e76 debug for reversi 2017-12-28 15:55:07 +08:00
Dong Yan
08b6649fea test next_action.next_state in MCTS 2017-12-28 15:52:31 +08:00
Dong Yan
47676993fd solve the performance bottleneck by only hashing the last board 2017-12-28 01:16:24 +08:00
Dong Yan
d48982d59e move evaluator from action node to mcts 2017-12-27 20:49:54 +08:00
rtz19970824
f2291efc72 check exists when save data 2017-12-27 19:54:36 +08:00
Dong Yan
9f60984973 remove type_conversion function 2017-12-27 14:08:34 +08:00
Dong Yan
a1f6044cba rewrite selection function of ActionNode for clarity, add and delete some notes 2017-12-27 11:43:04 +08:00
Dong Yan
c788b253fb show the stdout of player.py for debugging 2017-12-27 01:04:09 +08:00
Dong Yan
7f0565a5f6 variable rename and delete redundant code 2017-12-26 22:19:10 +08:00
Dong Yan
0c3ff3bf37 delete unused code 2017-12-26 19:29:35 +08:00
Dong Yan
029ab199f4 add softmax for mcts root node 2017-12-26 16:47:24 +08:00
Dong Yan
8f508c790b add role for mcts debug 2017-12-26 15:07:15 +08:00
Dong Yan
aa6b5434c6 add debuf info for mcts and add softmax for the prior 2017-12-26 14:46:14 +08:00
rtz19970824
725fc2c04e pass the checkpoint path to the model 2017-12-26 13:17:46 +08:00
rtz19970824
76f641a0f1 minor fixed 2017-12-25 16:51:44 +08:00
rtz19970824
76f6a0c470 merge conflict 2017-12-25 16:42:08 +08:00
rtz19970824
4379f4c0fd modify play.py for better experience 2017-12-25 16:40:38 +08:00
Dong Yan
fcb160dff6 fix python 2,3 print format error 2017-12-25 16:35:43 +08:00
Dong Yan
64da200e5d move , from inside of () to outside of () 2017-12-25 16:26:51 +08:00
mcgrady00h
4362d76432 Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-25 15:33:48 +08:00
mcgrady00h
0fdbaef1a1 add '()' to support python3 2017-12-25 15:33:17 +08:00
rtz19970824
70824a3612 remove historical file data.py 2017-12-25 15:09:26 +08:00
sproblvem
2b24f0760e Merge branch 'master' into mcts_virtual_loss 2017-12-24 21:27:54 +08:00
Dong Yan
89226b449a replace try catch by isinstance collections.Hashable 2017-12-24 20:57:53 +08:00
Dong Yan
f0074aa7ca fix bug of game config and add profing functions to mcts 2017-12-24 17:43:45 +08:00
mcgrady00h
8c6f44a015 Merge remote-tracking branch 'origin' into mcts_virtual_loss 2017-12-24 15:49:45 +08:00
mcgrady00h
cf57144ce9 merge master 2017-12-24 15:47:11 +08:00
rtz19970824
2d9aa32758 change all copy to deepcopy 2017-12-24 14:41:40 +08:00