Dong Yan
|
afc55ed9c2
|
refactor code to avoid memory leak
|
2018-01-11 17:02:36 +08:00 |
|
Dong Yan
|
5482815de6
|
replace two isolated player processes with two different sets of variables in the tf graph
|
2018-01-10 23:27:17 +08:00 |
|
Dong Yan
|
f425085e0a
|
fix the tf assign error when copying the trained variables from black to white
|
2018-01-09 21:16:35 +08:00 |
|
rtz19970824
|
c2775df8e6
|
modify game.py for multi-player
|
2018-01-09 20:09:48 +08:00 |
|
rtz19970824
|
eb0ce95919
|
modify model.py for multi-player
|
2018-01-09 19:50:37 +08:00 |
|
Tongzheng Ren
|
891c5b1e47
|
Merge branch 'master' of https://github.com/sproblvem/tianshou
|
2018-01-08 21:21:08 +08:00 |
|
Tongzheng Ren
|
f2edc4896e
|
modify play.py to avoid a potential bug
|
2018-01-08 21:19:17 +08:00 |
|
rtz19970824
|
32b7b33ed5
|
debug: we should estimate our own win rate
|
2018-01-08 16:19:59 +08:00 |
|
haoshengzou
|
88648f0c4b
|
Merge branch 'master' of https://github.com/sproblvem/tianshou
|
2017-12-31 15:56:19 +08:00 |
|
Wenbo Hu
|
50e8ea36e8
|
merge
|
2017-12-29 03:31:57 +08:00 |
|
Wenbo Hu
|
63a0d32b34
|
use hash table for check_global_isomorphous
|
2017-12-29 03:30:09 +08:00 |
|
rtz19970824
|
2dfab68efe
|
debug for unit test
|
2017-12-28 19:28:21 +08:00 |
|
rtz19970824
|
b699258e76
|
debug for reversi
|
2017-12-28 15:55:07 +08:00 |
|
Dong Yan
|
08b6649fea
|
test next_action.next_state in MCTS
|
2017-12-28 15:52:31 +08:00 |
|
Dong Yan
|
47676993fd
|
solve the performance bottleneck by only hashing the last board
|
2017-12-28 01:16:24 +08:00 |
|
Dong Yan
|
d48982d59e
|
move evaluator from action node to mcts
|
2017-12-27 20:49:54 +08:00 |
|
rtz19970824
|
f2291efc72
|
check existence when saving data
|
2017-12-27 19:54:36 +08:00 |
|
Dong Yan
|
9f60984973
|
remove type_conversion function
|
2017-12-27 14:08:34 +08:00 |
|
Dong Yan
|
a1f6044cba
|
rewrite selection function of ActionNode for clarity, add and delete some notes
|
2017-12-27 11:43:04 +08:00 |
|
Dong Yan
|
c788b253fb
|
show the stdout of player.py for debugging
|
2017-12-27 01:04:09 +08:00 |
|
Dong Yan
|
7f0565a5f6
|
variable rename and delete redundant code
|
2017-12-26 22:19:10 +08:00 |
|
Dong Yan
|
0c3ff3bf37
|
delete unused code
|
2017-12-26 19:29:35 +08:00 |
|
Dong Yan
|
029ab199f4
|
add softmax for mcts root node
|
2017-12-26 16:47:24 +08:00 |
|
Dong Yan
|
8f508c790b
|
add role for mcts debug
|
2017-12-26 15:07:15 +08:00 |
|
Dong Yan
|
aa6b5434c6
|
add debug info for mcts and add softmax for the prior
|
2017-12-26 14:46:14 +08:00 |
|
rtz19970824
|
725fc2c04e
|
pass the checkpoint path to the model
|
2017-12-26 13:17:46 +08:00 |
|
rtz19970824
|
76f641a0f1
|
minor fixes
|
2017-12-25 16:51:44 +08:00 |
|
rtz19970824
|
76f6a0c470
|
merge conflict
|
2017-12-25 16:42:08 +08:00 |
|
rtz19970824
|
4379f4c0fd
|
modify play.py for better experience
|
2017-12-25 16:40:38 +08:00 |
|
Dong Yan
|
fcb160dff6
|
fix python 2,3 print format error
|
2017-12-25 16:35:43 +08:00 |
|
Dong Yan
|
64da200e5d
|
move , from inside of () to outside of ()
|
2017-12-25 16:26:51 +08:00 |
|
mcgrady00h
|
4362d76432
|
Merge branch 'master' of https://github.com/sproblvem/tianshou
|
2017-12-25 15:33:48 +08:00 |
|
mcgrady00h
|
0fdbaef1a1
|
add '()' to support python3
|
2017-12-25 15:33:17 +08:00 |
|
rtz19970824
|
70824a3612
|
remove historical file data.py
|
2017-12-25 15:09:26 +08:00 |
|
sproblvem
|
2b24f0760e
|
Merge branch 'master' into mcts_virtual_loss
|
2017-12-24 21:27:54 +08:00 |
|
Dong Yan
|
89226b449a
|
replace try catch by isinstance collections.Hashable
|
2017-12-24 20:57:53 +08:00 |
|
Dong Yan
|
f0074aa7ca
|
fix bug in game config and add profiling functions to mcts
|
2017-12-24 17:43:45 +08:00 |
|
mcgrady00h
|
8c6f44a015
|
Merge remote-tracking branch 'origin' into mcts_virtual_loss
|
2017-12-24 15:49:45 +08:00 |
|
mcgrady00h
|
cf57144ce9
|
merge master
|
2017-12-24 15:47:11 +08:00 |
|
rtz19970824
|
2d9aa32758
|
change all copy to deepcopy
|
2017-12-24 14:41:40 +08:00 |
|
rtz19970824
|
77e8aa3c28
|
Merge branch 'master' of https://github.com/sproblvem/tianshou
|
2017-12-24 14:40:57 +08:00 |
|
rtz19970824
|
74504ceb1d
|
debug for go and reversi
|
2017-12-24 14:40:50 +08:00 |
|
Wenbo Hu
|
001263a683
|
use a simplified version of get_score
|
2017-12-24 12:07:56 +08:00 |
|
Dong Yan
|
426251e158
|
add some code for debug and profiling
|
2017-12-24 01:07:46 +08:00 |
|
JialianLee
|
162aa313b6
|
A new version of reversi
|
2017-12-24 00:42:59 +08:00 |
|
Dong Yan
|
dcf293d637
|
count the winning rate for each player
|
2017-12-23 22:05:34 +08:00 |
|
Dong Yan
|
919784e88b
|
bug fix of model.py
|
2017-12-23 17:43:33 +08:00 |
|
rtz19970824
|
4589fcf521
|
add random preprocess, modify the uniform sample from training data
|
2017-12-23 16:27:09 +08:00 |
|
rtz19970824
|
a787f73cf6
|
add random preprocess, modify the uniform sample from training data
|
2017-12-23 16:27:09 +08:00 |
|
rtz19970824
|
3f238864fb
|
minor fixed for mcts, check finish for go
|
2017-12-23 15:58:06 +08:00 |
|