196 Commits

Author SHA1 Message Date
rtz19970824
4379f4c0fd modify play.py for better experience 2017-12-25 16:40:38 +08:00
rtz19970824
70824a3612 remove historical file data.py 2017-12-25 15:09:26 +08:00
sproblvem
9583a14856 Merge pull request #2 from sproblvem/mcts_virtual_loss
Mcts virtual loss
2017-12-24 21:29:13 +08:00
sproblvem
e8ac38c79e Merge branch 'master' into mcts_virtual_loss 2017-12-24 21:28:50 +08:00
sproblvem
2b24f0760e Merge branch 'master' into mcts_virtual_loss 2017-12-24 21:27:54 +08:00
Dong Yan
89226b449a replace try catch by isinstance collections.Hashable 2017-12-24 20:57:53 +08:00
Dong Yan
f0074aa7ca fix bug of game config and add profing functions to mcts 2017-12-24 17:43:45 +08:00
mcgrady00h
5aa5dcd191 add comments for mcts with virtual loss 2017-12-24 16:47:43 +08:00
mcgrady00h
8c6f44a015 Merge remote-tracking branch 'origin' into mcts_virtual_loss 2017-12-24 15:49:45 +08:00
mcgrady00h
cf57144ce9 merge master 2017-12-24 15:47:11 +08:00
mcgrady00h
941284e7b1 Merge remote-tracking branch 'origin' into mcts_virtual_loss 2017-12-24 15:44:30 +08:00
rtz19970824
2d9aa32758 change all copy to deepcopy 2017-12-24 14:41:40 +08:00
rtz19970824
77e8aa3c28 Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-24 14:40:57 +08:00
rtz19970824
74504ceb1d debug for go and reversi 2017-12-24 14:40:50 +08:00
Wenbo Hu
001263a683 use a simplified version of get_score 2017-12-24 12:07:56 +08:00
Dong Yan
426251e158 add some code for debug and profiling 2017-12-24 01:07:46 +08:00
JialianLee
162aa313b6 A new version of reversi 2017-12-24 00:42:59 +08:00
Dong Yan
dcf293d637 count the winning rate for each player 2017-12-23 22:05:34 +08:00
Dong Yan
8780417378 Merge branch 'master' of github.com:sproblvem/tianshou 2017-12-23 17:43:47 +08:00
Dong Yan
919784e88b bug fix of model.py 2017-12-23 17:43:33 +08:00
haoshengzou
b2b2d01d9c Merge remote-tracking branch 'origin/master' 2017-12-23 17:25:37 +08:00
haoshengzou
b21a55dc88 towards policy/value refactor 2017-12-23 17:25:16 +08:00
rtz19970824
4589fcf521 add random preprocess, modify the uniform sample from training data 2017-12-23 16:27:09 +08:00
rtz19970824
3f238864fb minor fixed for mcts, check finish for go 2017-12-23 15:58:06 +08:00
JialianLee
430b78abf5 Merge remote-tracking branch 'origin/master' 2017-12-23 15:49:39 +08:00
JialianLee
84208a7ac9 Modification for reversi.py 2017-12-23 15:43:45 +08:00
haoshengzou
8c13d8ebe6 Merge remote-tracking branch 'origin/master' 2017-12-23 15:36:44 +08:00
haoshengzou
04048b7873 fix imports to support both python2 and python3. move contents from __init__.py to leave for work after major development. 2017-12-23 15:36:10 +08:00
haoshengzou
951eed60ed fix imports to support both python2 and python3. move contents from __init__.py to leave for work after major development. 2017-12-23 15:34:44 +08:00
rtz19970824
b96fa9448b minor fixed 2017-12-23 14:45:07 +08:00
Dong Yan
d0e150f090 Merge branch 'master' of github.com:sproblvem/tianshou 2017-12-23 13:05:38 +08:00
Dong Yan
b2ef770415 connect reversi with game 2017-12-23 13:05:25 +08:00
JialianLee
032ea46b7b small modification 2017-12-23 09:47:08 +08:00
mcgrady00h
3b534064bd fix virtual loss bug 2017-12-23 02:48:53 +08:00
rtz19970824
d19559857f merge 2017-12-22 17:17:50 +08:00
rtz19970824
67ba76a04d implement a stochastic sample training method 2017-12-22 17:16:44 +08:00
Dong Yan
c5e33af841 move the unit test of is_eye into go.py 2017-12-22 15:44:44 +08:00
JialianLee
511f64b3d6 Modification for reversi 2017-12-22 15:26:47 +08:00
rtz19970824
d42a76f8f3 Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-22 13:47:39 +08:00
rtz19970824
ed96268454 Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-22 13:47:38 +08:00
rtz19970824
8328153b86 print in the loading process 2017-12-22 13:47:27 +08:00
rtz19970824
a8509ba292 faster the loading 2017-12-22 13:42:53 +08:00
rtz19970824
5f296ce009 merge 2017-12-22 13:31:41 +08:00
rtz19970824
6b3efd7fca modify the training config 2017-12-22 13:30:48 +08:00
rtz19970824
d281ecc6e0 no restrict on saving checkpoints 2017-12-22 13:05:01 +08:00
rtz19970824
2b1285143c debug the training process, initialize a nameserver if no nameserver exists 2017-12-22 13:04:02 +08:00
JialianLee
5c29dad263 An initial version for Reversi 2017-12-22 01:57:48 +08:00
Haosheng Zou
8ba16a8808 Merge remote-tracking branch 'origin/master' 2017-12-22 00:24:06 +08:00
Haosheng Zou
1cc5063007 add value_function (critic). value_function and policy not finished yet. 2017-12-22 00:22:23 +08:00
rtz19970824
6835ec62e1 multi-instance support 2017-12-22 00:04:51 +08:00