320 Commits

Author SHA1 Message Date
JialianLee
1eb46774c2 small modification 2017-12-23 09:47:08 +08:00
mcgrady00h
3b534064bd fix virtual loss bug 2017-12-23 02:48:53 +08:00
rtz19970824
d19559857f merge 2017-12-22 17:17:50 +08:00
rtz19970824
5b044c9a0c merge 2017-12-22 17:17:50 +08:00
rtz19970824
67ba76a04d implement a stochastic sample training method 2017-12-22 17:16:44 +08:00
rtz19970824
d8c0eae6a3 implement a stochastic sample training method 2017-12-22 17:16:44 +08:00
Dong Yan
c5e33af841 move the unit test of is_eye into go.py 2017-12-22 15:44:44 +08:00
Dong Yan
51b8e7fc18 move the unit test of is_eye into go.py 2017-12-22 15:44:44 +08:00
JialianLee
511f64b3d6 Modification for reversi 2017-12-22 15:26:47 +08:00
JialianLee
7964064242 Modification for reversi 2017-12-22 15:26:47 +08:00
rtz19970824
a062c7610c Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-22 13:47:39 +08:00
rtz19970824
d42a76f8f3 Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-22 13:47:39 +08:00
rtz19970824
ed96268454 Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-22 13:47:38 +08:00
rtz19970824
e72fd52913 Merge branch 'master' of https://github.com/sproblvem/tianshou 2017-12-22 13:47:38 +08:00
rtz19970824
e8a10f189e print in the loading process 2017-12-22 13:47:27 +08:00
rtz19970824
8328153b86 print in the loading process 2017-12-22 13:47:27 +08:00
rtz19970824
a8509ba292 faster the loading 2017-12-22 13:42:53 +08:00
rtz19970824
97161f37ef faster the loading 2017-12-22 13:42:53 +08:00
rtz19970824
5f296ce009 merge 2017-12-22 13:31:41 +08:00
rtz19970824
9824cf8bef merge 2017-12-22 13:31:41 +08:00
rtz19970824
6b3efd7fca modify the training config 2017-12-22 13:30:48 +08:00
rtz19970824
d151f71ee3 modify the training config 2017-12-22 13:30:48 +08:00
rtz19970824
d281ecc6e0 no restrict on saving checkpoints 2017-12-22 13:05:01 +08:00
rtz19970824
62a241e763 no restrict on saving checkpoints 2017-12-22 13:05:01 +08:00
rtz19970824
2b1285143c debug the training process, initialize a nameserver if no nameserver exists 2017-12-22 13:04:02 +08:00
rtz19970824
e75883a5fb debug the training process, initialize a nameserver if no nameserver exists 2017-12-22 13:04:02 +08:00
JialianLee
5c29dad263 An initial version for Reversi 2017-12-22 01:57:48 +08:00
JialianLee
7f1191ef02 An initial version for Reversi 2017-12-22 01:57:48 +08:00
Haosheng Zou
8ba16a8808 Merge remote-tracking branch 'origin/master' 2017-12-22 00:24:06 +08:00
Haosheng Zou
b32418d11a Merge remote-tracking branch 'origin/master' 2017-12-22 00:24:06 +08:00
Haosheng Zou
1cc5063007 add value_function (critic). value_function and policy not finished yet. 2017-12-22 00:22:23 +08:00
Haosheng Zou
6611d948dd add value_function (critic). value_function and policy not finished yet. 2017-12-22 00:22:23 +08:00
rtz19970824
6835ec62e1 multi-instance support 2017-12-22 00:04:51 +08:00
rtz19970824
a61c1f136a multi-instance support 2017-12-22 00:04:51 +08:00
rtz19970824
43f6527d8e modify for multi instance 2017-12-21 23:55:31 +08:00
rtz19970824
a20255249c modify for multi instance 2017-12-21 23:55:31 +08:00
rtz19970824
6bb34afba5 merge conflict 2017-12-21 23:36:57 +08:00
rtz19970824
ff2ebd49c1 merge conflict 2017-12-21 23:36:57 +08:00
rtz19970824
9ad53de54f implement the training process 2017-12-21 23:30:24 +08:00
rtz19970824
c11eccbc90 implement the training process 2017-12-21 23:30:24 +08:00
Dong Yan
2acb1aab07 eliminate all references of Game class in Go class 2017-12-21 22:48:53 +08:00
Dong Yan
c3e9e55b24 eliminate all references of Game class in Go class 2017-12-21 22:48:53 +08:00
rtz19970824
eda7ed07a1 implement data collection and part of training 2017-12-21 21:01:25 +08:00
rtz19970824
2dad8e4020 implement data collection and part of training 2017-12-21 21:01:25 +08:00
Wenbo Hu
ced63af18f fixing bug pass parameterg 2017-12-21 19:31:51 +08:00
Wenbo Hu
1e2567c174 fixing bug pass parameterg 2017-12-21 19:31:51 +08:00
Wenbo Hu
00d2aa86bf repair komi. add todo for forbid pass: 2017-12-20 22:57:58 +08:00
Wenbo Hu
336cede197 repair komi. add todo for forbid pass: 2017-12-20 22:57:58 +08:00
Wenbo Hu
f0d59dab6c forbid pass, if we have other choices 2017-12-20 22:10:47 +08:00
Wenbo Hu
40909fa994 forbid pass, if we have other choices 2017-12-20 22:10:47 +08:00