rtz19970824
|
4379f4c0fd
|
modify play.py for better experience
|
2017-12-25 16:40:38 +08:00 |
|
rtz19970824
|
70824a3612
|
remove historical file data.py
|
2017-12-25 15:09:26 +08:00 |
|
sproblvem
|
2b24f0760e
|
Merge branch 'master' into mcts_virtual_loss
|
2017-12-24 21:27:54 +08:00 |
|
Dong Yan
|
89226b449a
|
replace try catch by isinstance collections.Hashable
|
2017-12-24 20:57:53 +08:00 |
|
Dong Yan
|
f0074aa7ca
|
fix bug of game config and add profiling functions to mcts
|
2017-12-24 17:43:45 +08:00 |
|
mcgrady00h
|
8c6f44a015
|
Merge remote-tracking branch 'origin' into mcts_virtual_loss
|
2017-12-24 15:49:45 +08:00 |
|
mcgrady00h
|
cf57144ce9
|
merge master
|
2017-12-24 15:47:11 +08:00 |
|
rtz19970824
|
2d9aa32758
|
change all copy to deepcopy
|
2017-12-24 14:41:40 +08:00 |
|
rtz19970824
|
77e8aa3c28
|
Merge branch 'master' of https://github.com/sproblvem/tianshou
|
2017-12-24 14:40:57 +08:00 |
|
rtz19970824
|
74504ceb1d
|
debug for go and reversi
|
2017-12-24 14:40:50 +08:00 |
|
Wenbo Hu
|
001263a683
|
use a simplified version of get_score
|
2017-12-24 12:07:56 +08:00 |
|
Dong Yan
|
426251e158
|
add some code for debug and profiling
|
2017-12-24 01:07:46 +08:00 |
|
JialianLee
|
162aa313b6
|
A new version of reversi
|
2017-12-24 00:42:59 +08:00 |
|
Dong Yan
|
dcf293d637
|
count the winning rate for each player
|
2017-12-23 22:05:34 +08:00 |
|
Dong Yan
|
919784e88b
|
bug fix of model.py
|
2017-12-23 17:43:33 +08:00 |
|
rtz19970824
|
4589fcf521
|
add random preprocess, modify the uniform sample from training data
|
2017-12-23 16:27:09 +08:00 |
|
rtz19970824
|
3f238864fb
|
minor fixes for mcts, check finish for go
|
2017-12-23 15:58:06 +08:00 |
|
JialianLee
|
430b78abf5
|
Merge remote-tracking branch 'origin/master'
|
2017-12-23 15:49:39 +08:00 |
|
JialianLee
|
84208a7ac9
|
Modification for reversi.py
|
2017-12-23 15:43:45 +08:00 |
|
rtz19970824
|
b96fa9448b
|
minor fixes
|
2017-12-23 14:45:07 +08:00 |
|
Dong Yan
|
d0e150f090
|
Merge branch 'master' of github.com:sproblvem/tianshou
|
2017-12-23 13:05:38 +08:00 |
|
Dong Yan
|
b2ef770415
|
connect reversi with game
|
2017-12-23 13:05:25 +08:00 |
|
JialianLee
|
032ea46b7b
|
small modification
|
2017-12-23 09:47:08 +08:00 |
|
rtz19970824
|
d19559857f
|
merge
|
2017-12-22 17:17:50 +08:00 |
|
rtz19970824
|
67ba76a04d
|
implement a stochastic sample training method
|
2017-12-22 17:16:44 +08:00 |
|
Dong Yan
|
c5e33af841
|
move the unit test of is_eye into go.py
|
2017-12-22 15:44:44 +08:00 |
|
JialianLee
|
511f64b3d6
|
Modification for reversi
|
2017-12-22 15:26:47 +08:00 |
|
rtz19970824
|
d42a76f8f3
|
Merge branch 'master' of https://github.com/sproblvem/tianshou
|
2017-12-22 13:47:39 +08:00 |
|
rtz19970824
|
ed96268454
|
Merge branch 'master' of https://github.com/sproblvem/tianshou
|
2017-12-22 13:47:38 +08:00 |
|
rtz19970824
|
8328153b86
|
print in the loading process
|
2017-12-22 13:47:27 +08:00 |
|
rtz19970824
|
a8509ba292
|
speed up the loading
|
2017-12-22 13:42:53 +08:00 |
|
rtz19970824
|
5f296ce009
|
merge
|
2017-12-22 13:31:41 +08:00 |
|
rtz19970824
|
6b3efd7fca
|
modify the training config
|
2017-12-22 13:30:48 +08:00 |
|
rtz19970824
|
d281ecc6e0
|
no restrict on saving checkpoints
|
2017-12-22 13:05:01 +08:00 |
|
rtz19970824
|
2b1285143c
|
debug the training process, initialize a nameserver if no nameserver exists
|
2017-12-22 13:04:02 +08:00 |
|
JialianLee
|
5c29dad263
|
An initial version for Reversi
|
2017-12-22 01:57:48 +08:00 |
|
rtz19970824
|
6835ec62e1
|
multi-instance support
|
2017-12-22 00:04:51 +08:00 |
|
rtz19970824
|
43f6527d8e
|
modify for multi instance
|
2017-12-21 23:55:31 +08:00 |
|
rtz19970824
|
6bb34afba5
|
merge conflict
|
2017-12-21 23:36:57 +08:00 |
|
rtz19970824
|
9ad53de54f
|
implement the training process
|
2017-12-21 23:30:24 +08:00 |
|
Dong Yan
|
2acb1aab07
|
eliminate all references of Game class in Go class
|
2017-12-21 22:48:53 +08:00 |
|
rtz19970824
|
eda7ed07a1
|
implement data collection and part of training
|
2017-12-21 21:01:25 +08:00 |
|
Wenbo Hu
|
00d2aa86bf
|
repair komi, add todo for forbid pass
|
2017-12-20 22:57:58 +08:00 |
|
Wenbo Hu
|
f0d59dab6c
|
forbid pass, if we have other choices
|
2017-12-20 22:10:47 +08:00 |
|
Wenbo Hu
|
e2c6b96e57
|
minor revision.
|
2017-12-20 21:52:30 +08:00 |
|
Wenbo Hu
|
cabbb21968
|
minor revision
|
2017-12-20 21:40:03 +08:00 |
|
Wenbo Hu
|
48e95a21ea
|
simulator process a valid set, instead of a single action
|
2017-12-20 21:35:35 +08:00 |
|
Wenbo Hu
|
50e306368f
|
checkpoint
|
2017-12-20 20:12:08 +08:00 |
|
rtz19970824
|
7fca90c61b
|
modify the mcts, refactor the network
|
2017-12-20 16:43:42 +08:00 |
|
Dong Yan
|
c2b46c44e7
|
merge Go and GoEnv finally!
|
2017-12-20 01:14:05 +08:00 |
|