Dong Yan | f425085e0a | fix the tf assign error of copy the trained variable from black to white | 2018-01-09 21:16:35 +08:00
rtz19970824 | c2775df8e6 | modify game.py for multi-player | 2018-01-09 20:09:48 +08:00
rtz19970824 | eb0ce95919 | modify model.py for multi-player | 2018-01-09 19:50:37 +08:00
rtz19970824 | 32b7b33ed5 | debug: we should estimate our own win rate | 2018-01-08 16:19:59 +08:00
Dong Yan | 08b6649fea | test next_action.next_state in MCTS | 2017-12-28 15:52:31 +08:00
Dong Yan | d48982d59e | move evaluator from action node to mcts | 2017-12-27 20:49:54 +08:00
Dong Yan | 9f60984973 | remove type_conversion function | 2017-12-27 14:08:34 +08:00
Dong Yan | 029ab199f4 | add softmax for mcts root node | 2017-12-26 16:47:24 +08:00
Dong Yan | aa6b5434c6 | add debug info for mcts and add softmax for the prior | 2017-12-26 14:46:14 +08:00
rtz19970824 | 74504ceb1d | debug for go and reversi | 2017-12-24 14:40:50 +08:00
Dong Yan | 426251e158 | add some code for debug and profiling | 2017-12-24 01:07:46 +08:00
Dong Yan | 919784e88b | bug fix of model.py | 2017-12-23 17:43:33 +08:00
rtz19970824 | 4589fcf521 | add random preprocess, modify the uniform sample from training data | 2017-12-23 16:27:09 +08:00
rtz19970824 | 67ba76a04d | implement a stochastic sample training method | 2017-12-22 17:16:44 +08:00
rtz19970824 | ed96268454 | Merge branch 'master' of https://github.com/sproblvem/tianshou | 2017-12-22 13:47:38 +08:00
rtz19970824 | a8509ba292 | faster the loading | 2017-12-22 13:42:53 +08:00
rtz19970824 | 5f296ce009 | merge | 2017-12-22 13:31:41 +08:00
rtz19970824 | 6b3efd7fca | modify the training config | 2017-12-22 13:30:48 +08:00
rtz19970824 | d281ecc6e0 | no restrict on saving checkpoints | 2017-12-22 13:05:01 +08:00
rtz19970824 | 2b1285143c | debug the training process, initialize a nameserver if no nameserver exists | 2017-12-22 13:04:02 +08:00
rtz19970824 | 9ad53de54f | implement the training process | 2017-12-21 23:30:24 +08:00
rtz19970824 | eda7ed07a1 | implement data collection and part of training | 2017-12-21 21:01:25 +08:00
rtz19970824 | 7fca90c61b | modify the mcts, refactor the network | 2017-12-20 16:43:42 +08:00