Tianshou

Author	SHA1	Message	Date
Dong Yan	f425085e0a	fix the tf assign error of copy the trained variable from black to white	2018-01-09 21:16:35 +08:00
rtz19970824	c2775df8e6	modify game.py for multi-player	2018-01-09 20:09:48 +08:00
rtz19970824	eb0ce95919	modify model.py for multi-player	2018-01-09 19:50:37 +08:00
rtz19970824	32b7b33ed5	debug: we should estimate our own win rate	2018-01-08 16:19:59 +08:00
Dong Yan	08b6649fea	test next_action.next_state in MCTS	2017-12-28 15:52:31 +08:00
Dong Yan	d48982d59e	move evaluator from action node to mcts	2017-12-27 20:49:54 +08:00
Dong Yan	9f60984973	remove type_conversion function	2017-12-27 14:08:34 +08:00
Dong Yan	029ab199f4	add softmax for mcts root node	2017-12-26 16:47:24 +08:00
Dong Yan	aa6b5434c6	add debuf info for mcts and add softmax for the prior	2017-12-26 14:46:14 +08:00
rtz19970824	74504ceb1d	debug for go and reversi	2017-12-24 14:40:50 +08:00
Dong Yan	426251e158	add some code for debug and profiling	2017-12-24 01:07:46 +08:00
Dong Yan	919784e88b	bug fix of model.py	2017-12-23 17:43:33 +08:00
rtz19970824	4589fcf521	add random preprocess, modify the uniform sample from training data	2017-12-23 16:27:09 +08:00
rtz19970824	67ba76a04d	implement a stochastic sample training method	2017-12-22 17:16:44 +08:00
rtz19970824	ed96268454	Merge branch 'master' of https://github.com/sproblvem/tianshou	2017-12-22 13:47:38 +08:00
rtz19970824	a8509ba292	faster the loading	2017-12-22 13:42:53 +08:00
rtz19970824	5f296ce009	merge	2017-12-22 13:31:41 +08:00
rtz19970824	6b3efd7fca	modify the training config	2017-12-22 13:30:48 +08:00
rtz19970824	d281ecc6e0	no restrict on saving checkpoints	2017-12-22 13:05:01 +08:00
rtz19970824	2b1285143c	debug the training process, initialize a nameserver if no nameserver exists	2017-12-22 13:04:02 +08:00
rtz19970824	9ad53de54f	implement the training process	2017-12-21 23:30:24 +08:00
rtz19970824	eda7ed07a1	implement data collection and part of training	2017-12-21 21:01:25 +08:00
rtz19970824	7fca90c61b	modify the mcts, refactor the network	2017-12-20 16:43:42 +08:00

23 Commits