Tianshou

Author	SHA1	Message	Date
Dong Yan	6a410384bb	rewrite _is_qi in a more understandable way	2017-12-19 00:47:21 +08:00
Dong Yan	1a164d4d7d	rewrite _is_qi in a more understandable way	2017-12-19 00:47:21 +08:00
Dong Yan	14da3200ff	Merge branch 'master' of github.com:sproblvem/tianshou	2017-12-19 00:16:33 +08:00
Dong Yan	243bbaff64	Merge branch 'master' of github.com:sproblvem/tianshou	2017-12-19 00:16:33 +08:00
Dong Yan	ea52096713	delete unused parameter of _find_block, and using _find_group to replace _find_block	2017-12-19 00:16:21 +08:00
Dong Yan	fb511aa76d	delete unused parameter of _find_block, and using _find_group to replace _find_block	2017-12-19 00:16:21 +08:00
Tongzheng Ren	6b6c48f122	update gitignore	2017-12-18 23:34:32 +08:00
Tongzheng Ren	0e1287b5cb	update gitignore	2017-12-18 23:34:32 +08:00
Tongzheng Ren	75bc2968d2	add a detailed Chinese google coding style for convenience	2017-12-18 23:32:41 +08:00
Tongzheng Ren	27c1017259	add a detailed Chinese google coding style for convenience	2017-12-18 23:32:41 +08:00
宋世虹	7693c38f44	add comments and todos	2017-12-17 13:28:21 +08:00
宋世虹	d220f7f2a8	add comments and todos	2017-12-17 13:28:21 +08:00
宋世虹	62e2c6582d	finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit	2017-12-17 12:52:00 +08:00
宋世虹	3624cc9036	finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit	2017-12-17 12:52:00 +08:00
Dong Yan	e10acf5130	0. code refactor, try to merge Go and GoEnv	2017-12-16 23:29:11 +08:00
Dong Yan	31199c7d0d	0. code refactor, try to merge Go and GoEnv	2017-12-16 23:29:11 +08:00
Dong Yan	431f551ce9	check if the network weights exists for every player	2017-12-16 14:55:19 +08:00
Dong Yan	01c0c2483a	check if the network weights exists for every player	2017-12-16 14:55:19 +08:00
Dong Yan	b8bdfea8bd	start the player server in a more robost way.	2017-12-16 14:33:31 +08:00
Dong Yan	d115c586d4	start the player server in a more robost way.	2017-12-16 14:33:31 +08:00
Dong Yan	6cb4b02fca	merge class strategy with class game. Next, merge Go with GoEnv	2017-12-15 22:19:44 +08:00
Dong Yan	4fc50c5f1b	merge class strategy with class game. Next, merge Go with GoEnv	2017-12-15 22:19:44 +08:00
rtz19970824	00f599bba3	assign TODO to Haosheng and Tongzheng	2017-12-15 14:27:04 +08:00
rtz19970824	d0bdccc25a	assign TODO to Haosheng and Tongzheng	2017-12-15 14:27:04 +08:00
rtz19970824	ea541ed559	Merge branch 'master' of https://github.com/sproblvem/tianshou	2017-12-15 14:24:15 +08:00
rtz19970824	cb9540b91c	Merge branch 'master' of https://github.com/sproblvem/tianshou	2017-12-15 14:24:15 +08:00
rtz19970824	0874d5342f	implement dqn loss and dpg loss, add TODO for separate actor and critic	2017-12-15 14:24:08 +08:00
rtz19970824	e5bf7a9270	implement dqn loss and dpg loss, add TODO for separate actor and critic	2017-12-15 14:24:08 +08:00
Haosheng Zou	9ed3e7b092	minor fix	2017-12-14 19:46:38 +08:00
Haosheng Zou	92deae9f8d	minor fix	2017-12-14 19:46:38 +08:00
Haosheng Zou	f496725437	add dqn.py to write	2017-12-13 22:43:45 +08:00
Haosheng Zou	039c8140e2	add dqn.py to write	2017-12-13 22:43:45 +08:00
Haosheng Zou	72ae304ab3	preliminary design of dqn_example, dqn interface. identify the assign of networks	2017-12-13 20:47:45 +08:00
Haosheng Zou	7ab211b63c	preliminary design of dqn_example, dqn interface. identify the assign of networks	2017-12-13 20:47:45 +08:00
Wenbo Hu	d280260a46	Merge pull request #1 from sproblvem/add_rules Add rules	2017-12-13 14:35:39 +08:00
Wenbo Hu	657422a4ed	Merge pull request #1 from sproblvem/add_rules Add rules	2017-12-13 14:35:39 +08:00
Wenbo Hu	3f3d7b56f5	minor indent fix	2017-12-12 23:16:50 +08:00
Wenbo Hu	d52ee30259	add nearby stones	2017-12-12 23:13:31 +08:00
Wenbo Hu	f820aab008	change mcts steps	2017-12-12 20:37:57 +08:00
Wenbo Hu	848b8f0399	minor fix	2017-12-12 17:09:26 +08:00
rtz19970824	9791ad386e	Merge branch 'master' of https://github.com/sproblvem/tianshou	2017-12-12 16:54:52 +08:00
Wenbo Hu	44fbccd380	add stone estimation using nearby stone for those UNKNOWs	2017-12-13 00:35:18 +08:00
rtz19970824	e88d651400	minor fixed on self play	2017-12-11 15:56:16 +08:00
rtz19970824	715f7be6a8	update the policy	2017-12-11 13:38:24 +08:00
rtz19970824	0c4a83f3eb	vanilla policy gradient	2017-12-11 13:37:27 +08:00
haosheng	88ecaa332d	minor fix in core/policy	2017-12-11 13:25:22 +08:00
Dong Yan	e3c0478fa0	Merge branch 'master' of github.com:sproblvem/tianshou	2017-12-10 20:23:30 +08:00
Dong Yan	cacf31657b	supporting self-play between different version of neural netowrks	2017-12-10 20:23:10 +08:00
haosheng	972044c39d	minor fix	2017-12-10 17:33:10 +08:00
haosheng	a00b930c2c	fix naming and comments of coding style, delete .json	2017-12-10 17:23:13 +08:00

... 3 4 5 6 7

341 Commits