Tianshou

Author	SHA1	Message	Date
Dong Yan	e58df65301	fix the async bug between think and do move checking, which introduced by bobo	2018-01-11 21:00:32 +08:00
Dong Yan	afc55ed9c2	refactor code to avoid memory leak	2018-01-11 17:02:36 +08:00
Dong Yan	5482815de6	replace two isolated player process by two different set of variables in the tf graph	2018-01-10 23:27:17 +08:00
Dong Yan	f425085e0a	fix the tf assign error of copy the trained variable from black to white	2018-01-09 21:16:35 +08:00
rtz19970824	c2775df8e6	modify game.py for multi-player	2018-01-09 20:09:48 +08:00
rtz19970824	eb0ce95919	modify model.py for multi-player	2018-01-09 19:50:37 +08:00
Tongzheng Ren	891c5b1e47	Merge branch 'master' of https://github.com/sproblvem/tianshou	2018-01-08 21:21:08 +08:00
Tongzheng Ren	f2edc4896e	modify play.py for avoiding potential bug	2018-01-08 21:19:17 +08:00
rtz19970824	32b7b33ed5	debug: we should estimate our own win rate	2018-01-08 16:19:59 +08:00
haoshengzou	88648f0c4b	Merge branch 'master' of https://github.com/sproblvem/tianshou	2017-12-31 15:56:19 +08:00
Wenbo Hu	50e8ea36e8	merge	2017-12-29 03:31:57 +08:00
Wenbo Hu	63a0d32b34	use hash table for check_global_isomorphous	2017-12-29 03:30:09 +08:00
rtz19970824	2dfab68efe	debug for unit test	2017-12-28 19:28:21 +08:00
rtz19970824	b699258e76	debug for reversi	2017-12-28 15:55:07 +08:00
Dong Yan	08b6649fea	test next_action.next_state in MCTS	2017-12-28 15:52:31 +08:00
Dong Yan	47676993fd	solve the performance bottleneck by only hashing the last board	2017-12-28 01:16:24 +08:00
Dong Yan	d48982d59e	move evaluator from action node to mcts	2017-12-27 20:49:54 +08:00
rtz19970824	f2291efc72	check exists when save data	2017-12-27 19:54:36 +08:00
Dong Yan	9f60984973	remove type_conversion function	2017-12-27 14:08:34 +08:00
Dong Yan	a1f6044cba	rewrite selection function of ActionNode for clarity, add and delete some notes	2017-12-27 11:43:04 +08:00
Dong Yan	c788b253fb	show the stdout of player.py for debugging	2017-12-27 01:04:09 +08:00
Dong Yan	7f0565a5f6	variable rename and delete redundant code	2017-12-26 22:19:10 +08:00
Dong Yan	0c3ff3bf37	delete unused code	2017-12-26 19:29:35 +08:00
Dong Yan	029ab199f4	add softmax for mcts root node	2017-12-26 16:47:24 +08:00
Dong Yan	8f508c790b	add role for mcts debug	2017-12-26 15:07:15 +08:00
Dong Yan	aa6b5434c6	add debuf info for mcts and add softmax for the prior	2017-12-26 14:46:14 +08:00
rtz19970824	725fc2c04e	pass the checkpoint path to the model	2017-12-26 13:17:46 +08:00
rtz19970824	76f641a0f1	minor fixed	2017-12-25 16:51:44 +08:00
rtz19970824	76f6a0c470	merge conflict	2017-12-25 16:42:08 +08:00
rtz19970824	4379f4c0fd	modify play.py for better experience	2017-12-25 16:40:38 +08:00
Dong Yan	fcb160dff6	fix python 2,3 print format error	2017-12-25 16:35:43 +08:00
Dong Yan	64da200e5d	move , from inside of () to outside of ()	2017-12-25 16:26:51 +08:00
mcgrady00h	4362d76432	Merge branch 'master' of https://github.com/sproblvem/tianshou	2017-12-25 15:33:48 +08:00
mcgrady00h	0fdbaef1a1	add '()' to support python3	2017-12-25 15:33:17 +08:00
rtz19970824	70824a3612	remove historical file data.py	2017-12-25 15:09:26 +08:00
sproblvem	2b24f0760e	Merge branch 'master' into mcts_virtual_loss	2017-12-24 21:27:54 +08:00
Dong Yan	89226b449a	replace try catch by isinstance collections.Hashable	2017-12-24 20:57:53 +08:00
Dong Yan	f0074aa7ca	fix bug of game config and add profing functions to mcts	2017-12-24 17:43:45 +08:00
mcgrady00h	8c6f44a015	Merge remote-tracking branch 'origin' into mcts_virtual_loss	2017-12-24 15:49:45 +08:00
mcgrady00h	cf57144ce9	merge master	2017-12-24 15:47:11 +08:00
rtz19970824	2d9aa32758	change all copy to deepcopy	2017-12-24 14:41:40 +08:00
rtz19970824	77e8aa3c28	Merge branch 'master' of https://github.com/sproblvem/tianshou	2017-12-24 14:40:57 +08:00
rtz19970824	74504ceb1d	debug for go and reversi	2017-12-24 14:40:50 +08:00
Wenbo Hu	001263a683	use a simplified version of get_score	2017-12-24 12:07:56 +08:00
Dong Yan	426251e158	add some code for debug and profiling	2017-12-24 01:07:46 +08:00
JialianLee	162aa313b6	A new version of reversi	2017-12-24 00:42:59 +08:00
Dong Yan	dcf293d637	count the winning rate for each player	2017-12-23 22:05:34 +08:00
Dong Yan	919784e88b	bug fix of model.py	2017-12-23 17:43:33 +08:00
rtz19970824	4589fcf521	add random preprocess, modify the uniform sample from training data	2017-12-23 16:27:09 +08:00
rtz19970824	a787f73cf6	add random preprocess, modify the uniform sample from training data	2017-12-23 16:27:09 +08:00

1 2 3 4

186 Commits