Tianshou

Author	SHA1	Message	Date
Trinkle23897	3243484f8e	show stat in pytest	2020-05-16 08:48:12 +08:00
Trinkle23897	e58fc78546	build docs	2020-04-29 14:16:38 +08:00
Trinkle23897	80d661907e	Multimodal obs (#38 , #27 , #25 )	2020-04-28 20:56:02 +08:00
Trinkle23897	6b96f124ae	fix pdqn	2020-04-26 15:11:20 +08:00
rocknamx	b23749463e	Prioritized DQN (#30 ) * add sum_tree.py * add prioritized replay buffer * del sum_tree.py * fix some format issues * fix weight_update bug * simply replace replaybuffer in test_dqn without weight update * weight default set to 1 * fix sampling bug when buffer is not full * rename parameter * fix formula error, add accuracy check * add PrioritizedDQN test * add test_pdqn.py * add update_weight() doc * add ref of prio dqn in readme.md and index.rst * restore test_dqn.py, fix args of test_pdqn.py	2020-04-26 12:05:58 +08:00
Trinkle23897	6bf1ea644d	fix ppo	2020-04-19 14:30:42 +08:00
Trinkle23897	680fc0ffbe	gae	2020-04-14 21:11:06 +08:00
Trinkle23897	7b65d43394	vanilla imitation learning	2020-04-13 19:37:27 +08:00
Trinkle23897	befdfb07e8	polish docs	2020-04-11 19:29:46 +08:00
Trinkle23897	6a244d1fbb	save_fn	2020-04-11 16:54:27 +08:00
Trinkle23897	ecfcb9f295	fix docs	2020-04-10 11:16:33 +08:00
Trinkle23897	3cc22b7c0c	__call__ -> forward	2020-04-10 10:47:16 +08:00
Trinkle23897	e0809ff135	add policy docs (#21 )	2020-04-06 19:36:59 +08:00
Trinkle23897	610390c132	add docs of collector and trainer (#20 )	2020-04-05 18:34:45 +08:00
Trinkle23897	b6c9db6b0b	docs for env	2020-04-04 21:02:06 +08:00
Trinkle23897	974ade8019	add some docs	2020-04-03 21:28:12 +08:00
Trinkle23897	7cb5146611	add docs of trick	2020-04-02 21:57:26 +08:00
Trinkle23897	0e86d44860	finish concepts	2020-04-02 12:31:22 +08:00
Trinkle23897	0acd0d164c	test api doc	2020-04-02 09:07:04 +08:00
Trinkle23897	4f843d3f51	update readme	2020-04-01 10:21:58 +08:00
Trinkle23897	04208e6cce	update some tutorial	2020-03-30 22:52:25 +08:00
Trinkle23897	2169dd2201	update high-res logo	2020-03-29 15:52:47 +08:00
Trinkle23897	4e7df7616a	update dqn tutorial	2020-03-29 15:18:33 +08:00
Trinkle23897	d9e4b9d16f	upd doc	2020-03-29 10:22:03 +08:00
Trinkle23897	57735ce1b5	add logo and sphinx setup	2020-03-28 22:01:23 +08:00
Minghao Zhang	77068af526	add examples, fix some bugs (#5 ) * update atari.py * fix setup.py pass the pytest * fix setup.py pass the pytest * add args "render" * change the tensorboard writter * change the tensorboard writter * change device, render, tensorboard log location * change device, render, tensorboard log location * remove some wrong local files * fix some tab mistakes and the envs name in continuous/test_xx.py * add examples and point robot maze environment * fix some bugs during testing examples * add dqn network and fix some args * change back the tensorboard writter's frequency to ensure ppo and a2c can write things normally * add a warning to collector * rm some unrelated files * reformat * fix a bug in test_dqn due to the model wrong selection	2020-03-28 07:27:18 +08:00
Trinkle23897	c505cd8205	update readme	2020-03-26 11:42:34 +08:00

27 Commits