Tianshou

Author	SHA1	Message	Date
Trinkle23897	b6c9db6b0b	docs for env	2020-04-04 21:02:06 +08:00
Minghao Zhang	0b08a41610	move mujoco to examples (#12 ) * move mujoco to examples * fix the import mujoco bug * flake8 * flake8 * rm __init__.py	2020-04-02 08:49:19 +08:00
Minghao Zhang	eb7fb37806	fix PointMaze (#8 ) * update atari.py * fix setup.py pass the pytest * fix setup.py pass the pytest * add args "render" * change the tensorboard writter * change the tensorboard writter * change device, render, tensorboard log location * change device, render, tensorboard log location * remove some wrong local files * fix some tab mistakes and the envs name in continuous/test_xx.py * add examples and point robot maze environment * fix some bugs during testing examples * add dqn network and fix some args * change back the tensorboard writter's frequency to ensure ppo and a2c can write things normally * add a warning to collector * rm some unrelated files * reformat * fix a bug in test_dqn due to the model wrong selection * change atari frame skip and observation to improve performance * readd some files * change import * modified readme * rm tensorboard log * update atari and mujoco which are ignored * rm the wrong lines * readd the import of PointMaze * fix a typo in test/discrete/net.py * add a class declaration to pass the flake8 * fix flake8 errors	2020-03-28 14:36:12 +08:00
Minghao Zhang	3c0a09fefd	minor reformat (#2 ) * update atari.py * fix setup.py pass the pytest * fix setup.py pass the pytest	2020-03-26 09:01:20 +08:00
Trinkle23897	75364cd986	ppo and early stop	2020-03-20 19:52:29 +08:00
Trinkle23897	f16e05c0e7	maybe finished collector?	2020-03-13 17:49:22 +08:00
Trinkle23897	f58c1397c6	half of collector	2020-03-12 22:20:33 +08:00
Trinkle23897	0dfb900e29	env and data	2020-03-11 09:09:56 +08:00

8 Commits