Tianshou

Author	SHA1	Message	Date
Trinkle23897	befdfb07e8	polish docs	2020-04-11 19:29:46 +08:00
Trinkle23897	6a244d1fbb	save_fn	2020-04-11 16:54:27 +08:00
Trinkle23897	74407e13da	env info log_fn (#28 )	2020-04-10 18:02:05 +08:00
Trinkle23897	ecfcb9f295	fix docs	2020-04-10 11:16:33 +08:00
Trinkle23897	3cc22b7c0c	__call__ -> forward	2020-04-10 10:47:16 +08:00
Trinkle23897	13086b7f64	add ignore_obs_next in buffer	2020-04-10 09:01:17 +08:00
Trinkle23897	19f2cce294	seealso and change policy dir structure	2020-04-09 21:36:53 +08:00
Trinkle23897	6da80e045a	fix rnn (#19 ), add __repr__, and fix #26	2020-04-09 19:53:45 +08:00
Trinkle23897	86572c66d4	maybe finished rnn?	2020-04-08 21:13:15 +08:00
Trinkle23897	d9d2763dad	first version with full documentation v0.2.1	2020-04-07 11:50:34 +08:00
Trinkle23897	6c8edf6a3a	codecov badge	2020-04-07 11:17:10 +08:00
Trinkle23897	e0809ff135	add policy docs (#21 )	2020-04-06 19:36:59 +08:00
Trinkle23897	610390c132	add docs of collector and trainer (#20 )	2020-04-05 18:34:45 +08:00
Oblivion	4d4d0daf9e	Performance improve (#18 ) * improve performance set one thread for NN replace detach() op with torch.no_grad() * fix pep 8 errors	2020-04-05 09:10:21 +08:00
Trinkle23897	b6c9db6b0b	docs for env	2020-04-04 21:02:06 +08:00
Oblivion	9380368ca3	add an example of bullet env (experiment from jiqizhixin) (#15 ) * add_pybullet_ens_test test on pybullet envs modify some log config * delete DS_Store file * add pybullet_envs test add HalfCheetahBulletEnv-v0 test modify log config * fix pep 8 errors * add pybullet to dev * delete a line * by pass F401 * add log_interval to onpolicy_trainer * add comments * Update halfcheetahBullet_v0_sac.py	2020-04-04 11:46:18 +08:00
Trinkle23897	974ade8019	add some docs	2020-04-03 21:28:12 +08:00
Trinkle23897	6cfa876591	hot fix	2020-04-03 15:17:58 +08:00
Trinkle23897	7cb5146611	add docs of trick	2020-04-02 21:57:26 +08:00
Trinkle23897	0e86d44860	finish concepts	2020-04-02 12:31:22 +08:00
Trinkle23897	0acd0d164c	test api doc	2020-04-02 09:07:04 +08:00
Minghao Zhang	0b08a41610	move mujoco to examples (#12 ) * move mujoco to examples * fix the import mujoco bug * flake8 * flake8 * rm __init__.py	2020-04-02 08:49:19 +08:00
Trinkle23897	4f843d3f51	update readme	2020-04-01 10:21:58 +08:00
ShenDezhou	4da857d86e	Fix windows env setup bugs and other typo. (#11 )	2020-03-31 17:22:32 +08:00
Doxie	98feb79057	fix bug in discrete_net.py (#10 )	2020-03-31 16:13:53 +08:00
Trinkle23897	04208e6cce	update some tutorial	2020-03-30 22:52:25 +08:00
Trinkle23897	2169dd2201	update high-res logo	2020-03-29 15:52:47 +08:00
Trinkle23897	4e7df7616a	update dqn tutorial	2020-03-29 15:18:33 +08:00
Trinkle23897	d9e4b9d16f	upd doc	2020-03-29 10:22:03 +08:00
Trinkle23897	a326d30739	shorten quick start	2020-03-28 22:40:47 +08:00
Trinkle23897	57735ce1b5	add logo and sphinx setup	2020-03-28 22:01:23 +08:00
Trinkle23897	f23b0dfac9	add ListReplayBuffer	2020-03-28 15:14:41 +08:00
Minghao Zhang	eb7fb37806	fix PointMaze (#8 ) * update atari.py * fix setup.py pass the pytest * fix setup.py pass the pytest * add args "render" * change the tensorboard writter * change the tensorboard writter * change device, render, tensorboard log location * change device, render, tensorboard log location * remove some wrong local files * fix some tab mistakes and the envs name in continuous/test_xx.py * add examples and point robot maze environment * fix some bugs during testing examples * add dqn network and fix some args * change back the tensorboard writter's frequency to ensure ppo and a2c can write things normally * add a warning to collector * rm some unrelated files * reformat * fix a bug in test_dqn due to the model wrong selection * change atari frame skip and observation to improve performance * readd some files * change import * modified readme * rm tensorboard log * update atari and mujoco which are ignored * rm the wrong lines * readd the import of PointMaze * fix a typo in test/discrete/net.py * add a class declaration to pass the flake8 * fix flake8 errors	2020-03-28 14:36:12 +08:00
Trinkle23897	f68f23292e	update readme and force flake8	2020-03-28 13:27:01 +08:00
Minghao Zhang	068c4068ec	fix atari/mujoco env (#7 ) * update atari.py * fix setup.py pass the pytest * fix setup.py pass the pytest * add args "render" * change the tensorboard writter * change the tensorboard writter * change device, render, tensorboard log location * change device, render, tensorboard log location * remove some wrong local files * fix some tab mistakes and the envs name in continuous/test_xx.py * add examples and point robot maze environment * fix some bugs during testing examples * add dqn network and fix some args * change back the tensorboard writter's frequency to ensure ppo and a2c can write things normally * add a warning to collector * rm some unrelated files * reformat * fix a bug in test_dqn due to the model wrong selection * change atari frame skip and observation to improve performance * readd some files * change import * modified readme * rm tensorboard log * update atari and mujoco which are ignored * rm the wrong lines	2020-03-28 12:03:49 +08:00
Trinkle23897	c42990c725	add rllib result and fix pep8	2020-03-28 09:43:35 +08:00
Minghao Zhang	77068af526	add examples, fix some bugs (#5 ) * update atari.py * fix setup.py pass the pytest * fix setup.py pass the pytest * add args "render" * change the tensorboard writter * change the tensorboard writter * change device, render, tensorboard log location * change device, render, tensorboard log location * remove some wrong local files * fix some tab mistakes and the envs name in continuous/test_xx.py * add examples and point robot maze environment * fix some bugs during testing examples * add dqn network and fix some args * change back the tensorboard writter's frequency to ensure ppo and a2c can write things normally * add a warning to collector * rm some unrelated files * reformat * fix a bug in test_dqn due to the model wrong selection	2020-03-28 07:27:18 +08:00
sproblvem	acb93502cf	Update README.md change "Framework" to "Task"	2020-03-27 16:52:07 +08:00
Trinkle23897	044aae4355	add baseline and rlpyt result	2020-03-27 16:24:07 +08:00
Trinkle23897	44f911bc31	add pytorch drl result	2020-03-27 09:04:29 +08:00
Trinkle23897	519f9f20d0	update readme	2020-03-26 17:32:51 +08:00
Trinkle23897	c505cd8205	update readme	2020-03-26 11:42:34 +08:00
Minghao Zhang	3c0a09fefd	minor reformat (#2 ) * update atari.py * fix setup.py pass the pytest * fix setup.py pass the pytest	2020-03-26 09:01:20 +08:00
Trinkle23897	fdc969b830	fix collector	2020-03-25 14:08:28 +08:00
Trinkle23897	e95218e295	sac	2020-03-23 17:17:41 +08:00
Trinkle23897	30a0fc079c	td3	2020-03-23 11:34:52 +08:00
Trinkle23897	a87563b8e6	add demo of ppo continuous action task	2020-03-21 17:04:42 +08:00
Trinkle23897	c173f7bfbc	fix ddpg	2020-03-21 15:31:31 +08:00
Trinkle23897	8bd8246b16	refract test code	2020-03-21 10:58:01 +08:00
Trinkle23897	d64d78d769	seed???	2020-03-20 21:51:09 +08:00

1 2

72 Commits