Tianshou

hongshaorou/Tianshou

Commit Graph

Author

SHA1

Message

Date

youkaichao

e767de044b

Remove dummy net code (#123)

* remove dummy net; delete two files

* split code to have backbone and head

* rename class

* change torch.float to torch.float32

* use flatten(1) instead of view(batch, -1)

* remove dummy net in docs

* bugfix for rnn

* fix cuda error

* minor fix of docs

* do not change the example code in dqn tutorial, since it is for demonstration

Co-authored-by: Trinkle23897 <463003665@qq.com>

2020-07-09 22:57:01 +08:00

danagi

60cfc373f8

fix #98, support #99 (#102)

* Add auto alpha tuning and exploration noise for sac.
Add class BaseNoise and GaussianNoise for the concept of exploration noise.
Add new test for sac tested in MountainCarContinuous-v0,
which should benefit from the two new features above.

* add exploration noise to collector, fix example to adapt modification

* fix #98

* enable off-policy to update multiple times in one step. (#99)

2020-06-27 21:40:09 +08:00

Oblivion

9380368ca3

add an example of bullet env (experiment from jiqizhixin) (#15)

* add_pybullet_ens_test

test on pybullet envs
modify some log config

* delete DS_Store file

* add pybullet_envs test

add HalfCheetahBulletEnv-v0 test
modify log config

* fix pep 8 errors

* add pybullet to dev

* delete a line

* bypass F401

* add log_interval to onpolicy_trainer

* add comments

* Update halfcheetahBullet_v0_sac.py

2020-04-04 11:46:18 +08:00

3 Commits