danagi 60cfc373f8
fix #98, support #99 (#102)
* Add automatic alpha tuning and exploration noise for SAC.
Add the classes BaseNoise and GaussianNoise to represent exploration noise.
Add a new SAC test on MountainCarContinuous-v0,
which should benefit from the two new features above.

* add exploration noise to the collector, update the examples to match this change

* fix #98

* enable off-policy algorithms to update multiple times in one step (#99)
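
The exploration-noise idea in the first two items can be sketched roughly as follows. Only the class names BaseNoise and GaussianNoise come from this commit message. The constructor arguments, the `reset` hook, the `add_exploration_noise` helper, and the [-1, 1] action bounds are assumptions made for illustration, not the project's actual API.

```python
import random
from abc import ABC, abstractmethod
from typing import List, Optional, Sequence


class BaseNoise(ABC):
    """Interface for action-space exploration noise (signature is assumed)."""

    @abstractmethod
    def __call__(self, size: int) -> List[float]:
        """Return `size` noise samples."""

    def reset(self) -> None:
        """Reset internal state; stateless noise has nothing to do here."""


class GaussianNoise(BaseNoise):
    """I.i.d. Gaussian noise with mean `mu` and standard deviation `sigma`."""

    def __init__(self, mu: float = 0.0, sigma: float = 1.0,
                 seed: Optional[int] = None) -> None:
        if sigma < 0:
            raise ValueError("sigma must be non-negative")
        self._mu = mu
        self._sigma = sigma
        self._rng = random.Random(seed)

    def __call__(self, size: int) -> List[float]:
        return [self._rng.gauss(self._mu, self._sigma) for _ in range(size)]


def add_exploration_noise(action: Sequence[float], noise: BaseNoise,
                          low: float = -1.0, high: float = 1.0) -> List[float]:
    """Hypothetical helper: perturb an action, then clip it to [low, high]."""
    samples = noise(len(action))
    return [min(max(a + n, low), high) for a, n in zip(action, samples)]
```

A collector could call such a helper on each policy action before stepping the environment. Clipping after the perturbation keeps the noisy action inside the bounds assumed here.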
2020-06-27 21:40:09 +08:00