Tianshou/policy at 0c7117dd557e6e4b0ad3f61bb42accaaa08b1660 - Tianshou

hongshaorou/Tianshou

n+e ec23c7efe9

fix qvalue mask_action error for obs_next (#310)

* fix #309
* remove for-loop in dqn expl_noise

2021-03-15 08:06:24 +08:00

Remove reward_normaliztion option in offpolicy algorithm (#298)

2021-02-27 11:20:43 +08:00

Remove reward_normaliztion option in offpolicy algorithm (#298)

2021-02-27 11:20:43 +08:00

fix qvalue mask_action error for obs_next (#310)

2021-03-15 08:06:24 +08:00

Step collector implementation (#280)

2021-02-19 10:33:49 +08:00

__init__.py

Step collector implementation (#280)

2021-02-19 10:33:49 +08:00

base.py

Remove reward_normaliztion option in offpolicy algorithm (#298)

2021-02-27 11:20:43 +08:00

random.py

Trainer refactor : some definition change (#293)

2021-02-21 13:06:02 +08:00