Tianshou/modelfree at ec23c7efe9a013dde69e7fd4a11538651574c8a7 - Tianshou - Gitea: Git with a cup of tea

hongshaorou/Tianshou

History

n+e ec23c7efe9

fix qvalue mask_action error for obs_next (#310 )

* fix #309
* remove for-loop in dqn expl_noise

2021-03-15 08:06:24 +08:00

..

__init__.py

seealso and change policy dir structure

2020-04-09 21:36:53 +08:00

a2c.py

Remove reward_normaliztion option in offpolicy algorithm (#298 )

2021-02-27 11:20:43 +08:00

c51.py

fix qvalue mask_action error for obs_next (#310 )

2021-03-15 08:06:24 +08:00

ddpg.py

MuJoCo Benchmark - DDPG, TD3, SAC (#305 )

2021-03-07 19:21:02 +08:00

discrete_sac.py

Remove reward_normaliztion option in offpolicy algorithm (#298 )

2021-02-27 11:20:43 +08:00

dqn.py

fix qvalue mask_action error for obs_next (#310 )

2021-03-15 08:06:24 +08:00

pg.py

Remove reward_normaliztion option in offpolicy algorithm (#298 )

2021-02-27 11:20:43 +08:00

ppo.py

Remove reward_normaliztion option in offpolicy algorithm (#298 )

2021-02-27 11:20:43 +08:00

qrdqn.py

fix qvalue mask_action error for obs_next (#310 )

2021-03-15 08:06:24 +08:00

sac.py

MuJoCo Benchmark - DDPG, TD3, SAC (#305 )

2021-03-07 19:21:02 +08:00

td3.py

MuJoCo Benchmark - DDPG, TD3, SAC (#305 )

2021-03-07 19:21:02 +08:00