Tianshou/modelfree at f3169b4c1fb972d0673fd20951508d2650fe977b

hongshaorou/Tianshou

Yi Su f3169b4c1f

Add Implicit Quantile Network (#371)

2021-05-29 09:44:23 +08:00

__init__.py

seealso and change policy dir structure

2020-04-09 21:36:53 +08:00

a2c.py

Support deterministic evaluation for onpolicy algorithms (#354)

2021-04-27 21:22:39 +08:00

c51.py

fix qvalue mask_action error for obs_next (#310)

2021-03-15 08:06:24 +08:00

ddpg.py

Fix SAC loss explode (#333)

2021-04-04 17:33:35 +08:00

discrete_sac.py

Remap action to fit gym's action space (#313)

2021-03-21 16:45:50 +08:00

dqn.py

Allow researchers to choose whether to use Double DQN (#368)

2021-05-21 10:53:34 +08:00

iqn.py

Add Implicit Quantile Network (#371)

2021-05-29 09:44:23 +08:00

npg.py

Support deterministic evaluation for onpolicy algorithms (#354)

2021-04-27 21:22:39 +08:00

pg.py

Support deterministic evaluation for onpolicy algorithms (#354)

2021-04-27 21:22:39 +08:00

ppo.py

Support deterministic evaluation for onpolicy algorithms (#354)

2021-04-27 21:22:39 +08:00

qrdqn.py

fix qvalue mask_action error for obs_next (#310)

2021-03-15 08:06:24 +08:00

sac.py

Fix SAC loss explode (#333)

2021-04-04 17:33:35 +08:00

td3.py

Fix SAC loss explode (#333)

2021-04-04 17:33:35 +08:00

trpo.py

Add discrete Conservative Q-Learning for offline RL (#359)

2021-05-12 09:24:48 +08:00