Logo
Explore Help
Sign In
hongshaorou/Tianshou
1
0
Fork 0
You've already forked Tianshou
Code Issues Pull Requests Packages Projects Releases Wiki Activity
Tianshou/tianshou/policy/modelfree
History
Yi Su f3169b4c1f
Add Implicit Quantile Network (#371)
2021-05-29 09:44:23 +08:00
..
__init__.py
seealso and change policy dir structure
2020-04-09 21:36:53 +08:00
a2c.py
Support deterministic evaluation for onpolicy algorithms (#354)
2021-04-27 21:22:39 +08:00
c51.py
fix qvalue mask_action error for obs_next (#310)
2021-03-15 08:06:24 +08:00
ddpg.py
Fix SAC loss explode (#333)
2021-04-04 17:33:35 +08:00
discrete_sac.py
Remap action to fit gym's action space (#313)
2021-03-21 16:45:50 +08:00
dqn.py
Allow researchers to choose whether to use Double DQN (#368)
2021-05-21 10:53:34 +08:00
iqn.py
Add Implicit Quantile Network (#371)
2021-05-29 09:44:23 +08:00
npg.py
Support deterministic evaluation for onpolicy algorithms (#354)
2021-04-27 21:22:39 +08:00
pg.py
Support deterministic evaluation for onpolicy algorithms (#354)
2021-04-27 21:22:39 +08:00
ppo.py
Support deterministic evaluation for onpolicy algorithms (#354)
2021-04-27 21:22:39 +08:00
qrdqn.py
fix qvalue mask_action error for obs_next (#310)
2021-03-15 08:06:24 +08:00
sac.py
Fix SAC loss explode (#333)
2021-04-04 17:33:35 +08:00
td3.py
Fix SAC loss explode (#333)
2021-04-04 17:33:35 +08:00
trpo.py
Add discrete Conservative Q-Learning for offline RL (#359)
2021-05-12 09:24:48 +08:00
Powered by Gitea Version: 23.8.0 Page: 370ms Template: 9ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API