hongshaorou/Tianshou
Tianshou/tianshou/policy/modelfree
ChenDRAG  e605bdea94  MuJoCo Benchmark - DDPG, TD3, SAC (#305)  2021-03-07 19:21:02 +08:00
Releasing Tianshou's SOTA benchmark of 9 out of 13 environments from the MuJoCo Gym task suite.
__init__.py      seealso and change policy dir structure                          2020-04-09 21:36:53 +08:00
a2c.py           Remove reward_normaliztion option in offpolicy algorithm (#298)  2021-02-27 11:20:43 +08:00
c51.py           Remove reward_normaliztion option in offpolicy algorithm (#298)  2021-02-27 11:20:43 +08:00
ddpg.py          MuJoCo Benchmark - DDPG, TD3, SAC (#305)                         2021-03-07 19:21:02 +08:00
discrete_sac.py  Remove reward_normaliztion option in offpolicy algorithm (#298)  2021-02-27 11:20:43 +08:00
dqn.py           MuJoCo Benchmark - DDPG, TD3, SAC (#305)                         2021-03-07 19:21:02 +08:00
pg.py            Remove reward_normaliztion option in offpolicy algorithm (#298)  2021-02-27 11:20:43 +08:00
ppo.py           Remove reward_normaliztion option in offpolicy algorithm (#298)  2021-02-27 11:20:43 +08:00
qrdqn.py         Remove reward_normaliztion option in offpolicy algorithm (#298)  2021-02-27 11:20:43 +08:00
sac.py           MuJoCo Benchmark - DDPG, TD3, SAC (#305)                         2021-03-07 19:21:02 +08:00
td3.py           MuJoCo Benchmark - DDPG, TD3, SAC (#305)                         2021-03-07 19:21:02 +08:00