This website requires JavaScript.
Explore
Help
Sign In
hongshaorou
/
Tianshou
Watch
1
Star
0
Fork
0
You've already forked Tianshou
Code
Issues
Pull Requests
Packages
Projects
Releases
Wiki
Activity
Tianshou
/
tianshou
/
policy
History
ChenDRAG
e605bdea94
MuJoCo Benchmark - DDPG, TD3, SAC (
#305
)
...
Releasing Tianshou's SOTA benchmark of 9 out of 13 environments from the MuJoCo Gym task suite.
2021-03-07 19:21:02 +08:00
..
imitation
Remove reward_normaliztion option in offpolicy algorithm (
#298
)
2021-02-27 11:20:43 +08:00
modelbase
Remove reward_normaliztion option in offpolicy algorithm (
#298
)
2021-02-27 11:20:43 +08:00
modelfree
MuJoCo Benchmark - DDPG, TD3, SAC (
#305
)
2021-03-07 19:21:02 +08:00
multiagent
Step collector implementation (
#280
)
2021-02-19 10:33:49 +08:00
__init__.py
Step collector implementation (
#280
)
2021-02-19 10:33:49 +08:00
base.py
Remove reward_normaliztion option in offpolicy algorithm (
#298
)
2021-02-27 11:20:43 +08:00
random.py
Trainer refactor : some definition change (
#293
)
2021-02-21 13:06:02 +08:00