This website requires JavaScript.
Explore
Help
Sign In
hongshaorou
/
Tianshou
Watch
1
Star
0
Fork
0
You've already forked Tianshou
Code
Issues
Pull Requests
Packages
Projects
Releases
Wiki
Activity
Tianshou
/
tianshou
/
policy
History
ChenDRAG
6426a39796
ppo benchmark (
#330
)
2021-03-30 11:50:35 +08:00
..
imitation
Remove reward_normaliztion option in offpolicy algorithm (
#298
)
2021-02-27 11:20:43 +08:00
modelbase
Remove reward_normaliztion option in offpolicy algorithm (
#298
)
2021-02-27 11:20:43 +08:00
modelfree
ppo benchmark (
#330
)
2021-03-30 11:50:35 +08:00
multiagent
Step collector implementation (
#280
)
2021-02-19 10:33:49 +08:00
__init__.py
Step collector implementation (
#280
)
2021-02-19 10:33:49 +08:00
base.py
Refactor PG algorithm and change behavior of
compute_episodic_return
(
#319
)
2021-03-23 22:05:48 +08:00
random.py
Trainer refactor : some definition change (
#293
)
2021-02-21 13:06:02 +08:00