hongshaorou/Tianshou
Tianshou/tianshou
haoshengzou
d599506dc9
fixed the bugs on Jan 14, which gives inferior or even no improvement. mistook group_ndims. policy will soon need refactoring.
2018-01-15 16:32:30 +08:00
| Name | Last commit message | Last commit date |
| --- | --- | --- |
| agent | architecture design patch two | 2017-11-06 15:24:34 +08:00 |
| core | fixed the bugs on Jan 14, which gives inferior or even no improvement. mistook group_ndims. policy will soon need refactoring. | 2018-01-15 16:32:30 +08:00 |
| data | finished all ppo examples. Training is remarkably slower than the version before Jan 13. More strangely, in the gym example there's almost no improvement... but this problem comes behind design. I'll first write actor-critic. | 2018-01-15 00:03:06 +08:00 |
| environment | architecture design patch two | 2017-11-06 15:24:34 +08:00 |
| simulator | architecture design patch two | 2017-11-06 15:24:34 +08:00 |
| __init__.py | mcts update | 2017-11-17 15:09:07 +08:00 |