hongshaorou/Tianshou
Tianshou/examples
haoshengzou
983cd36074
Finished all PPO examples. Training is remarkably slower than in the version before Jan 13. More strangely, the gym example shows almost no improvement, but this problem comes after design work. I'll write actor-critic first.
2018-01-15 00:03:06 +08:00
| File | Last commit message | Date |
| --- | --- | --- |
| .gitignore | Fix memory growth and slowness caused by sess.run(tf.multinomial()). The PPO examples now work, with slight memory growth (1M/min) that still needs investigation. | 2018-01-03 20:32:05 +08:00 |
| dqn_example.py | Finished all PPO examples. Training is remarkably slower than in the version before Jan 13. More strangely, the gym example shows almost no improvement, but this problem comes after design work. I'll write actor-critic first. | 2018-01-15 00:03:06 +08:00 |
| ppo_cartpole_alternative.py | Finished all PPO examples. Training is remarkably slower than in the version before Jan 13. More strangely, the gym example shows almost no improvement, but this problem comes after design work. I'll write actor-critic first. | 2018-01-15 00:03:06 +08:00 |
| ppo_cartpole_gym.py | Finished all PPO examples. Training is remarkably slower than in the version before Jan 13. More strangely, the gym example shows almost no improvement, but this problem comes after design work. I'll write actor-critic first. | 2018-01-15 00:03:06 +08:00 |
| ppo_cartpole.py | Finished all PPO examples. Training is remarkably slower than in the version before Jan 13. More strangely, the gym example shows almost no improvement, but this problem comes after design work. I'll write actor-critic first. | 2018-01-15 00:03:06 +08:00 |