Tianshou/examples at 983cd36074fa73b2b92a0ef2dfc0d1facdab6cd5

hongshaorou/Tianshou

haoshengzou 983cd36074 finished all ppo examples. Training is remarkably slower than the version before Jan 13. More strangely, in the gym example there's almost no improvement... but this problem comes behind design. I'll first write actor-critic.

2018-01-15 00:03:06 +08:00

.gitignore

fix memory growth and slowness caused by sess.run(tf.multinomial()), now ppo examples are working OK with slight memory growth (1M/min), which still needs research

2018-01-03 20:32:05 +08:00

dqn_example.py

finished all ppo examples. Training is remarkably slower than the version before Jan 13. More strangely, in the gym example there's almost no improvement... but this problem comes behind design. I'll first write actor-critic.

2018-01-15 00:03:06 +08:00

ppo_cartpole_alternative.py

finished all ppo examples. Training is remarkably slower than the version before Jan 13. More strangely, in the gym example there's almost no improvement... but this problem comes behind design. I'll first write actor-critic.

2018-01-15 00:03:06 +08:00

ppo_cartpole_gym.py

finished all ppo examples. Training is remarkably slower than the version before Jan 13. More strangely, in the gym example there's almost no improvement... but this problem comes behind design. I'll first write actor-critic.

2018-01-15 00:03:06 +08:00

ppo_cartpole.py

finished all ppo examples. Training is remarkably slower than the version before Jan 13. More strangely, in the gym example there's almost no improvement... but this problem comes behind design. I'll first write actor-critic.

2018-01-15 00:03:06 +08:00