Tianshou/examples at 764f7fb5f116c0915767214c06fd2f4887b1ce77 - Tianshou - Gitea: Git with a cup of tea

hongshaorou/Tianshou

History

Dong Yan 2163d18728 fix the env -> self._env bug

2018-02-10 03:42:00 +08:00

..

.gitignore

fix memory growth and slowness caused by sess.run(tf.multinomial()), now ppo examples are working OK with slight memory growth (1M/min), which still needs research

2018-01-03 20:32:05 +08:00

actor_critic_cartpole.py

finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.

2018-01-18 17:38:52 +08:00

actor_critic_fail_cartpole.py

finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.

2018-01-18 17:38:52 +08:00

actor_critic_separate_cartpole.py

finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.

2018-01-18 17:38:52 +08:00

contrib_dqn_replay.py

finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.

2018-01-18 17:38:52 +08:00

ddpg_example.py

finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.

2018-01-18 17:38:52 +08:00

dqn_example.py

fix the env -> self._env bug

2018-02-10 03:42:00 +08:00

ppo_cartpole_alternative.py

finish design and running of ppo and actor-critic. advantage estimation module is not complete yet.

2018-01-17 14:21:50 +08:00

ppo_cartpole_rllab.py

finish design and running of ppo and actor-critic. advantage estimation module is not complete yet.

2018-01-17 14:21:50 +08:00

ppo_cartpole.py

fix the env -> self._env bug

2018-02-10 03:42:00 +08:00