Tianshou/examples at ace59787eddc73f7e2643d17c087ca3c51bee8d6 - Tianshou - Gitea: Git with a cup of tea

hongshaorou/Tianshou

History

haoshengzou 75e7f14051 towards ddpg

2018-03-28 18:47:41 +08:00

..

.gitignore

fix memory growth and slowness caused by sess.run(tf.multinomial()), now ppo examples are working OK with slight memory growth (1M/min), which still needs research

2018-01-03 20:32:05 +08:00

actor_critic_fail_cartpole.py

finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.

2018-01-18 17:38:52 +08:00

actor_critic_separate_cartpole.py

finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.

2018-01-18 17:38:52 +08:00

actor_critic.py

actor critic also works. fix some bugs in nstep_q_return. dqn still trains slow.

2018-03-11 15:07:41 +08:00

ddpg_example.py

towards ddpg

2018-03-28 18:47:41 +08:00

dqn.py

towards ddpg

2018-03-28 18:47:41 +08:00

ppo_cartpole_alternative.py

finish design and running of ppo and actor-critic. advantage estimation module is not complete yet.

2018-01-17 14:21:50 +08:00

ppo_cartpole_rllab.py

finish design and running of ppo and actor-critic. advantage estimation module is not complete yet.

2018-01-17 14:21:50 +08:00

ppo.py

actor critic also works. fix some bugs in nstep_q_return. dqn still trains slow.

2018-03-11 15:07:41 +08:00