This website requires JavaScript.
Explore
Help
Sign In
hongshaorou
/
Tianshou
Watch
1
Star
0
Fork
0
You've already forked Tianshou
Code
Issues
Pull Requests
Packages
Projects
Releases
Wiki
Activity
Tianshou
/
examples
History
haoshengzou
75e7f14051
towards ddpg
2018-03-28 18:47:41 +08:00
..
.gitignore
fix memory growth and slowness caused by sess.run(tf.multinomial()), now ppo examples are working OK with slight memory growth (1M/min), which still needs research
2018-01-03 20:32:05 +08:00
actor_critic_fail_cartpole.py
finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.
2018-01-18 17:38:52 +08:00
actor_critic_separate_cartpole.py
finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.
2018-01-18 17:38:52 +08:00
actor_critic.py
actor critic also works. fix some bugs in nstep_q_return. dqn still trains slow.
2018-03-11 15:07:41 +08:00
ddpg_example.py
towards ddpg
2018-03-28 18:47:41 +08:00
dqn.py
towards ddpg
2018-03-28 18:47:41 +08:00
ppo_cartpole_alternative.py
finish design and running of ppo and actor-critic. advantage estimation module is not complete yet.
2018-01-17 14:21:50 +08:00
ppo_cartpole_rllab.py
finish design and running of ppo and actor-critic. advantage estimation module is not complete yet.
2018-01-17 14:21:50 +08:00
ppo.py
actor critic also works. fix some bugs in nstep_q_return. dqn still trains slow.
2018-03-11 15:07:41 +08:00