This website requires JavaScript.
Explore
Help
Sign In
hongshaorou
/
Tianshou
Watch
1
Star
0
Fork
0
You've already forked Tianshou
Code
Issues
Pull Requests
Packages
Projects
Releases
Wiki
Activity
Tianshou
/
examples
History
Dong Yan
2163d18728
fix the env -> self._env bug
2018-02-10 03:42:00 +08:00
..
.gitignore
fix memory growth and slowness caused by sess.run(tf.multinomial()), now ppo examples are working OK with slight memory growth (1M/min), which still needs research
2018-01-03 20:32:05 +08:00
actor_critic_cartpole.py
finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.
2018-01-18 17:38:52 +08:00
actor_critic_fail_cartpole.py
finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.
2018-01-18 17:38:52 +08:00
actor_critic_separate_cartpole.py
finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.
2018-01-18 17:38:52 +08:00
contrib_dqn_replay.py
finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.
2018-01-18 17:38:52 +08:00
ddpg_example.py
finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.
2018-01-18 17:38:52 +08:00
dqn_example.py
fix the env -> self._env bug
2018-02-10 03:42:00 +08:00
ppo_cartpole_alternative.py
finish design and running of ppo and actor-critic. advantage estimation module is not complete yet.
2018-01-17 14:21:50 +08:00
ppo_cartpole_rllab.py
finish design and running of ppo and actor-critic. advantage estimation module is not complete yet.
2018-01-17 14:21:50 +08:00
ppo_cartpole.py
fix the env -> self._env bug
2018-02-10 03:42:00 +08:00