hongshaorou/Tianshou
Tianshou/tianshou/core/policy
haoshengzou 983cd36074 finished all ppo examples. Training is remarkably slower than the version before Jan 13. More strangely, in the gym example there's almost no improvement... but this problem comes behind design. I'll first write actor-critic.
2018-01-15 00:03:06 +08:00
__init__.py
fix imports to support both python2 and python3. move contents from __init__.py to leave for work after major development.
2017-12-23 15:36:10 +08:00
base.py
auto target network. ppo_cartpole.py run ok. but results is different from previous version even with the same random seed, still needs debugging.
2018-01-14 20:58:28 +08:00
dqn.py
fix memory growth and slowness caused by sess.run(tf.multinomial()), now ppo examples are working OK with slight memory growth (1M/min), which still needs research
2018-01-03 20:32:05 +08:00
stochastic.py
finished all ppo examples. Training is remarkably slower than the version before Jan 13. More strangely, in the gym example there's almost no improvement... but this problem comes behind design. I'll first write actor-critic.
2018-01-15 00:03:06 +08:00