Logo
Explore Help
Sign In
hongshaorou/Tianshou
1
0
Fork 0
You've already forked Tianshou
Code Issues Pull Requests Packages Projects Releases Wiki Activity
361 Commits 3 Branches 28 Tags
Commit Graph

7 Commits

Author SHA1 Message Date
haoshengzou
e68dcd3c64 working on off-policy test. other parts of dqn_replay is runnable, but performance not tested. 2018-03-08 16:51:12 +08:00
haoshengzou
f32e1d9c12 finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet. 2018-01-18 17:38:52 +08:00
haoshengzou
8fbde8283f finish dqn example. advantage estimation module is not complete yet. 2018-01-18 12:19:48 +08:00
haoshengzou
ed25bf7586 fixed the bugs on Jan 14, which gives inferior or even no improvement. mistook group_ndims. policy will soon need refactoring. 2018-01-17 11:55:51 +08:00
haoshengzou
b33a141373 towards policy/value refactor 2017-12-23 17:25:16 +08:00
haoshengzou
2addef41d2 fix imports to support both python2 and python3. move contents from __init__.py to leave for work after major development. 2017-12-23 15:36:10 +08:00
Haosheng Zou
6611d948dd add value_function (critic). value_function and policy not finished yet. 2017-12-22 00:22:23 +08:00
Powered by Gitea Version: 23.8.0 Page: 54ms Template: 3ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API