Logo
Explore Help
Sign In
hongshaorou/Tianshou
1
0
Fork 0
You've already forked Tianshou
Code Issues Pull Requests Packages Projects Releases Wiki Activity
170 Commits 3 Branches 28 Tags
Commit Graph

10 Commits

Author SHA1 Message Date
haoshengzou
2addef41d2 fix imports to support both python2 and python3. move contents from __init__.py to leave for work after major development. 2017-12-23 15:36:10 +08:00
Haosheng Zou
6611d948dd add value_function (critic). value_function and policy not finished yet. 2017-12-22 00:22:23 +08:00
宋世虹
d220f7f2a8 add comments and todos 2017-12-17 13:28:21 +08:00
宋世虹
3624cc9036 finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00
rtz19970824
e5bf7a9270 implement dqn loss and dpg loss, add TODO for separate actor and critic 2017-12-15 14:24:08 +08:00
Haosheng Zou
039c8140e2 add dqn.py to write 2017-12-13 22:43:45 +08:00
Haosheng Zou
7ab211b63c preliminary design of dqn_example, dqn interface. identify the assign of networks 2017-12-13 20:47:45 +08:00
rtz19970824
0c4a83f3eb vanilla policy gradient 2017-12-11 13:37:27 +08:00
haosheng
a00b930c2c fix naming and comments of coding style, delete .json 2017-12-10 17:23:13 +08:00
haosheng
ff4306ddb9 model-free rl first commit, with ppo_example.py in examples/ and task delegations in ppo_example.py and READMEs 2017-12-08 21:09:23 +08:00
Powered by Gitea Version: 23.8.0 Page: 96ms Template: 4ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API