Trinkle23897
7bf202f195
polish docs
2020-06-03 17:04:26 +08:00
Trinkle23897
f818a2467b
zh_CN docs
2020-06-02 08:51:14 +08:00
Trinkle23897
ba1b3e54eb
fix #69
2020-06-01 08:30:09 +08:00
Trinkle23897
de556fd22d
item3 of #51
2020-05-27 11:02:23 +08:00
Trinkle23897
80d661907e
Multimodal obs ( #38 , #27 , #25 )
2020-04-28 20:56:02 +08:00
Trinkle23897
6b96f124ae
fix pdqn
2020-04-26 15:11:20 +08:00
rocknamx
b23749463e
Prioritized DQN ( #30 )
...
* add sum_tree.py
* add prioritized replay buffer
* del sum_tree.py
* fix some format issues
* fix weight_update bug
* simply replace replaybuffer in test_dqn without weight update
* weight default set to 1
* fix sampling bug when buffer is not full
* rename parameter
* fix formula error, add accuracy check
* add PrioritizedDQN test
* add test_pdqn.py
* add update_weight() doc
* add ref of prio dqn in readme.md and index.rst
* restore test_dqn.py, fix args of test_pdqn.py
2020-04-26 12:05:58 +08:00
Trinkle23897
680fc0ffbe
gae
2020-04-14 21:11:06 +08:00
Trinkle23897
7b65d43394
vanilla imitation learning
2020-04-13 19:37:27 +08:00
Trinkle23897
befdfb07e8
polish docs
2020-04-11 19:29:46 +08:00
Trinkle23897
ecfcb9f295
fix docs
2020-04-10 11:16:33 +08:00
Trinkle23897
3cc22b7c0c
__call__ -> forward
2020-04-10 10:47:16 +08:00
Trinkle23897
e0809ff135
add policy docs ( #21 )
2020-04-06 19:36:59 +08:00
Trinkle23897
610390c132
add docs of collector and trainer ( #20 )
2020-04-05 18:34:45 +08:00
Trinkle23897
b6c9db6b0b
docs for env
2020-04-04 21:02:06 +08:00
Trinkle23897
7cb5146611
add docs of trick
2020-04-02 21:57:26 +08:00
Trinkle23897
0acd0d164c
test api doc
2020-04-02 09:07:04 +08:00
Trinkle23897
04208e6cce
update some tutorial
2020-03-30 22:52:25 +08:00
Trinkle23897
4e7df7616a
update dqn tutorial
2020-03-29 15:18:33 +08:00
Trinkle23897
d9e4b9d16f
upd doc
2020-03-29 10:22:03 +08:00
Trinkle23897
57735ce1b5
add logo and sphinx setup
2020-03-28 22:01:23 +08:00