Anas BELFADIL
|
53e6b0408d
|
Add BranchingDQN for large discrete action spaces (#618)
|
2022-05-15 21:40:32 +08:00 |
|
Alex Nikulkov
|
92456cdb68
|
Add learning rate scheduler to BasePolicy (#598)
|
2022-04-17 23:52:30 +08:00 |
|
ChenDRAG
|
c25926dd8f
|
Formalize variable names (#509)
Co-authored-by: Jiayi Weng <trinkle23897@gmail.com>
|
2022-01-30 00:53:56 +08:00 |
|
Yi Su
|
a59d96d041
|
Add Intrinsic Curiosity Module (#503)
|
2022-01-15 02:43:48 +08:00 |
|
n+e
|
fc251ab0b8
|
bump to v0.4.3 (#432)
* add makefile
* bump version
* add isort and yapf
* update contributing.md
* update PR template
* spelling check
|
2021-09-03 05:05:04 +08:00 |
|
ChenDRAG
|
dd4a01132c
|
Fix SAC loss explode (#333)
* change SAC action_bound_method to "clip" (tanh is hardcoded in forward)
* docstring update
* modelbase -> modelbased
|
2021-04-04 17:33:35 +08:00 |
|