Alex Nikulkov
|
92456cdb68
|
Add learning rate scheduler to BasePolicy (#598)
|
2022-04-17 23:52:30 +08:00 |
|
ChenDRAG
|
c25926dd8f
|
Formalize variable names (#509)
Co-authored-by: Jiayi Weng <trinkle23897@gmail.com>
|
2022-01-30 00:53:56 +08:00 |
|
n+e
|
fc251ab0b8
|
bump to v0.4.3 (#432)
* add makefile
* bump version
* add isort and yapf
* update contributing.md
* update PR template
* spelling check
|
2021-09-03 05:05:04 +08:00 |
|
Yi Su
|
b5c3ddabfa
|
Add discrete Conservative Q-Learning for offline RL (#359)
Co-authored-by: Yi Su <yi.su@antgroup.com>
Co-authored-by: Yi Su <yi.su@antfin.com>
|
2021-05-12 09:24:48 +08:00 |
|