Yi Su
|
8f7bc65ac7
|
Add discrete Critic Regularized Regression (#367)
|
2021-05-19 13:29:56 +08:00 |
|
Yi Su
|
b5c3ddabfa
|
Add discrete Conservative Q-Learning for offline RL (#359)
Co-authored-by: Yi Su <yi.su@antgroup.com>
Co-authored-by: Yi Su <yi.su@antfin.com>
|
2021-05-12 09:24:48 +08:00 |
|
ChenDRAG
|
1dcf65fe21
|
Add NPG policy (#344)
|
2021-04-21 09:52:15 +08:00 |
|
ChenDRAG
|
5057b5c89e
|
Add TRPO policy (#337)
|
2021-04-16 20:37:12 +08:00 |
|
n+e
|
454c86c469
|
fix venv seed, add TOC in docs, and split buffer.py into several files (#303)
Things changed in this PR:
- various docs update, add TOC
- split buffer into several files
- fix venv action_space randomness
|
2021-03-02 12:28:28 +08:00 |
|
Trinkle23897
|
0acd0d164c
|
test api doc
|
2020-04-02 09:07:04 +08:00 |
|