Trinkle23897
|
de556fd22d
|
item3 of #51
|
2020-05-27 11:02:23 +08:00 |
|
Trinkle23897
|
0eef0ca198
|
fix optional type syntax
|
2020-05-16 20:08:32 +08:00 |
|
Trinkle23897
|
9b26137cd2
|
add type annotation
|
2020-05-12 11:31:47 +08:00 |
|
Trinkle23897
|
04b091d975
|
fix max-grad-norm err in a2c (#46)
|
2020-05-04 12:33:04 +08:00 |
|
Trinkle23897
|
134f787e24
|
reserve 'policy' keyword in replay buffer
|
2020-04-29 17:48:48 +08:00 |
|
Trinkle23897
|
80d661907e
|
Multimodal obs (#38, #27, #25)
|
2020-04-28 20:56:02 +08:00 |
|
Trinkle23897
|
959955fa2a
|
fix historical issues
|
2020-04-26 16:13:51 +08:00 |
|
Trinkle23897
|
6bf1ea644d
|
fix ppo
|
2020-04-19 14:30:42 +08:00 |
|
Trinkle23897
|
680fc0ffbe
|
gae
|
2020-04-14 21:11:06 +08:00 |
|
Trinkle23897
|
3cc22b7c0c
|
__call__ -> forward
|
2020-04-10 10:47:16 +08:00 |
|
Trinkle23897
|
19f2cce294
|
seealso and change policy dir structure
|
2020-04-09 21:36:53 +08:00 |
|