Trinkle23897
|
134f787e24
|
reserve 'policy' keyword in replay buffer
|
2020-04-29 17:48:48 +08:00 |
|
Trinkle23897
|
6bf1ea644d
|
fix ppo
|
2020-04-19 14:30:42 +08:00 |
|
Trinkle23897
|
680fc0ffbe
|
gae
|
2020-04-14 21:11:06 +08:00 |
|
Trinkle23897
|
ecfcb9f295
|
fix docs
|
2020-04-10 11:16:33 +08:00 |
|
Trinkle23897
|
3cc22b7c0c
|
__call__ -> forward
|
2020-04-10 10:47:16 +08:00 |
|
Trinkle23897
|
e0809ff135
|
add policy docs (#21)
|
2020-04-06 19:36:59 +08:00 |
|
Trinkle23897
|
c87fe3c18c
|
add trainer
|
2020-03-19 17:23:46 +08:00 |
|
Trinkle23897
|
64bab0b6a0
|
ddpg
|
2020-03-18 21:45:41 +08:00 |
|
Trinkle23897
|
39de63592f
|
finish pg
|
2020-03-17 11:37:31 +08:00 |
|
Trinkle23897
|
8b0b970c9b
|
add speed stat
|
2020-03-16 15:04:58 +08:00 |
|
Trinkle23897
|
5983c6b33d
|
finish dqn
|
2020-03-15 17:41:00 +08:00 |
|
Trinkle23897
|
c804662457
|
add cache buf in collector
|
2020-03-14 21:48:31 +08:00 |
|
Trinkle23897
|
543e57cdbd
|
clear
|
2020-03-13 21:47:17 +08:00 |
|
Trinkle23897
|
f16e05c0e7
|
maybe finished collector?
|
2020-03-13 17:49:22 +08:00 |
|
Trinkle23897
|
f58c1397c6
|
half of collector
|
2020-03-12 22:20:33 +08:00 |
|