Jialu Zhu a511cb4779
Add offline trainer and discrete BCQ algorithm (#263)
The result needs to be tuned after `done` issue fixed.

Co-authored-by: n+e <trinkle23897@gmail.com>
2021-01-20 18:13:04 +08:00
..
2020-11-09 16:43:55 +08:00
2020-08-19 15:00:24 +08:00
2020-07-22 14:42:08 +08:00
2020-03-28 22:01:23 +08:00
2020-03-29 15:18:33 +08:00
2020-09-02 13:03:32 +08:00