Jialu Zhu a511cb4779
Add offline trainer and discrete BCQ algorithm (#263)
The result needs to be tuned after `done` issue fixed.

Co-authored-by: n+e <trinkle23897@gmail.com>
2021-01-20 18:13:04 +08:00
..
2021-01-20 16:54:13 +08:00
2021-01-20 16:54:13 +08:00
2021-01-20 16:54:13 +08:00
2020-09-13 19:31:50 +08:00
2020-03-11 17:28:51 +08:00