Yi Su df35718992
Implement TD3+BC for offline RL (#660)
- implement TD3+BC for offline RL;
- fix a bug in the trainer where the test reward was not logged because self.env_step is not set in the offline setting;
2022-06-07 00:39:37 +08:00