Yi Su df35718992
Implement TD3+BC for offline RL (#660)
- implement TD3+BC for offline RL;
- fix a bug in the trainer where the test reward was not logged because self.env_step was not set in the offline setting;
2022-06-07 00:39:37 +08:00