Anas BELFADIL
|
d976a5aa91
|
Fixed hardcoded reward_treshold (#548)
|
2022-03-04 10:35:39 +08:00 |
|
Chengqi Duan
|
23fbc3b712
|
upgrade gym version to >=0.21, fix related CI and update examples/atari (#534)
Co-authored-by: Jiayi Weng <trinkle23897@gmail.com>
|
2022-02-25 07:40:33 +08:00 |
|
Jiayi Weng
|
3d697aa4c6
|
make unit test faster (#522)
* test cache expert data in offline training
* faster cql test
* faster tests
* use dummy
* test ray dependency
|
2022-02-09 00:24:52 +08:00 |
|
Bernard Tan
|
bc53ead273
|
Implement CQLPolicy and offline_cql example (#506)
|
2022-01-16 05:30:21 +08:00 |
|