Tianshou

hongshaorou/Tianshou

Fork 0

Commit Graph

Author	SHA1	Message	Date
Chengqi Duan	23fbc3b712	upgrade gym version to >=0.21, fix related CI and update examples/atari (#534 ) Co-authored-by: Jiayi Weng <trinkle23897@gmail.com>	2022-02-25 07:40:33 +08:00
Jiayi Weng	3d697aa4c6	make unit test faster (#522 ) * test cache expert data in offline training * faster cql test * faster tests * use dummy * test ray dependency	2022-02-09 00:24:52 +08:00
Bernard Tan	5c5a3db94e	Implement BCQPolicy and offline_bcq example (#480 ) This PR implements BCQPolicy, which could be used to train an offline agent in the environment of continuous action space. An experimental result 'halfcheetah-expert-v1' is provided, which is a d4rl environment (for Offline Reinforcement Learning). Example usage is in the examples/offline/offline_bcq.py.	2021-11-22 22:21:02 +08:00

Author

SHA1

Message

Date

Chengqi Duan

23fbc3b712

upgrade gym version to >=0.21, fix related CI and update examples/atari (#534 )

Co-authored-by: Jiayi Weng <trinkle23897@gmail.com>

2022-02-25 07:40:33 +08:00

Jiayi Weng

3d697aa4c6

make unit test faster (#522 )

* test cache expert data in offline training

* faster cql test

* faster tests

* use dummy

* test ray dependency

2022-02-09 00:24:52 +08:00

Bernard Tan

5c5a3db94e

Implement BCQPolicy and offline_bcq example (#480 )

This PR implements BCQPolicy, which could be used to train an offline agent in the environment of continuous action space. An experimental result 'halfcheetah-expert-v1' is provided, which is a d4rl environment (for Offline Reinforcement Learning).
Example usage is in the examples/offline/offline_bcq.py.

2021-11-22 22:21:02 +08:00

3 Commits