Tianshou

hongshaorou/Tianshou

Commit Graph

Author	SHA1	Message	Date
Bernard Tan	5c5a3db94e	Implement BCQPolicy and offline_bcq example (#480). This PR implements BCQPolicy, which can be used to train an offline agent in environments with continuous action spaces. An experimental result on 'halfcheetah-expert-v1', a d4rl environment (for offline reinforcement learning), is provided. Example usage is in examples/offline/offline_bcq.py.	2021-11-22 22:21:02 +08:00

1 Commit