Tianshou

History

Implement BCQPolicy and offline_bcq example (#480 )

This PR implements BCQPolicy, which could be used to train an offline agent in the environment of continuous action space. An experimental result 'halfcheetah-expert-v1' is provided, which is a d4rl environment (for Offline Reinforcement Learning).
Example usage is in the examples/offline/offline_bcq.py.

2021-11-22 22:21:02 +08:00

tianshou.data.rst

fix venv seed, add TOC in docs, and split buffer.py into several files (#303 )

2021-03-02 12:28:28 +08:00

tianshou.env.rst

fix venv seed, add TOC in docs, and split buffer.py into several files (#303 )

2021-03-02 12:28:28 +08:00

tianshou.exploration.rst

test api doc

2020-04-02 09:07:04 +08:00

tianshou.policy.rst

Implement BCQPolicy and offline_bcq example (#480 )