Tianshou/docs/api/tianshou.policy.rst
Bernard Tan 5c5a3db94e
Implement BCQPolicy and offline_bcq example (#480)
This PR implements BCQPolicy, which could be used to train an offline agent in the environment of continuous action space. An experimental result 'halfcheetah-expert-v1' is provided, which is a d4rl environment (for Offline Reinforcement Learning).
Example usage is in the examples/offline/offline_bcq.py.
2021-11-22 22:21:02 +08:00

147 lines
2.5 KiB
ReStructuredText

tianshou.policy
===============
Base
----
.. autoclass:: tianshou.policy.BasePolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.RandomPolicy
:members:
:undoc-members:
:show-inheritance:
Model-free
----------
DQN Family
~~~~~~~~~~
.. autoclass:: tianshou.policy.DQNPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.C51Policy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.RainbowPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.QRDQNPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.IQNPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.FQFPolicy
:members:
:undoc-members:
:show-inheritance:
On-policy
~~~~~~~~~
.. autoclass:: tianshou.policy.PGPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.NPGPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.A2CPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.TRPOPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.PPOPolicy
:members:
:undoc-members:
:show-inheritance:
Off-policy
~~~~~~~~~~
.. autoclass:: tianshou.policy.DDPGPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.TD3Policy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.SACPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.DiscreteSACPolicy
:members:
:undoc-members:
:show-inheritance:
Imitation
---------
.. autoclass:: tianshou.policy.ImitationPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.BCQPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.DiscreteBCQPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.DiscreteCQLPolicy
:members:
:undoc-members:
:show-inheritance:
.. autoclass:: tianshou.policy.DiscreteCRRPolicy
:members:
:undoc-members:
:show-inheritance:
Model-based
-----------
.. autoclass:: tianshou.policy.PSRLPolicy
:members:
:undoc-members:
:show-inheritance:
Multi-agent
-----------
.. autoclass:: tianshou.policy.MultiAgentPolicyManager
:members:
:undoc-members:
:show-inheritance: