{ "cells": [ { "cell_type": "markdown", "metadata": { "id": "PNM9wqstBSY_" }, "source": [ "# Policy\n", "In reinforcement learning, the agent interacts with environments to improve itself. In this tutorial we will concentrate on the agent part. In Tianshou, both the agent and the core DRL algorithm are implemented in the Policy module. Tianshou provides more than 20 Policy modules, each representing one DRL algorithm. See supported algorithms [here](https://github.com/thu-ml/tianshou).\n", "\n", "