Tianshou/notebooks/L5_Collector.ipynb

365 lines
88 KiB
Plaintext

{
"nbformat": 4,
"nbformat_minor": 0,
"metadata": {
"colab": {
"provenance": []
},
"kernelspec": {
"name": "python3",
"display_name": "Python 3"
},
"language_info": {
"name": "python"
}
},
"cells": [
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "AKoInktmr-3t"
},
"outputs": [],
"source": [
"# Remember to install tianshou first\n",
"!pip install tianshou==0.4.8\n",
"!pip install gym"
]
},
{
"cell_type": "markdown",
"source": [
"# Overview\n",
"From its literal meaning, we can easily know that the Collector in Tianshou is used to collect training data. More specificly, the Collector controls the interaction between Policy (agent) and the environment. It also helps save the interaction data into the ReplayBuffer and returns episode statistics.\n",
"\n",
"<center>\n",
"<img src=\"data:image/svg+xml;base64,<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN" "http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
<svg xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink" version="1.1" width="611px" height="442px" viewBox="-0.5 -0.5 611 442" content="&lt;mxfile host=&quot;Electron&quot; modified=&quot;2022-04-17T13:33:09.077Z&quot; agent=&quot;5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) draw.io/14.5.1 Chrome/89.0.4389.82 Electron/12.0.1 Safari/537.36&quot; version=&quot;14.5.1&quot; etag=&quot;IopqcE6Ce96CbVsdgW-z&quot; type=&quot;device&quot;&gt;&lt;diagram id=&quot;ES4xs9HjQeWZJXcnOEPi&quot;&gt;7V1bs5s4Ev41rtp9iIv75THnJJl9SLZObaZ2dp6mOBjbzGDkBXwu++tXMpKNQNyMEHCsVCUxQmDT/XWr1Wp9rPTHw9sviXfc/wCbIFppyuZtpX9ZafCPocP/UMt73uJodt6wS8JN3qReG36G/wtwo4JbT+EmSPM23JQBEGXhMaWu9kEcB35GtXlJAl7pa7cgor/16O0Cqgdq+Ol7UVDp9lu4yfalp0Dt/wjC3Z58s2q5+ZmDRzrjG6d7bwNeC03615X+mACQ5Z8Ob49BhIRH5JJf963m7OWHJUGcdblAyy948aITfjb8u7J38rC7BJyOK/0B9wySLHhjSdl7jspCu/4K9fJsEBQBOARZ8g674Bt9chR8DQaEZuLj16t4LaL9fUG0BrnQwyrdXW5+fWr4AT84Wwh6uxCgDOJNgPqrUBSv+zALfh49H519hRiHbfvsEOHTaZaAvy7IQC1bEGffvEMYoed7BKckDBL4Bf8MXvFJjHH4iPqDF4W7GB74UHqwG2xIfHzebNJDUd4aW95CxGlMK86K+PwIxAFGp8JVnPgCvQRepQpew2FIW3U4SNu8c2kbXYWtcRC2VRH25x362bOSOAcZm7SMVb0q5Ivgi0LmIWO7HdDo2UI4HH/3noPoCaRhFgIkkmeQZeAARUA6fMayykBZ5nvviG52eNuhEGW9jcCrv/eSbJ2eDocw3v2xPcV+ftue6vDSYx5zbMM3BIGHTZgE+FZfUnBCAnxIQObhpk+uMo4SNZvWoq5qFS2yBgGVgxKdis6CDQyh8CFIsj3YgdiLvl5bH65GgwSQ/hVk/h4f/Hk6HMnFMRQUbLre4ztA2j2r9c8gy96xu/FOGaCVHryF2X/QHdcmPvq9cObLG/6y88E7OYjhkxcuQoe/F89dLzsfUdc9BUkIRYdgkT9TBhH2GQWg1+c4t30LkSSvfUg4cL7RhlzhR16ahn7eiC/BXQoXwOt3QXb5auR3IJ7x3RuhzMYgUlwzAqGe4Z38gPKR+c8omHQVp0kQQSN4oe/OQt35UigF773Q4QjCOEsLd35CDVf4OyUfZpYi4ebuZJip667ZZlN/+CH/vVdruTx4JwNy270gCczLaoPe5pSkUK7/CtIcFj3G4S7BfMv4rJUkyRihiTJov6NzcDzEed0iuEFCcdlCaX9og8NDq+0PLTQkYQ55TFjCQfKLlxI/v0nA8VfiOG4eFlXlVl3wACBrSm1FGZYhSjd42FVa/z2hef4DLc5LM/y0w/+fL0+PXkzplHRE9/2Uy/Qz7KDax7f6uzyThn8H/sNpu0Xfm5+BT/Zc7g3b8q+tNOcPQ5pLYGuItIJk7cX+HiQ1KCRYioJtxhsAbyvKR7zThwV4OMbadSzHVXTXsEzNcKtggT0sB452DjyvGKrdMHx1hk6HRASfbEwv+TCC/4ul0HEjD1fWIXsgw/8iXurVWBqIDXHhv9ohKzHKKNwiEt2piMCyx0JyNVlAe3LapeLBod27qyzvDj05Olfrn2u8eW2MoLTHCBXkl63uEG4254ndDZOONuSbrSO8aTL0ykOtrPxEzwG6QYWaGF2JU8klKVqxvNE05NyooS0WCNIRPOUd8oFWhyE6FMpFSPE5SKPPoivO/zJVjc9B8WWfsCrQSayNGiToHxQJjOngaEhgzKIftdWDTf1blidSE2s69AgiFLWSxM02jKJSU3eXyNISrcdi2PAapBkdNQwIGmq0Q/IaKM9FRQ2M6M+oqouocIi6NNbcfXmGG39Qw3XEGa7GSGhIw52v4TJyHlJd81WXXtHFnFZnVte1metKTcvqjLoqrs1cl2puXp25rrUUF2jUVeMCDX746+pM/l1kdQYdCV2dIQml4vJMPi2t4rJ23aWy9FFKDxk2jUj8hPk1V1D2Xb4pJS/s4r3aOht2yRyGrcWQihZpL/doL8409nILTk2J07vFqVaT05+5X3cmdezWrA2GKlNRpMFwNhhtkQZjOVMaTHUWNyeDkSPMqJEQVn6xskuvWW+buRENmU50fAJuJjfrUkr1FpNTeJvcGKWU8zA5RjFlnhXqYXJ9bUVV+0y9y715w9+V8L9f+DNGHOHwnxb/pNBK4v8e8e8Mw3/nDZDqnEF223YNCbJe8+CJnWzLrHZsJ6vNGf/SyQpInE6M/5Y06Nj471B4PmrRNbQOyqJ47EGpFGJzrbEWX1NNNgp1qam+vb58+P4vzaALG1SrWtlAdktQe4ctDjLqUXc+pYxIZQeZYogUUX1ZenhzTbrCKmgO4pd1mgXHv/29vSw9rC2Sw1VCXUuDCv6myKuhMhzUpUio4joq0Gjn78A/WL3Vz+jN5UKfHHq46AQXLl5n1hn/XqERPBga4VhiI5wyM0wZWVyjvQ7hE3Gmheipx5pAv+3elRhIL3EEVbhq+BVS6KwdBDMcRjRLo0SiK6a4cYS1Q53HsPHsIW/RcaPpvYwONUVIhLtgosGB1KyOH0s8n3cqr73NRkYTw/EyWTRhdCAqmINjNcwSuweDj24sx0qmlNKxCjAUo5mYcDI7YSVkxnSsqXc4RoH0rcMhM51vnWeV+pU/y1Gc4jRFXSuK3jJRWfr8jNM0leN8jWSeihM2oyb3MHDCZpbKqPQyvU7+uyoTtsqNDKWUUTS7zfz6JuUNt0yY15yVL/dX3WbOMEOxmvoPTuMb1ezn03sGEjTqKz/A5gSBWXYRH4yp0q7GacZYRJXGx6seV9aaffHJ2EkbF68ttiRWtPPG3qh6J/VmZ8uo3hvsazuj80Mnbu9pnbstyrgNmyYjEBjK09kZm/UMLZynN0cQhf77+pgAP0jTP7bxoqc4YiY4zQOubtM5Z1XvlnPmMsWpzzmPg5so8BIJmeGQKcXJDMbFsSBjslLUdRnHaYlzLVpKYnlzTUkhW9SUOSWFrNmaAZ4PhezX+OXj8cfWaZ+94YvFH+uuTdU1YbRvuIquuwyOyRH4Y01R/LH95COUP9bsUCUn+WOLeKlX43T8seZU/LEtIhHJH2uK5I89u/H74Y81p+OPNcfij81VqInRlTiVkGhYHPmguXT+2BwJi+WPbUGCQP5YU/LHdtbODHgNLdZcf3mGu1j+2BbDFcgfa0n+2EUZLiPhIdU1X3VJ/tj7YIEyGdtlTckf27cux5pnZZ60FyH2shz+WEvyx94vTi3JH9vfYD5eBaA0mM4Goy3SYCblj7VmXZQoR5hxIyHGXnFb8seOzB9rSf7YBdb8cp2s0yZn3hV/rCX5Y+8Y/owRRzj8p8U/Kb2V+L9H/DP4Y/vgv+uCAClenCfIJH+sgHnwxE52Wv5YsiVmnviXTlZA4nRi/E/LH2tL/thmxNQXs2FBCaiptkdchfQSX6CPatwPvHZMage7snYt+wN6r/Ku4A7eixhlyy7gmgX+fmwgvRNsJcatEvfGcBc177XNtarpxSB1rZyrC+VAzRqoWQ7gNvgz0mN1Gelx4W+7dM5Ab2aSKbGe9OzO3bZYO1VKxsZg9VP6jKkcOLd1WsZkU54ASj+bw56PTpR+jeXCd7ET3a4ZvcheTauT0rnEW6JJLrYgefWSZROkTgsSEahgUVi0+0rxbyhwdXrMIFsaizJyx3GXZKMAdwZU8JwGyctZpqkkQqVNwm2Ew1Sklg6LlkK+rWLmaJmMA9XRRMHFB1EEPSxI1vjTonEjBjXNIw4cYyjUqLZA3PRgzphuBlOaxqkMssuxZjAOi1aDxwzG88ujsZzDOM05Y4FzGIfFwyH96cxxIgIY9eQknXHA5Jh6JDhYSHw+h5GznA0USK/n1Ge2huHg6ZzTkCDo/soSGgQdX1jCBQM93pY1ZnRUMgPGhmgWE7hu8BBBh6zOmOyJ5cXA6mJ2NxJLanSp2bgoQp4uhwwQ0618B7tdsJSxRdRsvk7TZDavll/oKo7x1x2xslBs0ULNeq2tC1uxbSS477Wci46mrlxgFC44NVkp7vzlbn2aaZh7+jXxwngx/mkOYU87F+tormlEFpRZuCbVtqRrusU1mVO6poW8Jb787iQWO9hYyUy3Pqs1LJmZInmm0JEuJZ8pKsJ0a+qhJ15ddG8rXJrcWFyBtjJW6ZK0lTpbqSlOmdpWbnsnumhLUUuv2LO7zVW5mIrot9OcjhsvW/ZLWwWZVHsqayyzUZUeL6AZZBgtr6W1bDqLY1nVQUS3WAk9LlLo8eLvYVJQmt9NwZohMt/kwuexq4mC9BiFyPSh6Xp11sig7cWvM6k1sgS9EadgF7jgHN7VfFiZX9Cd4GQxvU6SOhkO1lvnRYeKaFmA4vHuTVWRW856K61o6aMhvsf8cxxDF2PYHV5QIl8zQ2FiIjxWp3gQXT6Mm2L4dzl+uHkC3eqHVdau0hscMTxMAIpFL+d+Qcj8ATYB6vF/&lt;/diagram&gt;&lt;/mxfile&gt;" style="background-color: rgb(255, 255, 255);"><defs/><g><rect x="0" y="0" width="610" height="400" rx="20" ry="20" fill="#ffffff" stroke="#000000" pointer-events="all"/><rect x="30" y="200" width="480" height="180" rx="9" ry="9" fill="#ffffff" stroke="#000000" pointer-events="all"/><rect x="30" y="40" width="480" height="120" rx="6" ry="6" fill="#ffffff" stroke="#000000" pointer-events="all"/><rect x="50" y="130" width="240" height="20" rx="3" ry="3" fill="#ffffff" stroke="#000000" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 238px; height: 1px; padding-top: 140px; margin-left: 51px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">Agent</div></div></div></foreignObject><text x="170" y="144" fill="#000000" font-family="Courier New" font-size="12px" text-anchor="middle">Agent</text></switch></g><ellipse cx="275" cy="317" rx="5" ry="5" fill="#ffffff" stroke="#000000" pointer-events="all"/><path d="M 270 317 L 280 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 275 312 L 275 322" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 170 150 L 80 150 L 80 230 L 275 230 L 275 309.38" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 275 310.88 L 274 308.88 L 275 309.38 L 276 308.88 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="330" y="230" width="130" height="150" rx="19.5" ry="19.5" fill="#ffffff" stroke="#000000" transform="rotate(90,395,305)" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe flex-start; width: 83px; height: 1px; padding-top: 252px; margin-left: 332px;"><div style="box-sizing: border-box; font-size: 0; text-align: left; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; "><font face="Courier New"><span style="font-size: 17px"><b>VecBuffer</b></span></font></div></div></div></foreignObject><text x="332" y="256" fill="#000000" font-family="Helvetica" font-size="12px">VecBuffer</text></switch></g><ellipse cx="455" cy="317" rx="5" ry="5" fill="#ffffff" stroke="#000000" pointer-events="all"/><path d="M 450 317 L 460 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 455 312 L 455 322" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="368" y="270" width="55" height="14" fill="#ffffff" stroke="#000000" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 53px; height: 1px; padding-top: 277px; margin-left: 369px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; "><span><font style="font-size: 11px">Buf 1</font></span></div></div></div></foreignObject><text x="395" y="281" fill="#000000" font-family="Courier New" font-size="12px" text-anchor="middle">Buf 1</text></switch></g><rect x="368" y="290" width="55" height="14" fill="#ffffff" stroke="#000000" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 53px; height: 1px; padding-top: 297px; margin-left: 369px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; "><span style="font-size: 11px">Buf 2</span></div></div></div></foreignObject><text x="395" y="301" fill="#000000" font-family="Courier New" font-size="12px" text-anchor="middle">Buf 2</text></switch></g><rect x="368" y="310" width="55" height="14" fill="#ffffff" stroke="#000000" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 53px; height: 1px; padding-top: 317px; margin-left: 369px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; "><span style="font-family: &quot;courier new&quot; ; font-size: 11px ; text-align: center">Buf 3</span></div></div></div></foreignObject><text x="395" y="321" fill="#000000" font-family="Courier New" font-size="12px" text-anchor="middle">Buf 3</text></switch></g><rect x="388.5" y="310" width="14" height="55" fill="none" stroke="none" transform="rotate(270,395.5,337.5)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(90 395.5000000000002 337.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 12px; height: 1px; padding-top: 338px; margin-left: 390px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">···</div></div></div></foreignObject><text x="396" y="341" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">···</text></switch></g><rect x="368" y="350" width="55" height="14" fill="#ffffff" stroke="#000000" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 53px; height: 1px; padding-top: 357px; margin-left: 369px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; "><span style="font-family: &quot;courier new&quot; ; font-size: 11px ; text-align: center">Buf n</span></div></div></div></foreignObject><text x="395" y="361" fill="#000000" font-family="Courier New" font-size="12px" text-anchor="middle">Buf n</text></switch></g><rect x="388.5" y="310" width="14" height="55" fill="none" stroke="none" transform="rotate(270,395.5,337.5)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(90 395.5000000000002 337.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 12px; height: 1px; padding-top: 338px; margin-left: 390px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">···</div></div></div></foreignObject><text x="396" y="341" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">···</text></switch></g><rect x="388.5" y="310" width="14" height="55" fill="none" stroke="none" transform="rotate(270,395.5,337.5)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(90 395.5000000000002 337.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 12px; height: 1px; padding-top: 338px; margin-left: 390px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">···</div></div></div></foreignObject><text x="396" y="341" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">···</text></switch></g><path d="M 365.38 277 L 350 277 L 350 317 L 340 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 366.88 277 L 364.88 278 L 365.38 277 L 364.88 276 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 365.38 317 L 340 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 366.88 317 L 364.88 318 L 365.38 317 L 364.88 316 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 365.38 357 L 350 357 L 350 317 L 340 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 366.88 357 L 364.88 358 L 365.38 357 L 364.88 356 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 365.38 337.51 L 350 337.57 L 350 317 L 340 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 366.88 337.5 L 364.89 338.51 L 365.38 337.51 L 364.88 336.51 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 365.38 297 L 350 297 L 350 317 L 340 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 366.88 297 L 364.88 298 L 365.38 297 L 364.88 296 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 423 277 L 440 277 L 440 317 L 448.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 448.88 317 L 447.88 317.5 L 448.13 317 L 447.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 423 297 L 440 297 L 440 317 L 448.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 448.88 317 L 447.88 317.5 L 448.13 317 L 447.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 423 317 L 448.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 448.88 317 L 447.88 317.5 L 448.13 317 L 447.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 423 337.5 L 440 337.57 L 440 317 L 448.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 448.88 317 L 447.88 317.5 L 448.13 317 L 447.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 423 357 L 440 357 L 440 317 L 448.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 448.88 317 L 447.88 317.5 L 448.13 317 L 447.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 335.2 312.1 L 339.9 316.8 C 339.96 316.84 340 316.92 340 317 C 340 317.08 339.96 317.16 339.9 317.2 L 335.2 321.9 C 335.16 321.96 335.08 322 335 322 C 334.92 322 334.84 321.96 334.8 321.9 L 330.1 317.2 C 330.04 317.16 330 317.08 330 317 C 330 316.92 330.04 316.84 330.1 316.8 L 334.8 312.1 C 334.84 312.04 334.92 312 335 312 C 335.08 312 335.16 312.04 335.2 312.1 Z" fill="#ffffff" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 330 317 L 340 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="6" y="268" width="71" height="10" fill="none" stroke="none" transform="rotate(-90,41.5,273)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 41.5 273)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 273px; margin-left: 42px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><i><font style="font-size: 10px">env.step()</font></i></div></div></div></foreignObject><text x="42" y="276" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">env.step()</text></switch></g><path d="M 280 317 L 328.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 328.88 317 L 327.88 317.5 L 328.13 317 L 327.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="270" y="313" width="71" height="10" fill="none" stroke="none" transform="rotate(-90,305.5,318)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 305.4999999999998 318)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 318px; margin-left: 305px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><font style="font-size: 10px">batch</font></div></div></div></foreignObject><text x="305" y="321" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">batch</text></switch></g><rect x="254" y="313" width="71" height="10" fill="none" stroke="none" transform="rotate(-90,289.5,318)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 289.4999999999998 318)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 318px; margin-left: 289px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><i><font style="font-size: 10px">buffer.add()</font></i></div></div></div></foreignObject><text x="289" y="321" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">buffer.add()</text></switch></g><rect x="463" y="258" width="71" height="10" fill="none" stroke="none" transform="rotate(-90,498.5,263)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 498.4999999999998 263)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 263px; margin-left: 498px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><font style="font-size: 10px">batch</font></div></div></div></foreignObject><text x="498" y="266" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">batch</text></switch></g><rect x="447" y="258" width="71" height="10" fill="none" stroke="none" transform="rotate(-90,482.5,263)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 482.5 263)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 263px; margin-left: 483px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><i><font style="font-size: 10px">buffer.sample()</font></i></div></div></div></foreignObject><text x="483" y="266" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">buffer.sample...</text></switch></g><path d="M 460 317 L 490 317 L 490 190 L 405.43 190 L 405.52 91.93" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 405.52 91.18 L 406.02 92.18 L 405.52 91.93 L 405.02 92.18 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="50" y="70" width="440" height="20" rx="3" ry="3" fill="#ffffff" stroke="#000000" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 438px; height: 1px; padding-top: 80px; margin-left: 51px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">Pytorch Module</div></div></div></foreignObject><text x="270" y="84" fill="#000000" font-family="Courier New" font-size="12px" text-anchor="middle">Pytorch Module</text></switch></g><path d="M 170.01 127.38 L 170.11 92.83" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 170 128.88 L 169.01 126.88 L 170.01 127.38 L 171.01 126.89 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 170.12 92.08 L 170.61 93.08 L 170.11 92.83 L 169.61 93.08 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 200 317 L 268.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 268.88 317 L 267.88 317.5 L 268.13 317 L 267.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="372" y="135" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 140px; margin-left: 408px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><i><font style="font-size: 10px">policy.process_fn()</font></i></div></div></div></foreignObject><text x="408" y="143" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">policy.proces...</text></switch></g><rect x="370" y="110" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 115px; margin-left: 405px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><i><font style="font-size: 10px">policy.learn()</font></i></div></div></div></foreignObject><text x="405" y="118" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">policy.learn()</text></switch></g><rect x="70" y="230" width="130" height="150" rx="19.5" ry="19.5" fill="#ffffff" stroke="#000000" transform="rotate(90,135,305)" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe flex-start; width: 88px; height: 1px; padding-top: 252px; margin-left: 72px;"><div style="box-sizing: border-box; font-size: 0; text-align: left; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; "><font face="Courier New"><span style="font-size: 17px"><b>VecEnv</b></span></font></div></div></div></foreignObject><text x="72" y="256" fill="#000000" font-family="Helvetica" font-size="12px">VecEnv</text></switch></g><ellipse cx="195" cy="317" rx="5" ry="5" fill="#ffffff" stroke="#000000" pointer-events="all"/><path d="M 190 317 L 200 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 195 312 L 195 322" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="108" y="270" width="55" height="14" fill="#ffffff" stroke="#000000" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 53px; height: 1px; padding-top: 277px; margin-left: 109px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; "><span><font style="font-size: 11px">Env 1</font></span></div></div></div></foreignObject><text x="135" y="281" fill="#000000" font-family="Courier New" font-size="12px" text-anchor="middle">Env 1</text></switch></g><rect x="108" y="290" width="55" height="14" fill="#ffffff" stroke="#000000" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 53px; height: 1px; padding-top: 297px; margin-left: 109px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; "><span style="font-size: 11px">Env 2</span></div></div></div></foreignObject><text x="135" y="301" fill="#000000" font-family="Courier New" font-size="12px" text-anchor="middle">Env 2</text></switch></g><rect x="108" y="310" width="55" height="14" fill="#ffffff" stroke="#000000" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 53px; height: 1px; padding-top: 317px; margin-left: 109px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; "><span style="font-family: &quot;courier new&quot; ; font-size: 11px ; text-align: center">Env 3</span></div></div></div></foreignObject><text x="135" y="321" fill="#000000" font-family="Courier New" font-size="12px" text-anchor="middle">Env 3</text></switch></g><rect x="128.5" y="310" width="14" height="55" fill="none" stroke="none" transform="rotate(270,135.5,337.5)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(90 135.50000000000023 337.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 12px; height: 1px; padding-top: 338px; margin-left: 130px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">···</div></div></div></foreignObject><text x="136" y="341" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">···</text></switch></g><rect x="108" y="350" width="55" height="14" fill="#ffffff" stroke="#000000" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 53px; height: 1px; padding-top: 357px; margin-left: 109px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; "><span style="font-family: &quot;courier new&quot; ; font-size: 11px ; text-align: center">Env n</span></div></div></div></foreignObject><text x="135" y="361" fill="#000000" font-family="Courier New" font-size="12px" text-anchor="middle">Env n</text></switch></g><rect x="128.5" y="310" width="14" height="55" fill="none" stroke="none" transform="rotate(270,135.5,337.5)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(90 135.50000000000023 337.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 12px; height: 1px; padding-top: 338px; margin-left: 130px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">···</div></div></div></foreignObject><text x="136" y="341" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">···</text></switch></g><rect x="128.5" y="310" width="14" height="55" fill="none" stroke="none" transform="rotate(270,135.5,337.5)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(90 135.50000000000023 337.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 12px; height: 1px; padding-top: 338px; margin-left: 130px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: normal; word-wrap: normal; ">···</div></div></div></foreignObject><text x="136" y="341" fill="#000000" font-family="Helvetica" font-size="12px" text-anchor="middle">···</text></switch></g><path d="M 105.38 277 L 90 277 L 90 317 L 80 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 106.88 277 L 104.88 278 L 105.38 277 L 104.88 276 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 105.38 317 L 80 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 106.88 317 L 104.88 318 L 105.38 317 L 104.88 316 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 105.38 357 L 90 357 L 90 317 L 80 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 106.88 357 L 104.88 358 L 105.38 357 L 104.88 356 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 105.38 337.51 L 90 337.57 L 90 317 L 80 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 106.88 337.5 L 104.89 338.51 L 105.38 337.51 L 104.88 336.51 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 105.38 297 L 90 297 L 90 317 L 80 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 106.88 297 L 104.88 298 L 105.38 297 L 104.88 296 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 163 277 L 180 277 L 180 317 L 188.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 188.88 317 L 187.88 317.5 L 188.13 317 L 187.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 163 297 L 180 297 L 180 317 L 188.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 188.88 317 L 187.88 317.5 L 188.13 317 L 187.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 163 317 L 188.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 188.88 317 L 187.88 317.5 L 188.13 317 L 187.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 163 337.5 L 180 337.57 L 180 317 L 188.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 188.88 317 L 187.88 317.5 L 188.13 317 L 187.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 163 357 L 180 357 L 180 317 L 188.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 188.88 317 L 187.88 317.5 L 188.13 317 L 187.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 75.2 312.1 L 79.9 316.8 C 79.96 316.84 80 316.92 80 317 C 80 317.08 79.96 317.16 79.9 317.2 L 75.2 321.9 C 75.16 321.96 75.08 322 75 322 C 74.92 322 74.84 321.96 74.8 321.9 L 70.1 317.2 C 70.04 317.16 70 317.08 70 317 C 70 316.92 70.04 316.84 70.1 316.8 L 74.8 312.1 C 74.84 312.04 74.92 312 75 312 C 75.08 312 75.16 312.04 75.2 312.1 Z" fill="#ffffff" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 70 317 L 80 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 200 317 L 254.71 317 L 254.72 233 C 258.62 233 258.62 227 254.72 227 L 254.72 227 L 254.72 151.21" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 254.72 150.46 L 255.22 151.46 L 254.72 151.21 L 254.22 151.46 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 79.52 150.68 L 79.57 230 L 50 230 L 50 317 L 68.13 317" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 68.88 317 L 67.88 317.5 L 68.13 317 L 67.88 316.5 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="139" y="113" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 118px; margin-left: 174px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><span style="font-size: 10px">batch</span></div></div></div></foreignObject><text x="174" y="121" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">batch</text></switch></g><rect x="139" y="97" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 102px; margin-left: 174px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><i><font style="font-size: 10px">policy.forward()</font></i></div></div></div></foreignObject><text x="174" y="105" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">policy.forwar...</text></switch></g><rect x="201" y="312" width="71" height="10" fill="none" stroke="none" transform="rotate(-90,236.5,317)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 236.5 317)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 317px; margin-left: 237px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><font style="font-size: 10px">observations</font></div></div></div></foreignObject><text x="237" y="320" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">observations</text></switch></g><rect x="185" y="312" width="71" height="10" fill="none" stroke="none" transform="rotate(-90,220.5,317)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 220.49999999999977 317)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 317px; margin-left: 220px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><i><font style="font-size: 10px">env.step()</font></i></div></div></div></foreignObject><text x="220" y="320" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">env.step()</text></switch></g><rect x="223" y="178" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 183px; margin-left: 258px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><i><font style="font-size: 10px">collector.collect()</font></i></div></div></div></foreignObject><text x="258" y="186" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">collector.col...</text></switch></g><rect x="50" y="186" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 191px; margin-left: 85px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><span style="font-size: 10px">actions</span></div></div></div></foreignObject><text x="85" y="194" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">actions</text></switch></g><rect x="50" y="170" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 175px; margin-left: 85px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><i><font style="font-size: 10px">collector.collect()</font></i></div></div></div></foreignObject><text x="85" y="178" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">collector.col...</text></switch></g><rect x="50" y="210" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 215px; margin-left: 85px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><font style="font-size: 17px">Collector</font></div></div></div></foreignObject><text x="85" y="218" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">Collector</text></switch></g><rect x="40" y="50" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 55px; margin-left: 75px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><font style="font-size: 17px">Policy</font></div></div></div></foreignObject><text x="75" y="58" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">Policy</text></switch></g><rect x="550" y="40" width="40" height="340" rx="2" ry="2" fill="#ffffff" stroke="#000000" pointer-events="all"/><rect x="535" y="205" width="71" height="10" fill="none" stroke="none" transform="rotate(-90,570.5,210)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 570.4999999999998 210)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 210px; margin-left: 570px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><font style="font-size: 17px">Logger</font></div></div></div></foreignObject><text x="570" y="213" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">Logger</text></switch></g><path d="M 510 290 L 547.38 289.91" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 548.88 289.91 L 546.89 290.92 L 547.38 289.91 L 546.88 288.92 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="10" y="10" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 15px; margin-left: 45px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><font style="font-size: 17px">Trainer</font></div></div></div></foreignObject><text x="45" y="18" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">Trainer</text></switch></g><path d="M 510 100 L 547.38 99.86" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="stroke"/><path d="M 548.88 99.85 L 546.89 100.86 L 547.38 99.86 L 546.87 98.86 Z" fill="#000000" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="498" y="288" width="71" height="10" fill="none" stroke="none" transform="rotate(-90,533.5,293)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 533.4999999999998 293)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 293px; margin-left: 533px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><span style="font-size: 10px">statistics</span></div></div></div></foreignObject><text x="533" y="296" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">statistics</text></switch></g><rect x="498" y="98" width="71" height="10" fill="none" stroke="none" transform="rotate(-90,533.5,103)" pointer-events="all"/><g transform="translate(-0.5 -0.5)rotate(-90 533.4999999999998 103)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 103px; margin-left: 533px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><span style="font-size: 10px">statistics</span></div></div></div></foreignObject><text x="533" y="106" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">statistics</text></switch></g><rect x="416" y="175" width="71" height="10" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe center; width: 1px; height: 1px; padding-top: 180px; margin-left: 452px;"><div style="box-sizing: border-box; font-size: 0; text-align: center; "><div style="display: inline-block; font-size: 11px; font-family: Courier New; color: #000000; line-height: 1.2; pointer-events: all; font-weight: bold; white-space: nowrap; "><i><font style="font-size: 10px">policy.update()</font></i></div></div></div></foreignObject><text x="452" y="183" fill="#000000" font-family="Courier New" font-size="11px" text-anchor="middle" font-weight="bold">policy.update...</text></switch></g><rect x="375" y="415" width="60" height="20" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe flex-start; width: 1px; height: 1px; padding-top: 425px; margin-left: 377px;"><div style="box-sizing: border-box; font-size: 0; text-align: left; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: nowrap; ">split data</div></div></div></foreignObject><text x="377" y="429" fill="#000000" font-family="Helvetica" font-size="12px">split data</text></switch></g><path d="M 350.61 410.31 L 364.69 424.39 C 364.89 424.53 365 424.76 365 425 C 365 425.24 364.89 425.47 364.69 425.61 L 350.61 439.69 C 350.47 439.89 350.24 440 350 440 C 349.76 440 349.53 439.89 349.39 439.69 L 335.31 425.61 C 335.11 425.47 335 425.24 335 425 C 335 424.76 335.11 424.53 335.31 424.39 L 349.39 410.31 C 349.53 410.11 349.76 410 350 410 C 350.24 410 350.47 410.11 350.61 410.31 Z" fill="#ffffff" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 335 425 L 365 425" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><ellipse cx="140" cy="425" rx="15.000000000000002" ry="15.000000000000002" fill="#ffffff" stroke="#000000" pointer-events="all"/><path d="M 125 425 L 155 425" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><path d="M 140 410 L 140 440" fill="none" stroke="#000000" stroke-miterlimit="10" pointer-events="all"/><rect x="165" y="415" width="110" height="20" fill="none" stroke="none" pointer-events="all"/><g transform="translate(-0.5 -0.5)"><switch><foreignObject style="overflow: visible; text-align: left;" pointer-events="none" width="100%" height="100%" requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"><div xmlns="http://www.w3.org/1999/xhtml" style="display: flex; align-items: unsafe center; justify-content: unsafe flex-start; width: 1px; height: 1px; padding-top: 425px; margin-left: 167px;"><div style="box-sizing: border-box; font-size: 0; text-align: left; "><div style="display: inline-block; font-size: 12px; font-family: Helvetica; color: #000000; line-height: 1.2; pointer-events: all; white-space: nowrap; ">concatenate data</div></div></div></foreignObject><text x="167" y="429" fill="#000000" font-family="Helvetica" font-size="12px">concatenate data</text></switch></g></g><switch><g requiredFeatures="http://www.w3.org/TR/SVG11/feature#Extensibility"/><a transform="translate(0,-5)" xlink:href="https://www.diagrams.net/doc/faq/svg-export-text-problems" target="_blank"><text text-anchor="middle" font-size="10px" x="50%" y="100%">Viewer does not support full SVG 1.1</text></a></switch></svg>\"></img>\n",
"</center>\n",
"\n"
],
"metadata": {
"id": "M98bqxdMsTXK"
}
},
{
"cell_type": "markdown",
"source": [
"# Usages\n",
"Collector can be used both for training (data collecting) and evaluation in Tianshou."
],
"metadata": {
"id": "OX5cayLv4Ziu"
}
},
{
"cell_type": "markdown",
"source": [
"## Policy evaluation\n",
"We need to evaluate our trained policy from time to time in DRL experiments. Collector can help us with this.\n",
"\n",
"First we have to initialise a Collector with an (vectorized) environment and a given policy (agent)."
],
"metadata": {
"id": "Z6XKbj28u8Ze"
}
},
{
"cell_type": "code",
"source": [
"import gym\n",
"import numpy as np\n",
"import torch\n",
"\n",
"from tianshou.data import Collector\n",
"from tianshou.env import DummyVectorEnv\n",
"from tianshou.policy import PGPolicy\n",
"from tianshou.utils.net.common import Net\n",
"from tianshou.utils.net.discrete import Actor\n",
"\n",
"import warnings\n",
"warnings.filterwarnings('ignore')\n",
"\n",
"env = gym.make(\"CartPole-v0\")\n",
"test_envs = DummyVectorEnv([lambda: gym.make(\"CartPole-v0\") for _ in range(2)])\n",
"\n",
"# model\n",
"net = Net(env.observation_space.shape, hidden_sizes=[16,])\n",
"actor = Actor(net, env.action_space.shape)\n",
"optim = torch.optim.Adam(actor.parameters(), lr=0.0003)\n",
"\n",
"policy = PGPolicy(actor, optim, dist_fn=torch.distributions.Categorical)\n",
"test_collector = Collector(policy, test_envs)"
],
"metadata": {
"id": "w8t9ubO7u69J"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"Now we would like to collect 9 episodes of data to test how our initialised Policy performs."
],
"metadata": {
"id": "wmt8vuwpzQdR"
}
},
{
"cell_type": "code",
"source": [
"collect_result = test_collector.collect(n_episode=9)\n",
"print(collect_result)\n",
"print(\"Rewards of 9 episodes are {}\".format(collect_result[\"rews\"]))\n",
"print(\"Average episode reward is {}.\".format(collect_result[\"rew\"]))\n",
"print(\"Average episode length is {}.\".format(collect_result[\"len\"]))"
],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "9SuT6MClyjyH",
"outputId": "1e48f13b-c1fe-4fc2-ca1b-669485efdcae"
},
"execution_count": null,
"outputs": [
{
"output_type": "stream",
"name": "stdout",
"text": [
"{'n/ep': 9, 'n/st': 85, 'rews': array([ 9., 9., 10., 10., 9., 9., 10., 10., 9.]), 'lens': array([ 9, 9, 10, 10, 9, 9, 10, 10, 9]), 'idxs': array([0, 1, 0, 1, 0, 1, 0, 1, 1]), 'rew': 9.444444444444445, 'len': 9.444444444444445, 'rew_std': 0.49690399499995325, 'len_std': 0.49690399499995325}\n",
"Rewards of 9 episodes are [ 9. 9. 10. 10. 9. 9. 10. 10. 9.]\n",
"Average episode reward is 9.444444444444445.\n",
"Average episode length is 9.444444444444445.\n"
]
}
]
},
{
"cell_type": "markdown",
"source": [
"Now we wonder what is the performance of a random policy."
],
"metadata": {
"id": "zX9AQY0M0R3C"
}
},
{
"cell_type": "code",
"source": [
"# Reset the collector\n",
"test_collector.reset()\n",
"collect_result = test_collector.collect(n_episode=9, random=True)\n",
"print(collect_result)\n",
"print(\"Rewards of 9 episodes are {}\".format(collect_result[\"rews\"]))\n",
"print(\"Average episode reward is {}.\".format(collect_result[\"rew\"]))\n",
"print(\"Average episode length is {}.\".format(collect_result[\"len\"]))"
],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "UEcs8P8P0RLt",
"outputId": "85f02f9d-b79b-48b2-99c6-36a1602f0884"
},
"execution_count": null,
"outputs": [
{
"output_type": "stream",
"name": "stdout",
"text": [
"{'n/ep': 9, 'n/st': 187, 'rews': array([13., 14., 11., 44., 35., 14., 19., 17., 20.]), 'lens': array([13, 14, 11, 44, 35, 14, 19, 17, 20]), 'idxs': array([1, 0, 0, 1, 0, 0, 1, 0, 1]), 'rew': 20.77777777777778, 'len': 20.77777777777778, 'rew_std': 10.580671872993257, 'len_std': 10.580671872993257}\n",
"Rewards of 9 episodes are [13. 14. 11. 44. 35. 14. 19. 17. 20.]\n",
"Average episode reward is 20.77777777777778.\n",
"Average episode length is 20.77777777777778.\n"
]
}
]
},
{
"cell_type": "markdown",
"source": [
"Seems that an initialised policy performs even worse than a random policy without any training."
],
"metadata": {
"id": "sKQRTiG10ljU"
}
},
{
"cell_type": "markdown",
"source": [
"## Data Collecting\n",
"Data collecting is mostly used during training, when we need to store the collected data in a ReplayBuffer."
],
"metadata": {
"id": "8RKmHIoG1A1k"
}
},
{
"cell_type": "code",
"source": [
"from tianshou.data import VectorReplayBuffer\n",
"train_env_num = 4\n",
"buffer_size = 100\n",
"train_envs = DummyVectorEnv([lambda: gym.make(\"CartPole-v0\") for _ in range(train_env_num)])\n",
"replaybuffer = VectorReplayBuffer(buffer_size, train_env_num)\n",
"\n",
"train_collector = Collector(policy, train_envs, replaybuffer)"
],
"metadata": {
"id": "CB9XB9bF1YPC"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"Now we can collect 50 steps of data, which will be automatically saved in the replay buffer. You can still choose to collect a certain number of episodes rather than steps. Try it yourself."
],
"metadata": {
"id": "rWKDazA42IUQ"
}
},
{
"cell_type": "code",
"source": [
"print(len(replaybuffer))\n",
"collect_result = train_collector.collect(n_step=50)\n",
"print(len(replaybuffer))\n",
"print(collect_result)"
],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "-fUtQOnM2Yi1",
"outputId": "dceee987-433e-4b75-ed9e-823c20a9e1c2"
},
"execution_count": null,
"outputs": [
{
"output_type": "stream",
"name": "stdout",
"text": [
"0\n",
"52\n",
"{'n/ep': 4, 'n/st': 52, 'rews': array([ 8., 10., 10., 10.]), 'lens': array([ 8, 10, 10, 10]), 'idxs': array([25, 0, 50, 75]), 'rew': 9.5, 'len': 9.5, 'rew_std': 0.8660254037844386, 'len_std': 0.8660254037844386}\n"
]
}
]
},
{
"cell_type": "code",
"source": [
"for i in range(13):\n",
" print(i, replaybuffer.next(i))"
],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "EWO4A7plefwM",
"outputId": "9a6f36d1-2b84-49b0-a03d-a8ebe8acadbf"
},
"execution_count": null,
"outputs": [
{
"output_type": "stream",
"name": "stdout",
"text": [
"0 1\n",
"1 2\n",
"2 3\n",
"3 4\n",
"4 5\n",
"5 6\n",
"6 7\n",
"7 8\n",
"8 9\n",
"9 9\n",
"10 11\n",
"11 12\n",
"12 12\n"
]
}
]
},
{
"cell_type": "code",
"source": [
"replaybuffer.sample(10)"
],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "HW8PpWH9fLCo",
"outputId": "7ca70c50-23b9-4405-9e42-2e5771cd9c78"
},
"execution_count": null,
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
"(Batch(\n",
" obs: array([[-6.36541036e-02, -1.15297838e+00, 7.85138179e-02,\n",
" 1.77225357e+00],\n",
" [-1.05090645e-02, -3.71521519e-01, -2.96323181e-03,\n",
" 5.76793524e-01],\n",
" [-2.45282997e-02, 4.77850180e-02, 2.21142716e-02,\n",
" 4.96743371e-02],\n",
" [ 1.68433453e-03, 4.47272356e-02, -1.72360346e-02,\n",
" -7.74977680e-03],\n",
" [ 4.29854159e-02, -3.95380051e-01, 3.91005958e-02,\n",
" 6.41183774e-01],\n",
" [ 3.50778149e-02, -5.91024637e-01, 5.19242712e-02,\n",
" 9.45918993e-01],\n",
" [-1.19358173e-01, -1.38179912e+00, 1.04318690e-01,\n",
" 2.10185768e+00],\n",
" [-3.48454722e-02, 1.61145106e-03, -2.49944951e-02,\n",
" -3.82213155e-02],\n",
" [-3.48454722e-02, 1.61145106e-03, -2.49944951e-02,\n",
" -3.82213155e-02],\n",
" [-4.64338716e-02, -5.82710950e-01, -1.02110827e-02,\n",
" 8.16967413e-01]]),\n",
" act: array([0, 0, 0, 0, 0, 0, 0, 0, 0, 0]),\n",
" rew: array([1., 1., 1., 1., 1., 1., 1., 1., 1., 1.]),\n",
" done: array([False, False, False, False, False, False, False, False, False,\n",
" False]),\n",
" obs_next: array([[-0.08671367, -1.34889318, 0.11395889, 2.08827982],\n",
" [-0.01793949, -0.56660181, 0.00857264, 0.86854149],\n",
" [-0.0235726 , -0.14764694, 0.02310776, 0.34925166],\n",
" [ 0.00257888, -0.15014334, -0.01739103, 0.27944553],\n",
" [ 0.03507781, -0.59102464, 0.05192427, 0.94591899],\n",
" [ 0.02325732, -0.78680603, 0.07084265, 1.25445415],\n",
" [-0.14699416, -1.57780202, 0.14635584, 2.42487783],\n",
" [-0.03481324, -0.19314333, -0.02575892, 0.24647199],\n",
" [-0.03481324, -0.19314333, -0.02575892, 0.24647199],\n",
" [-0.05808809, -0.77769163, 0.00612827, 1.10642118]]),\n",
" info: Batch(\n",
" env_id: array([0, 0, 0, 1, 1, 1, 2, 3, 3, 3]),\n",
" ),\n",
" policy: Batch(),\n",
" ), array([ 6, 2, 10, 33, 27, 28, 57, 75, 75, 78]))"
]
},
"metadata": {},
"execution_count": 10
}
]
},
{
"cell_type": "markdown",
"source": [
"# Further Reading\n",
"The above collector actually collects 52 data at a time because 52 % 4 = 0. There is one asynchronous collector which allows you collect exactly 50 steps. Check the [documentation](https://tianshou.readthedocs.io/en/master/api/tianshou.data.html#asynccollector) for details."
],
"metadata": {
"id": "8NP7lOBU3-VS"
}
}
]
}