Dominik Jain 2024-04-29 14:10:47 +02:00
parent d18ded333e
commit 74737416ff


@@ -152,7 +152,7 @@
 "id": "Lh2-hwE5Dn9I"
 },
 "source": [
-"Once we have defined the actor, the critic and the optimizer. We can use them to construct our PPO agent. CartPole is a discrete action space problem, so the distribution of our action space can be a categorical distribution."
+"Once we have defined the actor, the critic and the optimizer, we can use them to construct our PPO agent. CartPole is a discrete action space problem, so the distribution of our action space can be a categorical distribution."
 ]
 },
 {