
2024-05-03 13:42:59 +08:00
# SPO outperforms PPO in all environments as the network gets deeper
![MuJoCo](https://github.com/MyRepositories-hub/Simple-Policy-Optimization/blob/main/draw_return_mujoco.png)