Update README.md

2024-05-16 19:30:58 +08:00 · 2024-05-16 19:30:58 +08:00 · 009f8c6072
commit 009f8c6072
parent 51fbded316
1 changed files with 52 additions and 0 deletions
--- a/README.md
+++ b/README.md
@ -1,2 +1,54 @@
 # SPO outperforms PPO in all environments when the network deepens (five random seeds):
 ![MuJoCo](https://github.com/MyRepositories-hub/Simple-Policy-Optimization/blob/main/draw_return_mujoco.png)
+
+# Training
+**The experimental environment is `gymnasium`, and you need to execute the following command to install the dependencies:**
+## MuJoCo
+### Installation
+```bash
+pip install gymnasium
+pip install gymnasium[mujoco]
+```
+### Reminder
+Please change the code from 
+```python
+self.add_overlay(bottomleft, "Solver iterations", str(self.data.solver_iter + 1))
+```
+to 
+```python
+self.add_overlay(bottomleft, "Solver iterations", str(self.data.solver_niter + 1))
+```
+in line 593 of the file path `venv\Lib\site-packages\gymnasium\envs\mujoco\mujoco_rendering.py` to resolve the error
+
+### Running
+```python
+import gymnasium as gym
+
+env = gym.make('Humanoid-v4', render_mode='human')
+while True:
+    s, _ = env.reset()
+    done = False
+    while not done:
+        a = env.action_space.sample()
+        s_next, r, dw, tr, info = env.step(a)
+        done = (dw or tr)
+```
+## Atari
+### Installation
+```bash
+pip install gymnasium[atari]
+pip install gymnasium[accept-rom-license]
+```
+### Running
+```python
+import gymnasium as gym
+
+env = gym.make('ALE/Breakout-v5', render_mode='human')
+while True:
+    s, _ = env.reset()
+    done = False
+    while not done:
+        a = env.action_space.sample()
+        s_next, r, dw, tr, info = env.step(a)
+        done = (dw or tr)
+```