update iqn results and reward plots (#377)
@ -60,13 +60,13 @@ One epoch here is equal to 100,000 env step, 100 epochs stand for 10M.
|
|||||||
|
|
||||||
| task | best reward | reward curve | parameters |
|
| task | best reward | reward curve | parameters |
|
||||||
| --------------------------- | ----------- | ------------------------------------- | ------------------------------------------------------------ |
|
| --------------------------- | ----------- | ------------------------------------- | ------------------------------------------------------------ |
|
||||||
| PongNoFrameskip-v4 | 20.9 |  | `python3 atari_iqn.py --task "PongNoFrameskip-v4" --batch-size 64` |
|
| PongNoFrameskip-v4 | 20.3 |  | `python3 atari_iqn.py --task "PongNoFrameskip-v4" --batch-size 64` |
|
||||||
| BreakoutNoFrameskip-v4 | 578.3 |  | `python3 atari_iqn.py --task "BreakoutNoFrameskip-v4" --n-step 1` |
|
| BreakoutNoFrameskip-v4 | 496.7 |  | `python3 atari_iqn.py --task "BreakoutNoFrameskip-v4" --n-step 1` |
|
||||||
| EnduroNoFrameskip-v4 | 1507 |  | `python3 atari_iqn.py --task "EnduroNoFrameskip-v4"` |
|
| EnduroNoFrameskip-v4 | 1545 |  | `python3 atari_iqn.py --task "EnduroNoFrameskip-v4"` |
|
||||||
| QbertNoFrameskip-v4 | 15520 |  | `python3 atari_iqn.py --task "QbertNoFrameskip-v4"` |
|
| QbertNoFrameskip-v4 | 15342.5 |  | `python3 atari_iqn.py --task "QbertNoFrameskip-v4"` |
|
||||||
| MsPacmanNoFrameskip-v4 | 2911 |  | `python3 atari_iqn.py --task "MsPacmanNoFrameskip-v4"` |
|
| MsPacmanNoFrameskip-v4 | 2915 |  | `python3 atari_iqn.py --task "MsPacmanNoFrameskip-v4"` |
|
||||||
| SeaquestNoFrameskip-v4 | 6236 |  | `python3 atari_iqn.py --task "SeaquestNoFrameskip-v4"` |
|
| SeaquestNoFrameskip-v4 | 4874 |  | `python3 atari_iqn.py --task "SeaquestNoFrameskip-v4"` |
|
||||||
| SpaceInvadersNoFrameskip-v4 | 1370 |  | `python3 atari_iqn.py --task "SpaceInvadersNoFrameskip-v4"` |
|
| SpaceInvadersNoFrameskip-v4 | 1498.5 |  | `python3 atari_iqn.py --task "SpaceInvadersNoFrameskip-v4"` |
|
||||||
|
|
||||||
# BCQ
|
# BCQ
|
||||||
|
|
||||||
|
Before Width: | Height: | Size: 226 KiB After Width: | Height: | Size: 253 KiB |
Before Width: | Height: | Size: 243 KiB After Width: | Height: | Size: 247 KiB |
Before Width: | Height: | Size: 209 KiB After Width: | Height: | Size: 211 KiB |
Before Width: | Height: | Size: 119 KiB After Width: | Height: | Size: 134 KiB |
Before Width: | Height: | Size: 214 KiB After Width: | Height: | Size: 211 KiB |
Before Width: | Height: | Size: 236 KiB After Width: | Height: | Size: 229 KiB |
Before Width: | Height: | Size: 222 KiB After Width: | Height: | Size: 216 KiB |