# dreamerv3-torch
PyTorch implementation of [Mastering Diverse Domains through World Models](https://arxiv.org/abs/2301.04104v1). DreamerV3 is a scalable algorithm that outperforms previous approaches across various domains with fixed hyperparameters.

## Instructions

### Method 1: Manual

Install dependencies with Python 3.11:
```
pip install -r requirements.txt
```
Run training on DMC Vision:
```
python3 dreamer.py --configs dmc_vision --task dmc_walker_walk --logdir ./logdir/dmc_walker_walk
```
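To launch the same command for several tasks, the flags can be assembled in a script. The following is a minimal sketch and not part of this repository: `build_command` is a hypothetical helper, and it relies only on the `--configs`, `--task`, and `--logdir` flags shown in the command above. The resulting list could be passed to `subprocess.run(cmd, check=True)` from the repository root.
```python
import shlex


def build_command(config, task, logdir_root="./logdir"):
    """Return the training command as an argument list (hypothetical helper)."""
    return [
        "python3", "dreamer.py",
        "--configs", config,
        "--task", task,
        # One log directory per task, mirroring the example above.
        "--logdir", f"{logdir_root}/{task}",
    ]


if __name__ == "__main__":
    # dmc_walker_walk is the task used in the example above.
    for task in ["dmc_walker_walk"]:
        print(shlex.join(build_command("dmc_vision", task)))
```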
Monitor results:
```
tensorboard --logdir ./logdir
```
To set up the Atari or Minecraft environments, see the scripts in [envs/setup_scripts](https://github.com/NM512/dreamerv3-torch/tree/main/envs/setup_scripts).

### Method 2: Docker

See the Dockerfile, which includes the build and run instructions.

## Benchmarks
The following benchmarks are currently available for testing:

| Environment | Observation | Action | Budget | Description |
|---|---|---|---|---|
| [DMC Proprio](https://github.com/deepmind/dm_control) | State | Continuous | 500K | DeepMind Control Suite with low-dimensional inputs. |
| [DMC Vision](https://github.com/deepmind/dm_control) | Image | Continuous | 1M | DeepMind Control Suite with high-dimensional image inputs. |
| [Atari 100k](https://github.com/openai/atari-py) | Image | Discrete | 400K | 26 Atari games. |
| [Crafter](https://github.com/danijar/crafter) | Image | Discrete | 1M | Survival environment that evaluates diverse agent abilities. |
| [Minecraft](https://github.com/minerllabs/minerl) | Image and State | Discrete | 100M | Vast 3D open world. |
| [Memory Maze](https://github.com/jurgisp/memory-maze) | Image | Discrete | 100M | 3D mazes to evaluate RL agents' long-term memory. |

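The budgets in the table use `K` and `M` suffixes. For scripting, they can be converted to integers as in the rough sketch below. `BUDGETS` and `parse_budget` are illustrative names, not part of this repository, and the values are copied from the table.

```python
# Budgets copied from the benchmark table (illustrative dictionary).
BUDGETS = {
    "DMC Proprio": "500K",
    "DMC Vision": "1M",
    "Atari 100k": "400K",
    "Crafter": "1M",
    "Minecraft": "100M",
    "Memory Maze": "100M",
}


def parse_budget(text):
    """Convert a budget string such as '500K' or '1M' to an integer."""
    multipliers = {"K": 1_000, "M": 1_000_000}
    text = text.strip().upper()
    if text and text[-1] in multipliers:
        return int(float(text[:-1]) * multipliers[text[-1]])
    return int(text)


if __name__ == "__main__":
    for name, budget in BUDGETS.items():
        print(f"{name}: {parse_budget(budget):,}")
```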
## Results
#### DMC Proprio
![dmcproprio](imgs/dmcproprio.png)
#### DMC Vision
![dmcvision](imgs/dmcvision.png)
#### Atari 100k
![atari100k](imgs/atari100k.png)

#### Crafter
<img src="https://github.com/NM512/dreamerv3-torch/assets/70328564/a0626038-53f6-4300-a622-7ac257f4c290" width="300" height="150" />

## Acknowledgments
This code is heavily inspired by the following works:
- danijar's Dreamer-v3 JAX implementation: https://github.com/danijar/dreamerv3
- danijar's Dreamer-v2 TensorFlow implementation: https://github.com/danijar/dreamerv2
- jsikyoon's Dreamer-v2 PyTorch implementation: https://github.com/jsikyoon/dreamer-torch
- RajGhugare19's Dreamer-v2 PyTorch implementation: https://github.com/RajGhugare19/dreamerv2
- denisyarats's DrQ-v2 original implementation: https://github.com/facebookresearch/drqv2