# dreamerv3-torch
PyTorch implementation of [Mastering Diverse Domains through World Models](https://arxiv.org/abs/2301.04104v1). DreamerV3 is a scalable algorithm that outperforms previous approaches across a wide range of domains using fixed hyperparameters.
## Instructions
Install dependencies:
```
pip install -r requirements.txt
```
Train the agent on Walker Walk in DMC Vision:
```
python3 dreamer.py --configs dmc_vision --task dmc_walker_walk --logdir ./logdir/dmc_walker_walk
```
Train the agent on Walker Walk in DMC Proprio:
```
python3 dreamer.py --configs dmc_proprio --task dmc_walker_walk --logdir ./logdir/dmc_walker_walk
```
Train the agent on Alien in Atari 100k:
```
python3 dreamer.py --configs atari100k --task atari_alien --logdir ./logdir/atari_alien
```
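To script several runs, the commands above can be generated rather than typed by hand. The following is a minimal sketch, not part of this repository: `build_command`, `RUNS`, and `COMMANDS` are hypothetical names, and only the script name `dreamer.py` and the `--configs`, `--task`, and `--logdir` flags are taken from the commands above.

```python
import shlex

# Hypothetical helper (not part of the repository). Only dreamer.py and the
# --configs / --task / --logdir flags come from the README commands.
def build_command(config, task, logdir_root="./logdir"):
    """Return the argument list for one training run."""
    logdir = f"{logdir_root}/{task}"
    return [
        "python3", "dreamer.py",
        "--configs", config,
        "--task", task,
        "--logdir", logdir,
    ]

# The three (config, task) pairs shown in the README.
RUNS = [
    ("dmc_vision", "dmc_walker_walk"),
    ("dmc_proprio", "dmc_walker_walk"),
    ("atari100k", "atari_alien"),
]

COMMANDS = [build_command(config, task) for config, task in RUNS]

if __name__ == "__main__":
    # Print each command as a shell-ready line instead of launching it.
    for argv in COMMANDS:
        print(shlex.join(argv))
```

Each entry of `COMMANDS` can be passed to `subprocess.run(argv, check=True)` to launch the run. Both Walker Walk runs resolve to the same log directory, as in the commands above; passing a different `logdir_root` per run is one way to keep them apart if that is preferred.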
Monitor results:
```
tensorboard --logdir ./logdir
```
## Results
#### DMC Vision

#### Atari 100k

#### DMC Proprio

## Acknowledgments
This code is heavily inspired by the following works:
- danijar's Dreamer-v3 JAX implementation: https://github.com/danijar/dreamerv3
- danijar's Dreamer-v2 TensorFlow implementation: https://github.com/danijar/dreamerv2
- jsikyoon's Dreamer-v2 PyTorch implementation: https://github.com/jsikyoon/dreamer-torch
- RajGhugare19's Dreamer-v2 PyTorch implementation: https://github.com/RajGhugare19/dreamerv2
- denisyarats's DrQ-v2 original implementation: https://github.com/facebookresearch/drqv2