# dreamerv3-torch

PyTorch implementation of [Mastering Diverse Domains through World Models](https://arxiv.org/abs/2301.04104v1).
## Instructions

Get dependencies:

```
pip install -r requirements.txt
```
Train the agent on Walker Walk in Vision DMC:

```
python3 dreamer.py --configs defaults --task dmc_walker_walk --logdir ~/dreamerv3-torch/logdir/dmc_walker_walk
```
Train the agent on Alien in Atari 100K:

```
python3 dreamer.py --configs defaults atari --task atari_alien --logdir ~/dreamerv3-torch/logdir/atari_alien
```
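
Both training commands above follow the same pattern: a list of config names, a task name, and a log directory named after the task. The sketch below only prints the command for a given task so it can be inspected before running. `train_cmd` is a hypothetical helper, not part of this repository, and the per-task log directory layout is inferred from the two examples above.

```shell
# Hypothetical helper (not part of the repository): print the training
# command for a task, using the same flags as the examples above.
# Usage: train_cmd <task> <config> [<config> ...]
train_cmd() {
  task="$1"
  shift
  # "$*" joins the remaining arguments (the config names) with spaces.
  # $HOME is used instead of ~ because ~ is not expanded inside quotes.
  printf 'python3 dreamer.py --configs %s --task %s --logdir %s\n' \
    "$*" "$task" "$HOME/dreamerv3-torch/logdir/$task"
}

train_cmd dmc_walker_walk defaults
train_cmd atari_alien defaults atari
```

To launch a run rather than print it, copy the printed line into the shell.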
Monitor results:

```
tensorboard --logdir ~/dreamerv3-torch/logdir
```
## ToDo

- [x] Prototyping
- [x] Modify implementation details based on the author's implementation
- [x] Evaluate on DMC vision
- [ ] Evaluate on Atari 100K
- [ ] Add state input capability
- [ ] Evaluate on DMC Proprio
- [ ] etc.
## Acknowledgments

This code is heavily inspired by the following works:

- danijar's Dreamer-v3 JAX implementation: https://github.com/danijar/dreamerv3
- danijar's Dreamer-v2 TensorFlow implementation: https://github.com/danijar/dreamerv2
- jsikyoon's Dreamer-v2 PyTorch implementation: https://github.com/jsikyoon/dreamer-torch
- RajGhugare19's Dreamer-v2 PyTorch implementation: https://github.com/RajGhugare19/dreamerv2
- denisyarats's DrQ-v2 original implementation: https://github.com/facebookresearch/drqv2