Implementation of Danijar's latest iteration for his Dreamer line of work
artificial-intelligence
attention
deep-learning
model-based-reinforcement-learning
transformers
world-models
Updated 2025-12-23 22:25:16 +08:00
Deep Reinforcement Learning using Truly Proximal Policy Optimization in TensorFlow 2 and PyTorch
Updated 2024-09-23 15:51:56 +08:00