Logo
Explore Help
Sign In
hongshaorou/dreamer4
1
0
Fork 0
You've already forked dreamer4
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
dreamer4/dreamer4
History
lucidrains 2a902eaaf7 allow reward tokens to be attended to as state optionally, DT-esque. figure out multi-agent scenario once i get around to it
2025-10-16 06:41:02 -07:00
..
__init__.py
for the value head, we will go for symexp encoding as well (following the "stop regressing" paper from Farebrother et al), also use layernormed mlp given recent papers
2025-10-08 07:37:34 -07:00
dreamer4.py
allow reward tokens to be attended to as state optionally, DT-esque. figure out multi-agent scenario once i get around to it
2025-10-16 06:41:02 -07:00
trainers.py
add the noising of the latent context during generation, technique i think was from EPFL, or perhaps some google group that built on top of EPFL work
2025-10-07 09:37:37 -07:00
Powered by Gitea Version: 23.8.0 Page: 51ms Template: 2ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API