hongshaorou/dreamer4
dreamer4/dreamer4
lucidrains c5e64ff4ce separate out the key from the value projections in attention for muon
2025-10-12 09:42:22 -07:00
__init__.py
for the value head, we will go for symexp encoding as well (following the "stop regressing" paper from Farebrother et al), also use layernormed mlp given recent papers
2025-10-08 07:37:34 -07:00
dreamer4.py
separate out the key from the value projections in attention for muon
2025-10-12 09:42:22 -07:00
trainers.py
add the noising of the latent context during generation, technique i think was from EPFL, or perhaps some google group that built on top of EPFL work
2025-10-07 09:37:37 -07:00