This website requires JavaScript.
Explore
Help
Sign In
hongshaorou
/
dreamer4
Watch
1
Star
0
Fork
0
You've already forked dreamer4
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
dreamer4
/
dreamer4
History
lucidrains
c5e64ff4ce
separate out the key from the value projections in attention for muon
2025-10-12 09:42:22 -07:00
..
__init__.py
for the value head, we will go for symexp encoding as well (following the "stop regressing" paper from Farebrother et al), also use layernormed mlp given recent papers
2025-10-08 07:37:34 -07:00
dreamer4.py
separate out the key from the value projections in attention for muon
2025-10-12 09:42:22 -07:00
trainers.py
add the noising of the latent context during generation, technique i think was from EPFL, or perhaps some google group that built on top of EPFL work
2025-10-07 09:37:37 -07:00