Logo
Explore Help
Sign In
hongshaorou/dreamer4
1
0
Fork 0
You've already forked dreamer4
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
138 Commits 2 Branches 119 Tags
Commit Graph

5 Commits

Author SHA1 Message Date
lucidrains
d82debb7a6 first pass through gathering experience with a mock env for online rl 2025-10-22 08:32:46 -07:00
lucidrains
03b16a48f2 sketch out the dream trainer, seems like they only fine tune the heads 2025-10-22 06:41:10 -07:00
lucidrains
c4e0f46528 for the value head, we will go for symexp encoding as well (following the "stop regressing" paper from Farebrother et al), also use layernormed mlp given recent papers 2025-10-08 07:37:34 -07:00
lucidrains
e3cbcd94c6 sketch out top down 2025-10-01 10:25:56 -07:00
lucidrains
bdc7dd30a6 scaffold 2025-10-01 07:18:23 -07:00
Powered by Gitea Version: 23.8.0 Page: 160ms Template: 10ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API