24 Commits

Author SHA1 Message Date
lucidrains
187edc1414 all set for generating the perceived rewards once the RL components fall into place 2025-10-08 06:33:28 -07:00
lucidrains
36ccb08500 allow for step_sizes to be passed in, log2 is not that intuitive 2025-10-07 08:36:46 -07:00
lucidrains
a8e14f4b7c oops 2025-10-07 08:09:33 -07:00
lucidrains
c6bef85984 generating video with raw teacher forcing 2025-10-07 07:22:57 -07:00
lucidrains
83ba9a285a reorganize tokenizer to generate video from the dynamics model 2025-10-06 11:37:45 -07:00
lucidrains
7180a8cf43 start carving into the reinforcement learning portion, starting with reward prediction head (single for now) 2025-10-06 11:17:25 -07:00
lucidrains
25b8de91cc handle spatial tokens less than latent tokens in dynamics model 2025-10-06 09:19:27 -07:00
lucidrains
f507afa0d3 last commit for the day - take care of the task embed 2025-10-05 11:40:48 -07:00
lucidrains
fe99efecba make a first pass through the shortcut training logic (Frans et al from Berkeley) maintaining both v-space and x-space 2025-10-05 11:17:36 -07:00
lucidrains
971637673b complete all the types of attention masking patterns as proposed in the paper 2025-10-04 12:45:54 -07:00
lucidrains
5c6be4d979 take care of blocked causal in video tokenizer, still need the special attention pattern for latents to and from though 2025-10-04 12:03:50 -07:00
lucidrains
6c994db341 first nail down the attention masking for the dynamics transformer model using a factory function 2025-10-04 11:20:57 -07:00
lucidrains
895a867a66 able to accept raw video for dynamics model, if tokenizer passed in 2025-10-04 06:57:54 -07:00
lucidrains
8373cb13ec grouped query attention is necessary 2025-10-04 06:31:32 -07:00
lucidrains
046f8927d1 complete the symexp two hot proposed by Hafner from the previous versions of Dreamer, but will also bring in hl gauss 2025-10-03 08:08:44 -07:00
lucidrains
8d1cd311bb Revert "address https://github.com/lucidrains/dreamer4/issues/1"
This reverts commit e23a5294ec2f49d58d3ccb936c498eb86939fa96.
2025-10-02 12:25:05 -07:00
lucidrains
e23a5294ec address https://github.com/lucidrains/dreamer4/issues/1 2025-10-02 11:49:22 -07:00
lucidrains
49082d8629 x-space and v-space prediction in dynamics model 2025-10-02 08:36:00 -07:00
lucidrains
8b66b703e0 add the discretized signal level + step size embeddings necessary for diffusion forcing + shortcut 2025-10-02 07:39:34 -07:00
lucidrains
bb7a5d1680 sketch out the axial space time transformer in dynamics model 2025-10-02 07:17:58 -07:00
lucidrains
0285bba821 flesh out tokenizer even more 2025-10-02 06:11:04 -07:00
lucidrains
31c4aa28c7 start setting up tokenizer 2025-10-02 05:37:43 -07:00
lucidrains
e8678364ba swish glu feedforward from shazeer et al 2025-10-01 09:28:25 -07:00
lucidrains
bdc7dd30a6 scaffold 2025-10-01 07:18:23 -07:00