dreamer4

Author	SHA1	Message	Date
lucidrains	4c2ed100a3	fix masking for multiple agent tokens	2025-10-08 08:26:44 -07:00
lucidrains	63b63dfedd	add shard	2025-10-08 06:56:03 -07:00
lucidrains	187edc1414	all set for generating the perceived rewards once the RL components fall into place	2025-10-08 06:33:28 -07:00
lucidrains	c056835aea	address https://github.com/lucidrains/dreamer4/issues/2	2025-10-08 05:55:22 -07:00
lucidrains	0fdb67bafa	add the noising of the latent context during generation, technique i think was from EPFL, or perhaps some google group that built on top of EPFL work	2025-10-07 09:37:37 -07:00
lucidrains	36ccb08500	allow for step_sizes to be passed in, log2 is not that intuitive	2025-10-07 08:36:46 -07:00
lucidrains	1176269927	correct signal levels when doing teacher forcing generation	2025-10-07 07:41:02 -07:00
lucidrains	0f4783f23c	use a newly built module from x-mlps for multi token prediction	2025-10-04 07:56:56 -07:00
lucidrains	0a26e0f92f	complete the lpips loss used for the video tokenizer	2025-10-04 07:47:27 -07:00
lucidrains	986bf4c529	allow for the video tokenizer to accept any spatial dimensions by parameterizing the decoder positional embedding with an MLP	2025-10-03 10:08:05 -07:00
lucidrains	046f8927d1	complete the symexp two hot proposed by Hafner from the previous versions of Dreamer, but will also bring in hl gauss	2025-10-03 08:08:44 -07:00
lucidrains	8b66b703e0	add the discretized signal level + step size embeddings necessary for diffusion forcing + shortcut	2025-10-02 07:39:34 -07:00
lucidrains	e3cbcd94c6	sketch out top down	2025-10-01 10:25:56 -07:00
lucidrains	2e92c0121a	they employ two stability measures, qk rmsnorm and softclamping of attention logits	2025-10-01 09:40:24 -07:00
lucidrains	bdc7dd30a6	scaffold	2025-10-01 07:18:23 -07:00

15 Commits