Commit Graph

  • 0285bba821 flesh out tokenizer even more lucidrains 2025-10-02 06:11:04 -07:00
  • 31c4aa28c7 start setting up tokenizer lucidrains 2025-10-02 05:37:43 -07:00
  • 67519a451d softclamping in flex lucidrains 2025-10-01 12:19:41 -07:00
  • 8e7a35b89c cover the attention masking for tokenizer encoder, decoder, as well as dynamics model (latent and agent tokens are "special" and placed on the right) lucidrains 2025-10-01 12:11:06 -07:00
  • c18c624be6 their latent bottleneck is tanh it seems, constraining it to -1 to 1 for flow matching in dynamics model. please open an issue if mistaken lucidrains 2025-10-01 10:39:16 -07:00
  • e3cbcd94c6 sketch out top down lucidrains 2025-10-01 10:25:56 -07:00
  • 882e63511b will apply the golden gate rotary for this work as an option lucidrains 2025-10-01 10:07:54 -07:00
  • ceb1af263e oops lucidrains 2025-10-01 09:49:04 -07:00
  • c979883f21 ready the block causal mask lucidrains 2025-10-01 09:45:54 -07:00
  • 2e92c0121a they employ two stability measures, qk rmsnorm and softclamping of attention logits lucidrains 2025-10-01 09:40:24 -07:00
  • e8678364ba swish glu feedforward from shazeer et al lucidrains 2025-10-01 09:28:25 -07:00
  • 8ebb8a9661 finished a first pass at digesting the paper, start with transformer lucidrains 2025-10-01 09:21:55 -07:00
  • e0dd4cfeaa they replace the recurrent state-space model with a transformer, with the implication that the former does not scale lucidrains 2025-10-01 07:59:02 -07:00
  • bdc7dd30a6 scaffold lucidrains 2025-10-01 07:18:18 -07:00
  • 62e9c4eecf project page Phil Wang 2025-10-01 06:56:03 -07:00
  • febbc73284 dreamer fig2 lucidrains 2025-10-01 06:30:29 -07:00
  • deecd30f52 wip Phil Wang 2025-09-30 05:59:20 -07:00
  • 4eeb4ee7fc Initial commit Phil Wang 2025-09-30 05:58:16 -07:00