张德祥
|
ea446adaf4
|
mem maze env ok 1
|
2023-06-17 23:29:53 +08:00 |
|
张德祥
|
1cf0149c10
|
env v0.13
|
2023-06-14 20:22:17 +08:00 |
|
张德祥
|
b9120a7440
|
env v0.12
|
2023-06-13 21:39:04 +08:00 |
|
张德祥
|
7879c6cfe7
|
env v01
|
2023-06-13 09:58:03 +08:00 |
|
NM512
|
0faa10ff46
|
expanded the supported image sizes
|
2023-05-21 22:00:59 +09:00 |
|
NM512
|
7e67dc6910
|
set default precision as 32
|
2023-05-17 22:16:55 +09:00 |
|
NM512
|
b984e69b6e
|
added state input capability
|
2023-05-14 23:38:46 +09:00 |
|
NM512
|
3ebb8ad617
|
updated README
|
2023-05-05 18:21:19 +09:00 |
|
NM512
|
0eb66997fb
|
learnable initial state options for RSSM
|
2023-04-29 07:54:03 +09:00 |
|
NM512
|
1328ff1088
|
sampling from the replay buffer across episodes
|
2023-04-29 07:43:02 +09:00 |
|
NM512
|
628b856c63
|
changed the discount head to predict terminal
|
2023-04-22 09:34:23 +09:00 |
|
NM512
|
1e070a3daf
|
cleaned up envs
|
2023-04-15 23:16:43 +09:00 |
|
NM512
|
55ed69bdf7
|
fix bug when using envs > 1
|
2023-04-15 15:25:25 +09:00 |
|
NM512
|
cd935b7dd9
|
set default replay buffer size as 1M
|
2023-04-05 21:38:51 +09:00 |
|
NM512
|
942eae10a9
|
updated result, requirements and torch version
|
2023-03-24 07:51:57 +09:00 |
|
NM512
|
6273444394
|
modified based on author's implementation
|
2023-03-18 08:38:23 +09:00 |
|
NM512
|
f96ad071d1
|
modified network structures to match the paper
|
2023-02-18 10:13:02 +09:00 |
|
NM512
|
fb5c21557a
|
Initial Commit
|
2023-02-12 22:35:25 +09:00 |
|