18 Commits

Author SHA1 Message Date
张德祥
ea446adaf4 mem maze env ok 1 2023-06-17 23:29:53 +08:00
张德祥
1cf0149c10 env v0.13 2023-06-14 20:22:17 +08:00
张德祥
b9120a7440 env v0.12 2023-06-13 21:39:04 +08:00
张德祥
7879c6cfe7 env v01 2023-06-13 09:58:03 +08:00
NM512
0faa10ff46 expanded the supported image sizes 2023-05-21 22:00:59 +09:00
NM512
7e67dc6910 set default precision as 32 2023-05-17 22:16:55 +09:00
NM512
b984e69b6e added state input capability 2023-05-14 23:38:46 +09:00
NM512
3ebb8ad617 updated README 2023-05-05 18:21:19 +09:00
NM512
0eb66997fb learnable initial state options for RSSM 2023-04-29 07:54:03 +09:00
NM512
1328ff1088 sampling from the replay buffer across episodes 2023-04-29 07:43:02 +09:00
NM512
628b856c63 changed the discount head to predict terminal 2023-04-22 09:34:23 +09:00
NM512
1e070a3daf cleaned up envs 2023-04-15 23:16:43 +09:00
NM512
55ed69bdf7 fix bug when using envs > 1 2023-04-15 15:25:25 +09:00
NM512
cd935b7dd9 set default replay buffer size as 1M 2023-04-05 21:38:51 +09:00
NM512
942eae10a9 updated result, requirements and torch version 2023-03-24 07:51:57 +09:00
NM512
6273444394 modified based on author's implementation 2023-03-18 08:38:23 +09:00
NM512
f96ad071d1 modified network structures to match the paper 2023-02-18 10:13:02 +09:00
NM512
fb5c21557a Initial Commit 2023-02-12 22:35:25 +09:00