23 Commits

Author SHA1 Message Date
NM512
59939222d1 clean code 2024-09-24 00:16:12 +09:00
NM512
4e50f302cd updated results 2024-03-11 06:22:09 +09:00
NM512
7f66ed5333 erased unused options 2024-01-05 23:23:09 +09:00
NM512
a27711ab96 limit action values in sampling stage 2024-01-05 11:42:45 +09:00
NM512
a9e85e8b7c modified weight initialization 2024-01-05 10:46:54 +09:00
NM512
78e86703f4 modified loss calculation 2024-01-05 10:44:04 +09:00
NM512
e0487f8206 merged action head into MLP and modified configs 2024-01-05 10:26:48 +09:00
NM512
e0f2017e28 unified the place to initialize the latents 2024-01-05 10:09:13 +09:00
NM512
8571cf656a modifications for minecraft 2023-08-05 21:13:57 +09:00
NM512
d94a719421 bug fix when using normal_1 2023-07-27 10:01:40 +09:00
NM512
5dce8cf13b added benchmark task Crafter 2023-06-18 00:02:22 +09:00
NM512
ccc98d46c9 applied weight_init for linear layer in Decoder 2023-06-17 15:32:27 +09:00
NM512
f7c505579c erased unnecessary lines 2023-06-17 15:27:09 +09:00
ktolnos
308eeff09a Fixed RSSM trajectory starts initialization. 2023-06-05 22:49:40 +03:00
NM512
0faa10ff46 expanded the supported image sizes 2023-05-21 22:00:59 +09:00
NM512
b8ef214efa bug fix for gym==0.19.0 2023-05-18 21:30:08 +09:00
NM512
b984e69b6e added state input capability 2023-05-14 23:38:46 +09:00
NM512
0eb66997fb learnable initial state options for RSSM 2023-04-29 07:54:03 +09:00
NM512
628b856c63 changed the discount head to predict terminal 2023-04-22 09:34:23 +09:00
NM512
942eae10a9 updated result, requirements and torch version 2023-03-24 07:51:57 +09:00
NM512
6273444394 modified based on author's implementation 2023-03-18 08:38:23 +09:00
NM512
f96ad071d1 modified network structures to match the paper 2023-02-18 10:13:02 +09:00
NM512
fb5c21557a Initial Commit 2023-02-12 22:35:25 +09:00