34 Commits

Author SHA1 Message Date
NM512
7433d1e877 avoid ".to(device)" 2024-09-28 07:58:15 +09:00
NM512
59939222d1 clean code 2024-09-24 00:16:12 +09:00
NM512
2cfcaefea2 avoid mutable default argument 2024-03-11 06:21:35 +09:00
NM512
7f66ed5333 erased unused options 2024-01-05 23:23:09 +09:00
NM512
a27711ab96 limit action values in sampling stage 2024-01-05 11:42:45 +09:00
NM512
a9e85e8b7c modified weight initialization 2024-01-05 10:46:54 +09:00
NM512
78e86703f4 modified loss calculation 2024-01-05 10:44:04 +09:00
NM512
1002d8b115 avoid cyclic reference 2023-10-02 07:27:26 +09:00
NM512
f35480f2a6 policy is not given logs 2023-10-01 06:25:23 +09:00
NM512
d3576c5a98 added save and load for optimizers 2023-09-27 09:15:37 +09:00
NM512
16635df3e4 removed scheduling function 2023-09-26 20:58:55 +09:00
NM512
606ec8af8c added the option for a deterministic run 2023-08-16 21:46:06 +09:00
NM512
68096d1f62 added log for inventory items in minecraft 2023-08-16 15:52:33 +09:00
NM512
8c471e12d6 erased unnecessary lines of code 2023-08-05 21:11:34 +09:00
NM512
43e1b2ab88 fix bug when resetting envs at different time 2023-07-24 22:26:21 +09:00
NM512
12ed21e06d applied formatter 2023-07-23 22:02:06 +09:00
NM512
afa5ab988d introduced parallel processing for envs 2023-07-23 21:58:46 +09:00
NM512
106317015d erased unused lines of code 2023-07-22 21:20:55 +09:00
NM512
03d91cb2c1 make sure "is_first" is set 0 at beginning 2023-07-22 21:08:53 +09:00
NM512
f07d843953 erased unnecessary reward input 2023-07-22 20:53:43 +09:00
NM512
9ca5082da3 separated cache management of episode from env 2023-07-22 19:22:41 +09:00
NM512
88514ec022 removed unnecessary imports 2023-07-02 11:52:33 +09:00
NM512
0ae6d2d1e0 step-based counting 2023-07-02 11:51:11 +09:00
NM512
b408067d9a avoid DeprecationWarning 2023-06-18 17:18:24 +09:00
NM512
970d1dc3e9 bug fix of limits for trunc_normal_ 2023-06-17 15:28:26 +09:00
NM512
b984e69b6e added state input capability 2023-05-14 23:38:46 +09:00
NM512
e5e8bcb284 modified a variable name 2023-04-29 07:57:05 +09:00
NM512
1328ff1088 sampling from the replay buffer across episodes 2023-04-29 07:43:02 +09:00
NM512
432a359bcf put running episode into replay buffer 2023-04-24 06:25:17 +09:00
NM512
fba87a33e0 applied formatter to tools 2023-04-15 15:28:09 +09:00
NM512
55ed69bdf7 fix bug when using envs > 1 2023-04-15 15:25:25 +09:00
NM512
57ac1c11d3 replaced all tf function to torch 2023-04-03 08:06:34 +09:00
NM512
6273444394 modified based on author's implementation 2023-03-18 08:38:23 +09:00
NM512
fb5c21557a Initial Commit 2023-02-12 22:35:25 +09:00