NM512 | 7433d1e877 | avoid ".to(device)" | 2024-09-28 07:58:15 +09:00
NM512 | 59939222d1 | clean code | 2024-09-24 00:16:12 +09:00
NM512 | 2cfcaefea2 | avoid mutable default argument | 2024-03-11 06:21:35 +09:00
NM512 | 7f66ed5333 | erased unused options | 2024-01-05 23:23:09 +09:00
NM512 | a27711ab96 | limit action values in sampling stage | 2024-01-05 11:42:45 +09:00
NM512 | a9e85e8b7c | modified weight initialization | 2024-01-05 10:46:54 +09:00
NM512 | 78e86703f4 | modified loss calculation | 2024-01-05 10:44:04 +09:00
NM512 | 1002d8b115 | avoid cyclic reference | 2023-10-02 07:27:26 +09:00
NM512 | f35480f2a6 | policy is not given logs | 2023-10-01 06:25:23 +09:00
NM512 | d3576c5a98 | added save and load for optimizers | 2023-09-27 09:15:37 +09:00
NM512 | 16635df3e4 | removed scheduling function | 2023-09-26 20:58:55 +09:00
NM512 | 606ec8af8c | added the option for a deterministic run | 2023-08-16 21:46:06 +09:00
NM512 | 68096d1f62 | added log for inventory items in minecraft | 2023-08-16 15:52:33 +09:00
NM512 | 8c471e12d6 | erased unnecessary lines of code | 2023-08-05 21:11:34 +09:00
NM512 | 43e1b2ab88 | fix bug when resetting envs at different time | 2023-07-24 22:26:21 +09:00
NM512 | 12ed21e06d | applied formatter | 2023-07-23 22:02:06 +09:00
NM512 | afa5ab988d | introduced parallel processing for envs | 2023-07-23 21:58:46 +09:00
NM512 | 106317015d | erased unused lines of code | 2023-07-22 21:20:55 +09:00
NM512 | 03d91cb2c1 | make sure "is_first" is set 0 at beginning | 2023-07-22 21:08:53 +09:00
NM512 | f07d843953 | erased unnecessary reward input | 2023-07-22 20:53:43 +09:00
NM512 | 9ca5082da3 | separated cache management of episode from env | 2023-07-22 19:22:41 +09:00
NM512 | 88514ec022 | removed unnecessary imports | 2023-07-02 11:52:33 +09:00
NM512 | 0ae6d2d1e0 | step-based counting | 2023-07-02 11:51:11 +09:00
NM512 | b408067d9a | avoid DeprecationWarning | 2023-06-18 17:18:24 +09:00
NM512 | 970d1dc3e9 | bug fix of limits for trunc_normal_ | 2023-06-17 15:28:26 +09:00
NM512 | b984e69b6e | added state input capability | 2023-05-14 23:38:46 +09:00
NM512 | e5e8bcb284 | modified a variable name | 2023-04-29 07:57:05 +09:00
NM512 | 1328ff1088 | sampling from the replay buffer across episodes | 2023-04-29 07:43:02 +09:00
NM512 | 432a359bcf | put running episode into replay buffer | 2023-04-24 06:25:17 +09:00
NM512 | fba87a33e0 | applied formatter to tools | 2023-04-15 15:28:09 +09:00
NM512 | 55ed69bdf7 | fix bug when using envs > 1 | 2023-04-15 15:25:25 +09:00
NM512 | 57ac1c11d3 | replaced all tf function to torch | 2023-04-03 08:06:34 +09:00
NM512 | 6273444394 | modified based on author's implementation | 2023-03-18 08:38:23 +09:00
NM512 | fb5c21557a | Initial Commit | 2023-02-12 22:35:25 +09:00