19 Commits

Author SHA1 Message Date
NM512
12ed21e06d applied formatter 2023-07-23 22:02:06 +09:00
NM512
afa5ab988d introduced parallel processing for envs 2023-07-23 21:58:46 +09:00
NM512
106317015d erased unused lines of code 2023-07-22 21:20:55 +09:00
NM512
03d91cb2c1 make sure "is_first" is set 0 at beginning 2023-07-22 21:08:53 +09:00
NM512
f07d843953 erased unnecessary reward input 2023-07-22 20:53:43 +09:00
NM512
9ca5082da3 separated cache management of episode from env 2023-07-22 19:22:41 +09:00
NM512
88514ec022 removed unnecessary imports 2023-07-02 11:52:33 +09:00
NM512
0ae6d2d1e0 step-based counting 2023-07-02 11:51:11 +09:00
NM512
b408067d9a avoid DeprecationWarning 2023-06-18 17:18:24 +09:00
NM512
970d1dc3e9 bug fix of limits for trunc_normal_ 2023-06-17 15:28:26 +09:00
NM512
b984e69b6e added state input capability 2023-05-14 23:38:46 +09:00
NM512
e5e8bcb284 modified a variable name 2023-04-29 07:57:05 +09:00
NM512
1328ff1088 sampling from the replay buffer across episodes 2023-04-29 07:43:02 +09:00
NM512
432a359bcf put running episode into replay buffer 2023-04-24 06:25:17 +09:00
NM512
fba87a33e0 applied formatter to tools 2023-04-15 15:28:09 +09:00
NM512
55ed69bdf7 fix bug when using envs > 1 2023-04-15 15:25:25 +09:00
NM512
57ac1c11d3 replaced all tf function to torch 2023-04-03 08:06:34 +09:00
NM512
6273444394 modified based on author's implementation 2023-03-18 08:38:23 +09:00
NM512
fb5c21557a Initial Commit 2023-02-12 22:35:25 +09:00