13 Commits

Author  SHA1        Message                                          Date
NM512   88514ec022  removed unnecessary imports                      2023-07-02 11:52:33 +09:00
NM512   0ae6d2d1e0  step-based counting                              2023-07-02 11:51:11 +09:00
NM512   b408067d9a  avoid DeprecationWarning                         2023-06-18 17:18:24 +09:00
NM512   970d1dc3e9  bug fix of limits for trunc_normal_              2023-06-17 15:28:26 +09:00
NM512   b984e69b6e  added state input capability                     2023-05-14 23:38:46 +09:00
NM512   e5e8bcb284  modified a variable name                         2023-04-29 07:57:05 +09:00
NM512   1328ff1088  sampling from the replay buffer across episodes  2023-04-29 07:43:02 +09:00
NM512   432a359bcf  put running episode into replay buffer           2023-04-24 06:25:17 +09:00
NM512   fba87a33e0  applied formatter to tools                       2023-04-15 15:28:09 +09:00
NM512   55ed69bdf7  fix bug when using envs > 1                      2023-04-15 15:25:25 +09:00
NM512   57ac1c11d3  replaced all tf function to torch                2023-04-03 08:06:34 +09:00
NM512   6273444394  modified based on author's implementation        2023-03-18 08:38:23 +09:00
NM512   fb5c21557a  Initial Commit                                   2023-02-12 22:35:25 +09:00