8 Commits

Author SHA1 Message Date
NM512
e5e8bcb284 modified a variable name 2023-04-29 07:57:05 +09:00
NM512
1328ff1088 sampling from the replay buffer across episodes 2023-04-29 07:43:02 +09:00
NM512
432a359bcf put running episode into replay buffer 2023-04-24 06:25:17 +09:00
NM512
fba87a33e0 applied formatter to tools 2023-04-15 15:28:09 +09:00
NM512
55ed69bdf7 fix bug when using envs > 1 2023-04-15 15:25:25 +09:00
NM512
57ac1c11d3 replaced all tf function to torch 2023-04-03 08:06:34 +09:00
NM512
6273444394 modified based on author's implementation 2023-03-18 08:38:23 +09:00
NM512
fb5c21557a Initial Commit 2023-02-12 22:35:25 +09:00