sproblvem
|
acb93502cf
|
Update README.md
change "Framework" to "Task"
|
2020-03-27 16:52:07 +08:00 |
|
Trinkle23897
|
044aae4355
|
add baseline and rlpyt result
|
2020-03-27 16:24:07 +08:00 |
|
Trinkle23897
|
44f911bc31
|
add pytorch drl result
|
2020-03-27 09:04:29 +08:00 |
|
Trinkle23897
|
519f9f20d0
|
update readme
|
2020-03-26 17:32:51 +08:00 |
|
Trinkle23897
|
c505cd8205
|
update readme
|
2020-03-26 11:42:34 +08:00 |
|
Minghao Zhang
|
3c0a09fefd
|
minor reformat (#2)
* update atari.py
* fix setup.py
pass the pytest
* fix setup.py
pass the pytest
|
2020-03-26 09:01:20 +08:00 |
|
Trinkle23897
|
fdc969b830
|
fix collector
|
2020-03-25 14:08:28 +08:00 |
|
Trinkle23897
|
e95218e295
|
sac
|
2020-03-23 17:17:41 +08:00 |
|
Trinkle23897
|
30a0fc079c
|
td3
|
2020-03-23 11:34:52 +08:00 |
|
Trinkle23897
|
a87563b8e6
|
add demo of ppo continuous action task
|
2020-03-21 17:04:42 +08:00 |
|
Trinkle23897
|
c173f7bfbc
|
fix ddpg
|
2020-03-21 15:31:31 +08:00 |
|
Trinkle23897
|
8bd8246b16
|
refract test code
|
2020-03-21 10:58:01 +08:00 |
|
Trinkle23897
|
d64d78d769
|
seed???
|
2020-03-20 21:51:09 +08:00 |
|
Trinkle23897
|
75364cd986
|
ppo and early stop
|
2020-03-20 19:52:29 +08:00 |
|
Trinkle23897
|
c87fe3c18c
|
add trainer
|
2020-03-19 17:23:46 +08:00 |
|
Trinkle23897
|
9c5417dd51
|
change env to vecenv for higher code coverage rate
|
2020-03-18 21:56:03 +08:00 |
|
Trinkle23897
|
64bab0b6a0
|
ddpg
|
2020-03-18 21:45:41 +08:00 |
|
Trinkle23897
|
6e563fe61a
|
a2c
|
2020-03-17 20:22:37 +08:00 |
|
Trinkle23897
|
fd621971e5
|
fix bug in test
|
2020-03-17 15:16:30 +08:00 |
|
Trinkle23897
|
39de63592f
|
finish pg
|
2020-03-17 11:37:31 +08:00 |
|
Trinkle23897
|
8b0b970c9b
|
add speed stat
|
2020-03-16 15:04:58 +08:00 |
|
Trinkle23897
|
cef5de8b83
|
fix some bugs
|
2020-03-16 11:11:29 +08:00 |
|
Trinkle23897
|
5983c6b33d
|
finish dqn
|
2020-03-15 17:41:00 +08:00 |
|
Trinkle23897
|
c804662457
|
add cache buf in collector
|
2020-03-14 21:48:31 +08:00 |
|
Trinkle23897
|
543e57cdbd
|
clear
|
2020-03-13 21:47:17 +08:00 |
|
Trinkle23897
|
f16e05c0e7
|
maybe finished collector?
|
2020-03-13 17:49:22 +08:00 |
|
Trinkle23897
|
f58c1397c6
|
half of collector
|
2020-03-12 22:20:33 +08:00 |
|
Trinkle23897
|
4a1a7dd670
|
fix a bug
|
2020-03-11 18:02:19 +08:00 |
|
Trinkle23897
|
6632e47b9d
|
add test_buffer
|
2020-03-11 17:28:51 +08:00 |
|
Trinkle23897
|
04557fdb82
|
env test \ ray
|
2020-03-11 16:14:53 +08:00 |
|
Trinkle23897
|
7533e5b0ac
|
add first test
|
2020-03-11 10:56:38 +08:00 |
|
Trinkle23897
|
5550aed0a1
|
flake8 fix
|
2020-03-11 09:38:14 +08:00 |
|
Trinkle23897
|
776acd9f13
|
github ci
|
2020-03-11 09:18:28 +08:00 |
|
Trinkle23897
|
0dfb900e29
|
env and data
|
2020-03-11 09:09:56 +08:00 |
|
Trinkle23897
|
0c944eab68
|
init
|
2020-03-09 11:38:04 +08:00 |
|