haoshengzou
|
75e7f14051
|
towards ddpg
|
2018-03-28 18:47:41 +08:00 |
|
haoshengzou
|
52e6b09768
|
finish ddpg. now ppo, actor-critic, dqn works. ddpg is not working, check!
|
2018-03-11 17:47:42 +08:00 |
|
haoshengzou
|
54a7b1343d
|
design exploration and evaluators for off-policy algos
|
2018-03-04 13:53:29 +08:00 |
|
Dong Yan
|
528c4be93c
|
add render option for ddpg
|
2018-02-28 18:44:06 +08:00 |
|
haoshengzou
|
f32e1d9c12
|
finish ddpg example. all examples under examples/ (except those containing 'contrib' and 'fail') can run! advantage estimation module is not complete yet.
|
2018-01-18 17:38:52 +08:00 |
|