rtz19970824
|
e5bf7a9270
|
implement dqn loss and dpg loss, add TODO for separate actor and critic
|
2017-12-15 14:24:08 +08:00 |
|
Haosheng Zou
|
7ab211b63c
|
preliminary design of dqn_example, dqn interface. identify the assign of networks
|
2017-12-13 20:47:45 +08:00 |
|
haosheng
|
ff4306ddb9
|
model-free rl first commit, with ppo_example.py in examples/ and task delegations in ppo_example.py and READMEs
|
2017-12-08 21:09:23 +08:00 |
|
JialianLee
|
d9a50569f5
|
modification to docs of mcts
|
2017-11-18 09:37:15 +08:00 |
|
Tongzheng Ren
|
595e62e111
|
architecture design
|
2017-11-06 15:15:44 +08:00 |
|