haoshengzou
|
909dc786d1
|
advantage estimation function all take my_feed_dict (all examples runnable); such requirement should be made a signature
|
2018-11-22 08:03:03 +08:00 |
|
haoshengzou
|
e6d477f9a3
|
modified top-level .gitignore to include tianshou/data
|
2018-01-25 16:08:04 +08:00 |
|
rtz19970824
|
fcaa571b42
|
add the interface in engine.py
|
2018-01-12 21:48:01 +08:00 |
|
rtz19970824
|
deea09b2b7
|
minor fixed
|
2017-12-23 14:45:07 +08:00 |
|
rtz19970824
|
c11eccbc90
|
implement the training process
|
2017-12-21 23:30:24 +08:00 |
|
Tongzheng Ren
|
0e1287b5cb
|
update gitignore
|
2017-12-18 23:34:32 +08:00 |
|
haosheng
|
a00b930c2c
|
fix naming and comments of coding style, delete .json
|
2017-12-10 17:23:13 +08:00 |
|
rtz19970824
|
ec6114edf1
|
rm ckpts
|
2017-12-09 21:53:12 +08:00 |
|
rtz19970824
|
0341e0d21e
|
modify
|
2017-12-09 21:42:52 +08:00 |
|
haosheng
|
ff4306ddb9
|
model-free rl first commit, with ppo_example.py in examples/ and task delegations in ppo_example.py and READMEs
|
2017-12-08 21:09:23 +08:00 |
|
rtz19970824
|
2f95a1d854
|
remove .swp
|
2017-11-28 15:04:00 +08:00 |
|
Dong Yan
|
2fc87f7020
|
add test interface for Network
|
2017-11-06 23:13:11 +08:00 |
|
Dong Yan
|
a8030c95f2
|
upload the architecture image
|
2017-11-06 15:56:16 +08:00 |
|
Tongzheng Ren
|
2734af8530
|
modify the network
|
2017-11-05 16:47:01 +08:00 |
|
Dong Yan
|
be2200c06f
|
add the gitignore file
|
2017-11-05 14:48:53 +08:00 |
|