27c1017259 add a detailed Chinese Google coding style for convenience
Tongzheng Ren
2017-12-18 23:32:41 +08:00
75bc2968d2 add a detailed Chinese Google coding style for convenience
Tongzheng Ren
2017-12-18 23:32:41 +08:00
d220f7f2a8 add comments and todos
宋世虹
2017-12-17 13:28:21 +08:00
7693c38f44 add comments and todos
宋世虹
2017-12-17 13:28:21 +08:00
3624cc9036 finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but it still needs refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensible; slightly refactored the code style of the codebase; more comments and todos will be in the next commit
宋世虹
2017-12-17 12:52:00 +08:00
62e2c6582d finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but it still needs refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensible; slightly refactored the code style of the codebase; more comments and todos will be in the next commit
宋世虹
2017-12-17 12:52:00 +08:00
31199c7d0d 0. code refactor, try to merge Go and GoEnv
Dong Yan
2017-12-16 23:29:11 +08:00
e10acf5130 0. code refactor, try to merge Go and GoEnv
Dong Yan
2017-12-16 23:29:11 +08:00
01c0c2483a check if the network weights exist for every player
Dong Yan
2017-12-16 14:55:19 +08:00
431f551ce9 check if the network weights exist for every player
Dong Yan
2017-12-16 14:55:19 +08:00
d115c586d4 start the player server in a more robust way.
Dong Yan
2017-12-16 14:33:31 +08:00
b8bdfea8bd start the player server in a more robust way.
Dong Yan
2017-12-16 14:33:31 +08:00
4fc50c5f1b merge class strategy with class game. Next, merge Go with GoEnv
Dong Yan
2017-12-15 22:19:44 +08:00
6cb4b02fca merge class strategy with class game. Next, merge Go with GoEnv
Dong Yan
2017-12-15 22:19:44 +08:00
d0bdccc25a assign TODO to Haosheng and Tongzheng
rtz19970824
2017-12-15 14:27:04 +08:00
00f599bba3 assign TODO to Haosheng and Tongzheng
rtz19970824
2017-12-15 14:27:04 +08:00
ff4306ddb9 model-free rl first commit, with ppo_example.py in examples/ and task delegations in ppo_example.py and READMEs
haosheng
2017-12-08 21:09:23 +08:00