diff --git a/README.md b/README.md index 1d728ba..d82b9df 100644 --- a/README.md +++ b/README.md @@ -52,6 +52,8 @@ YouQiaoben: data/Batch, implement num_timesteps, fix memory growth in num_episod ShihongSong: data/Replay; then adv_estimate.dqn after YongRen's DQN +HaoshengZou: collaborate mainly on Policy and losses; interfaces and architecture + Note: install openai/gym first to run the Atari environment; note that interfaces between modules may not be finalized; the management of placeholders and `feed_dict` may have to be done manually for the time being; Without preprocessing and other tricks, this example will not train to any meaningful results. Codes should past two tests: individual module test and run through this example code. \ No newline at end of file