minor fixed

rtz19970824 2017-12-10 13:34:07 +08:00
parent 18b3b0b850
commit 8de92378c2

@@ -44,8 +44,9 @@ Tianshou(天授) is a reinforcement learning platform. The following image illus
## TODO
Search based method parallel.
-<font color=Red> Write comments. Please do not use abbreviations unless others can know it well. (e.g. adv can short for advantage/adversarial, please use the full name instead) </font>
-<font color=Red> Please name the module formally. (e.g. use more lower case "_", I think a module called "Batch" seems terrible)</font>
+<font color=red> Write comments. Please do not use abbreviations unless others can know it well. (e.g. adv can short for advantage/adversarial, please use the full name instead) </font>
+<font color=red> Please name the module formally. (e.g. use more lower case "_", I think a module called "Batch" seems terrible)</font>
YongRen: Policy Wrapper, in order of Gaussian, DQN and DDPG