add some TODO
This commit is contained in:
parent
ec6114edf1
commit
18b3b0b850
@ -44,6 +44,9 @@ Tianshou(天授) is a reinforcement learning platform. The following image illus
|
|||||||
## TODO
|
## TODO
|
||||||
Search based method parallel.
|
Search based method parallel.
|
||||||
|
|
||||||
|
<font color=Red> Write comments. Please do not use abbreviations unless others can know it well. (e.g. adv can short for advantage/adversarial, please use the full name instead) </font>
|
||||||
|
<font color=Red> Please name the module formally. (e.g. use more lower case "_", I think a module called "Batch" seems terrible)</font>
|
||||||
|
|
||||||
YongRen: Policy Wrapper, in order of Gaussian, DQN and DDPG
|
YongRen: Policy Wrapper, in order of Gaussian, DQN and DDPG
|
||||||
|
|
||||||
TongzhengRen: losses, in order of ppo, pg, DQN, DDPG with management of placeholders
|
TongzhengRen: losses, in order of ppo, pg, DQN, DDPG with management of placeholders
|
||||||
|
@ -2,6 +2,7 @@ import numpy as np
|
|||||||
import gc
|
import gc
|
||||||
|
|
||||||
|
|
||||||
|
# TODO: Refactor with tf.train.slice_input_producer, tf.train.Coordinator, tf.train.QueueRunner
|
||||||
class Batch(object):
|
class Batch(object):
|
||||||
"""
|
"""
|
||||||
class for batch datasets. Collect multiple states (actions, rewards, etc.) on-policy.
|
class for batch datasets. Collect multiple states (actions, rewards, etc.) on-policy.
|
||||||
|
Loading…
x
Reference in New Issue
Block a user