add some TODO
This commit is contained in:
parent
ec6114edf1
commit
18b3b0b850
@ -44,6 +44,9 @@ Tianshou(天授) is a reinforcement learning platform. The following image illus
|
||||
## TODO
|
||||
Search based method parallel.
|
||||
|
||||
<font color=Red> Write comments. Please do not use abbreviations unless others can know it well. (e.g. adv can short for advantage/adversarial, please use the full name instead) </font>
|
||||
<font color=Red> Please name the module formally. (e.g. use more lower case "_", I think a module called "Batch" seems terrible)</font>
|
||||
|
||||
YongRen: Policy Wrapper, in order of Gaussian, DQN and DDPG
|
||||
|
||||
TongzhengRen: losses, in order of ppo, pg, DQN, DDPG with management of placeholders
|
||||
|
@ -2,6 +2,7 @@ import numpy as np
|
||||
import gc
|
||||
|
||||
|
||||
# TODO: Refactor with tf.train.slice_input_producer, tf.train.Coordinator, tf.train.QueueRunner
|
||||
class Batch(object):
|
||||
"""
|
||||
class for batch datasets. Collect multiple states (actions, rewards, etc.) on-policy.
|
||||
|
Loading…
x
Reference in New Issue
Block a user