Tianshou/tianshou/data/replay_buffer/buffer.py

class ReplayBuffer(object):
    def __init__(self, env, policy, qnet, target_qnet, conf):
        """
		Initialize a replay buffer with parameters in conf.
		"""
        pass

    def add(self, data, priority):
        """
		Add a data with priority = priority to replay buffer.
		"""
        pass

    def collect(self):
        """
		Collect data from current environment and policy.
		"""
        pass

    def next_batch(self, batch_size):
        """
		get batch of data from the replay buffer.
		"""
        pass

    def update_priority(self, indices, priorities):
        """
		Update the data's priority whose indices = indices.
		For proportional replay buffer, the priority is the priority.
		For rank based replay buffer, the priorities parameter will be the delta used to update the priority.
		"""
        pass

    def reset_alpha(self, alpha):
        """
		This function only works for proportional replay buffer.
		This function resets alpha.
		"""
        pass

    def sample(self, conf):
        """
		Sample from replay buffer with parameters in conf.
		"""
        pass

    def rebalance(self):
        """
		This is for rank based priority replay buffer, which is used to rebalance the sum tree of the priority queue.
		"""
        pass
replay buffer initial commit 2017-12-10 14:53:57 +08:00			`class ReplayBuffer(object):`
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`def __init__(self, env, policy, qnet, target_qnet, conf):`
			`"""`
replay buffer initial commit 2017-12-10 14:53:57 +08:00			`Initialize a replay buffer with parameters in conf.`
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`"""`
			`pass`
replay buffer initial commit 2017-12-10 14:53:57 +08:00
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`def add(self, data, priority):`
			`"""`
replay buffer initial commit 2017-12-10 14:53:57 +08:00			`Add a data with priority = priority to replay buffer.`
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`"""`
			`pass`
replay buffer initial commit 2017-12-10 14:53:57 +08:00
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`def collect(self):`
			`"""`
			`Collect data from current environment and policy.`
			`"""`
			`pass`

			`def next_batch(self, batch_size):`
			`"""`
			`get batch of data from the replay buffer.`
			`"""`
			`pass`

			`def update_priority(self, indices, priorities):`
			`"""`
replay buffer initial commit 2017-12-10 14:53:57 +08:00			`Update the data's priority whose indices = indices.`
			`For proportional replay buffer, the priority is the priority.`
			`For rank based replay buffer, the priorities parameter will be the delta used to update the priority.`
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`"""`
			`pass`
replay buffer initial commit 2017-12-10 14:53:57 +08:00
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`def reset_alpha(self, alpha):`
			`"""`
replay buffer initial commit 2017-12-10 14:53:57 +08:00			`This function only works for proportional replay buffer.`
			`This function resets alpha.`
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`"""`
			`pass`
replay buffer initial commit 2017-12-10 14:53:57 +08:00
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`def sample(self, conf):`
			`"""`
replay buffer initial commit 2017-12-10 14:53:57 +08:00			`Sample from replay buffer with parameters in conf.`
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`"""`
			`pass`
replay buffer initial commit 2017-12-10 14:53:57 +08:00
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`def rebalance(self):`
			`"""`
replay buffer initial commit 2017-12-10 14:53:57 +08:00			`This is for rank based priority replay buffer, which is used to rebalance the sum tree of the priority queue.`
finished very naive dqn: changed the interface of replay buffer by adding collect and next_batch, but still need refactoring; added implementation of dqn.py, but still need to consider the interface to make it more extensive; slightly refactored the code style of the codebase; more comments and todos will be in the next commit 2017-12-17 12:52:00 +08:00			`"""`
			`pass`