Tianshou/tianshou.data.rst at 150d0ec51b77c30e0c3fd261e17d2639fe59b1a4 - Tianshou - Gitea: Git with a cup of tea

hongshaorou/Tianshou

ChenDRAG 150d0ec51b

Step collector implementation (#280 )

This is the third PR of 6 commits mentioned in #274, which features refactor of Collector to fix #245. You can check #274 for more detail.

Things changed in this PR:

1. refactor collector to be more cleaner, split AsyncCollector to support asyncvenv;
2. change buffer.add api to add(batch, bffer_ids); add several types of buffer (VectorReplayBuffer, PrioritizedVectorReplayBuffer, etc.)
3. add policy.exploration_noise(act, batch) -> act
4. small change in BasePolicy.compute_*_returns
5. move reward_metric from collector to trainer
6. fix np.asanyarray issue (different version's numpy will result in different output)
7. flake8 maxlength=88
8. polish docs and fix test

Co-authored-by: n+e <trinkle23897@gmail.com>

2021-02-19 10:33:49 +08:00

30 lines

358 B

ReStructuredText

Raw Blame History

 tianshou.data
 =============
 Batch
 -----
 .. automodule:: tianshou.data.batch
    :members:
    :undoc-members:
    :show-inheritance:
 Buffer
 ------
 .. automodule:: tianshou.data.buffer
    :members:
    :undoc-members:
    :show-inheritance:
 Collector
 ---------
 .. automodule:: tianshou.data.collector
    :members:
    :undoc-members:
    :show-inheritance: