This is the third PR of 6 commits mentioned in #274, which features a refactor of Collector to fix #245. See #274 for more detail.

Changes in this PR:

1. refactor the collector to be cleaner; split out AsyncCollector to support asyncvenv;
2. change the buffer.add API to add(batch, buffer_ids); add several types of buffer (VectorReplayBuffer, PrioritizedVectorReplayBuffer, etc.);
3. add policy.exploration_noise(act, batch) -> act;
4. small change in BasePolicy.compute_*_returns;
5. move reward_metric from collector to trainer;
6. fix the np.asanyarray issue (different numpy versions produce different output);
7. flake8 maxlength=88;
8. polish docs and fix tests.

Co-authored-by: n+e <trinkle23897@gmail.com>
27 lines
457 B
ReStructuredText
tianshou.utils
==============

.. automodule:: tianshou.utils
   :members:
   :undoc-members:
   :show-inheritance:


Pre-defined Networks
--------------------

.. automodule:: tianshou.utils.net.common
   :members:
   :undoc-members:
   :show-inheritance:

.. automodule:: tianshou.utils.net.discrete
   :members:
   :undoc-members:
   :show-inheritance:

.. automodule:: tianshou.utils.net.continuous
   :members:
   :undoc-members:
   :show-inheritance: