youkaichao bfeffe1f97
unify single-env and multi-env in collector (#157)
Unify the implementation with multi-environments (wrap a single environment in a multi-environment with one envs) to greatly simplify the code.

This changed the behavior of single-environment.
Prior to this pr, for single environment, collector.collect(n_step=n) will step n steps.
After this pr, for single environment, collector.collect(n_step=n) will step m episodes until the steps are greater than n.

That is to say, collectors now always collect full episodes.
2020-07-23 16:40:53 +08:00
..
2020-07-09 22:57:01 +08:00
2020-07-23 15:12:02 +08:00
2020-07-22 14:42:08 +08:00
2020-07-22 14:42:08 +08:00
2020-07-22 14:42:08 +08:00
2020-03-28 22:01:23 +08:00
2020-03-29 15:18:33 +08:00
2020-04-29 14:16:38 +08:00