Tianshou

Author	SHA1	Message	Date
youkaichao	a9f9940d17	code refactor for venv (#179 ) - Refacor code to remove duplicate code - Enable async simulation for all vector envs - Remove `collector.close` and rename `VectorEnv` to `DummyVectorEnv` The abstraction of vector env changed. Prior to this pr, each vector env is almost independent. After this pr, each env is wrapped into a worker, and vector envs differ with their worker type. In fact, users can just use `BaseVectorEnv` with different workers, I keep `SubprocVectorEnv`, `ShmemVectorEnv` for backward compatibility. Co-authored-by: n+e <463003665@qq.com> Co-authored-by: magicly <magicly007@gmail.com>	2020-08-19 15:00:24 +08:00
youkaichao	7f3b817b24	add policy.update to enable post process and remove collector.sample (#180 ) * add policy.update to enable post process and remove collector.sample * update doc in policy concept * remove collector.sample in doc * doc update of concepts * docs * polish * polish policy * remove collector.sample in docs * minor fix * Apply suggestions from code review just a test * doc fix Co-authored-by: Trinkle23897 <463003665@qq.com>	2020-08-15 16:10:42 +08:00
ChenDRAG	f2bcc55a25	ShmemVectorEnv Implementation (#174 ) * add shmem vecenv, some add&fix in test_env * generalize test_env IO * pep8 fix * comment update * style change * pep8 fix * style fix * minor fix * fix a bug * test fix * change env * testenv bug fix& shmem support recurse dict * bugfix * pep8 fix * _NP_TO_CT enhance * doc update * docstring update * pep8 fix * style change * style fix * remove assert * minor Co-authored-by: Trinkle23897 <463003665@qq.com>	2020-08-04 13:39:05 +08:00
Alexis DUBURCQ	e024afab8c	Asynchronous sampling vector environment (#134 ) Fix #103 Co-authored-by: youkaichao <youkaichao@126.com> Co-authored-by: Trinkle23897 <463003665@qq.com>	2020-07-26 18:01:21 +08:00
n+e	38a95c19da	Yet another 3 fix (#160 ) 1. DQN learn should keep eps=0 2. Add a warning of env.seed in VecEnv 3. fix #162 of multi-dim action	2020-07-24 17:38:12 +08:00
youkaichao	bfeffe1f97	unify single-env and multi-env in collector (#157 ) Unify the implementation with multi-environments (wrap a single environment in a multi-environment with one envs) to greatly simplify the code. This changed the behavior of single-environment. Prior to this pr, for single environment, collector.collect(n_step=n) will step n steps. After this pr, for single environment, collector.collect(n_step=n) will step m episodes until the steps are greater than n. That is to say, collectors now always collect full episodes.	2020-07-23 16:40:53 +08:00
youkaichao	26fb87433d	Improve collector (#125 ) * remove multibuf * reward_metric * make fileds with empty Batch rather than None after reset * many fixes and refactor Co-authored-by: Trinkle23897 <463003665@qq.com>	2020-07-13 17:33:01 +08:00
Alexis DUBURCQ	ec270759ab	Batch refactoring (#87 ) * Enable to stack Batch instances. Add Batch cat static method. Rename cat in cat_ since inplace. * Properly handle Batch init using np.array of dict. * WIP * Get rid of metadata. * Update UT. Replace cat by cat_ everywhere. * Do not sort Batch keys anymore for efficiency. Add items method. * Fix cat copy issue. * Add unit test to chack cat and stack methods. * Remove used import. * Fix linter issues. * Fix unit tests. Co-authored-by: Alexis Duburcq <alexis.duburcq@wandercraft.eu>	2020-06-23 22:50:59 +08:00
Trinkle23897	1a914336f7	add random action in collector (fix #78 )	2020-06-11 08:57:37 +08:00
Trinkle23897	f1951780ab	fix a bug of storing batch over batch data into buffer	2020-06-09 18:46:14 +08:00
Trinkle23897	560116d0b2	cheat sheet	2020-06-08 21:53:00 +08:00
Trinkle23897	075825325e	add preprocess_fn (#42 )	2020-05-05 13:39:51 +08:00
Trinkle23897	bb2f833d0e	support Batch of Batch and fix bugs (#38 )	2020-04-29 12:14:53 +08:00
Trinkle23897	80d661907e	Multimodal obs (#38 , #27 , #25 )	2020-04-28 20:56:02 +08:00
Trinkle23897	680fc0ffbe	gae	2020-04-14 21:11:06 +08:00
Trinkle23897	74407e13da	env info log_fn (#28 )	2020-04-10 18:02:05 +08:00
Trinkle23897	3cc22b7c0c	__call__ -> forward	2020-04-10 10:47:16 +08:00
Trinkle23897	13086b7f64	add ignore_obs_next in buffer	2020-04-10 09:01:17 +08:00
Minghao Zhang	3c0a09fefd	minor reformat (#2 ) * update atari.py * fix setup.py pass the pytest * fix setup.py pass the pytest	2020-03-26 09:01:20 +08:00
Trinkle23897	fdc969b830	fix collector	2020-03-25 14:08:28 +08:00

20 Commits