Tianshou

Author	SHA1	Message	Date
Dominik Jain	05a8cf4e74	Refactoring, improving class name EnvFactoryGymnasium -> EnvFactoryRegistered	2024-01-16 14:52:31 +01:00
Dominik Jain	c9cb41bf55	Make envpool usage configuration more explicit	2024-01-16 14:52:31 +01:00
Dominik Jain	eaab7b0a4b	Improve environment factory abstractions in high-level API: * EnvFactory now uses the creation of a single environment as the basic functionality which the more high-level functions build upon * Introduce enum EnvMode to indicate the purpose for which an env is created, allowing the factory creation process to change its behaviour accordingly * Add EnvFactoryGymnasium to provide direct support for envs that can be created via gymnasium.make - EnvPool is supported via an injectable EnvPoolFactory - Existing EnvFactory implementations are now derived from EnvFactoryGymnasium * Use a separate environment (which uses new EnvMode.WATCH) for watching agent performance after training (instead of using test environments, which the user may want to configure differently)	2024-01-12 17:13:42 +01:00
Dominik Jain	dd4a0eb430	Fix: Add MujocoEnvObsRmsPersistence only if obs_norm is enabled	2023-10-24 13:52:30 +02:00
Dominik Jain	b5a891557f	Revert to simplified environment factory, removing unnecessary config object (configuration shall be part of the factory instance)	2023-10-24 13:14:23 +02:00
Dominik Jain	6cbee188b8	Change interface of EnvFactory to ensure that configuration of number of environments in SamplingConfig is used (values are now passed to factory method). This is clearer and removes the need to pass otherwise unnecessary configuration to environment factories at construction	2023-10-19 11:37:20 +02:00
Dominik Jain	ed06ab7ff0	Handle obs_norm setting in MuJoCo envs	2023-10-18 20:44:18 +02:00
Dominik Jain	76e870207d	Improve persistence handling * Add persistence/restoration of Experiment instance * Add file logging in experiment * Allow all persistence/logging to be disabled * Disable persistence in tests	2023-10-18 20:44:18 +02:00
Dominik Jain	3691ed2abc	Support obs_rms persistence for MuJoCo by adding a general mechanism for attaching persistence to Environments instances	2023-10-18 20:44:17 +02:00
Dominik Jain	d269063e6a	Remove 'RL' prefix from class names	2023-10-18 20:44:17 +02:00
Michael Panchenko	5bcf514c55	Add alternative functional interface for environment creation where a persistable configuration object is passed as an argument, as this can help to ensure persistability (making the requirement explicit)	2023-10-18 20:44:16 +02:00
Dominik Jain	3fd60f9e70	Unify PPO configuration objects, use experiment-specific configuration in mujoco_ppo_hl	2023-10-09 13:02:29 +02:00
Dominik Jain	8ec42009cb	Move RLSamplingConfig to separate module config, fixing cyclic import	2023-10-09 13:02:23 +02:00
Dominik Jain	997b520580	Refactoring, dropping package config	2023-10-09 13:02:07 +02:00
Dominik Jain	316eb3c579	Add SAC high-level interface	2023-10-09 13:02:01 +02:00
Dominik Jain	16ed5fd2a5	Initial high-level interfaces, demonstrated in mujoco_ppo_hl	2023-10-09 13:01:35 +02:00
Michael Panchenko	a54aade730	Addition of dataclasses based config for scripts, major refactoring. So far only for one script (mujoco_ppo_cfg), extension will follow. Conflicts: examples/mujoco/mujoco_env.py, examples/mujoco/mujoco_ppo.py, setup.py	2023-10-09 13:01:27 +02:00
Michael Panchenko	600f4bbd55	Python 3.9, black + ruff formatting (#921) Preparation for #914 and #920 Changes formatting to ruff and black. Remove python 3.8 ## Additional Changes - Removed flake8 dependencies - Adjusted pre-commit. Now CI and Make use pre-commit, reducing the duplication of linting calls - Removed check-docstyle option (ruff is doing that) - Merged format and lint. In CI the format-lint step fails if any changes are done, so it fulfills the lint functionality. --------- Co-authored-by: Jiayi Weng <jiayi@openai.com>	2023-08-25 14:40:56 -07:00
Markus Krimmel	6c6c872523	Gymnasium Integration (#789) Changes: - Disclaimer in README - Replaced all occurrences of Gym with Gymnasium - Removed code that is now dead since we no longer need to support the old step API - Updated type hints to only allow new step API - Increased required version of envpool to support Gymnasium - Increased required version of PettingZoo to support Gymnasium - Updated `PettingZooEnv` to only use the new step API, removed hack to also support old API - I had to add some `# type: ignore` comments, due to new type hinting in Gymnasium. I'm not that familiar with type hinting but I believe that the issue is on the Gymnasium side and we are looking into it. - Had to update `MyTestEnv` to support `options` kwarg - Skip NNI tests because they still use OpenAI Gym - Also allow `PettingZooEnv` in vector environment - Updated doc page about ReplayBuffer to also talk about terminated and truncated flags. Still need to do: - Update the Jupyter notebooks in docs - Check the entire code base for more dead code (from compatibility stuff) - Check the reset functions of all environments/wrappers in code base to make sure they use the `options` kwarg - Someone might want to check test_env_finite.py - Is it okay to allow `PettingZooEnv` in vector environments? Might need to update docs?	2023-02-03 11:57:27 -08:00
Jiayi Weng	109875d43d	Fix num_envs=test_num (#653) * fix num_envs=test_num * fix mypy	2022-05-30 12:38:47 +08:00
Michal Gregor	c87b9f49bc	Add show_progress option for trainer (#641) - A DummyTqdm class added to utils: it replicates the interface used by trainers, but does not show the progress bar; - Added a show_progress argument to the base trainer: when show_progress == False, DummyTqdm is used in place of tqdm.	2022-05-17 23:41:59 +08:00
Jiayi Weng	2a7c151738	Add vecenv wrappers for obs_norm to support running mujoco experiment with envpool (#628) - add VectorEnvWrapper and VectorEnvNormObs - obs_rms store in policy save/load - align mujoco scripts with atari: obs_norm, envpool, wandb and README	2022-05-05 19:55:15 +08:00

22 Commits