Dominik Jain
41bd463a7b
Allow configuring the activation function in default networks
...
* Set ReLU as default in all actor and critic factories
* Configure non-default in applicable MuJoCo examples
2023-10-18 20:44:18 +02:00
Dominik Jain
ed06ab7ff0
Handle obs_norm setting in MuJoCo envs
2023-10-18 20:44:18 +02:00
Dominik Jain
6bb3abb2f0
Support PG/Reinforce in high-level API
...
* Add example mujoco_reinforce_hl
* Extend functionality of ActorFactory to support creation of ModuleOpt
2023-10-18 20:44:17 +02:00
Dominik Jain
a161a9cf58
Improve type annotations, fix type issues and add checks
2023-10-18 20:44:17 +02:00
Dominik Jain
1243894eb8
Add DistributionFunctionFactory subclasses for discrete/continuous default
2023-10-18 20:44:17 +02:00
Dominik Jain
837ff13c04
Reorder ExperimentBuilder args (EnvFactory first)
2023-10-18 20:44:17 +02:00
Dominik Jain
d269063e6a
Remove 'RL' prefix from class names
2023-10-18 20:44:17 +02:00
Dominik Jain
9f0a410bb1
Log full experiment configuration, adding string representations to relevant classes
2023-10-18 20:44:16 +02:00
Dominik Jain
cd79cf8661
Add A2C high-level API
...
* Add common base class for A2C and PPO agent factories
* Add default for dist_fn parameter, adding corresponding factories
* Add example mujoco_a2c_hl
2023-10-18 20:44:16 +02:00
Dominik Jain
78b6dd1f49
Adapt class naming scheme
...
* Use prefix convention (subclasses have superclass names as prefix) to
facilitate discoverability of relevant classes via IDE autocompletion
* Use dual naming, adding an alternative concise name that omits the
precise OO semantics and retains only the essential part of the name
(which can be more pleasing to users not accustomed to
convoluted OO naming)
2023-10-18 20:44:16 +02:00
Dominik Jain
e993425aa1
Add high-level API support for TD3
...
* Create mixins for agent factories to reduce code duplication
* Further factorised params & mixins for experiment factories
* Additional parameter abstractions
* Implement high-level MuJoCo TD3 example
2023-10-18 20:44:16 +02:00
Dominik Jain
367778d37f
Improve high-level policy parametrisation
...
Policy objects are now parametrised by converting the parameter
dataclass instances to kwargs, using some injectable conversions
along the way
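
The following is a minimal sketch of the idea described above, not the code from this commit: a parameter dataclass is turned into a kwargs dict, and a list of injected conversion callables adjusts that dict before it would be passed to a policy constructor. All names here (HypotheticalParams, drop_lr, rename_noise, params_to_kwargs) are hypothetical and chosen only for illustration.

```python
from dataclasses import asdict, dataclass
from typing import Any, Callable, Dict, List


# Hypothetical parameter object; field names are illustrative assumptions.
@dataclass
class HypotheticalParams:
    lr: float = 1e-3
    gamma: float = 0.99
    action_noise_std: float = 0.1


# A conversion mutates the kwargs dict in place.
Conversion = Callable[[Dict[str, Any]], None]


def drop_lr(kwargs: Dict[str, Any]) -> None:
    # Assumption: the learning rate is consumed elsewhere (e.g. by an optimizer),
    # so it is removed before the kwargs reach the policy.
    kwargs.pop("lr", None)


def rename_noise(kwargs: Dict[str, Any]) -> None:
    # Assumption: the policy expects a differently named keyword argument.
    kwargs["noise_std"] = kwargs.pop("action_noise_std")


def params_to_kwargs(params: Any, conversions: List[Conversion]) -> Dict[str, Any]:
    # Convert the dataclass instance to a plain dict, then apply each
    # injected conversion in order.
    kwargs = asdict(params)
    for convert in conversions:
        convert(kwargs)
    return kwargs


policy_kwargs = params_to_kwargs(HypotheticalParams(), [drop_lr, rename_noise])
```

Under these assumptions, policy_kwargs could then be unpacked into a policy constructor with `**policy_kwargs`; injecting the conversions keeps the parameter dataclass free of policy-specific details.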
2023-10-18 20:44:16 +02:00
Dominik Jain
37dc07e487
Add high-level experiment builder interface
2023-10-18 20:44:05 +02:00
Dominik Jain
3fd60f9e70
Unify PPO configuration objects, use experiment-specific configuration in mujoco_ppo_hl
2023-10-09 13:02:29 +02:00
Dominik Jain
8ec42009cb
Move RLSamplingConfig to separate module config, fixing cyclic import
2023-10-09 13:02:23 +02:00
Dominik Jain
adc324038a
Remove LoggerConfig
2023-10-09 13:02:13 +02:00
Dominik Jain
997b520580
Refactoring, dropping package config
2023-10-09 13:02:07 +02:00
Dominik Jain
316eb3c579
Add SAC high-level interface
2023-10-09 13:02:01 +02:00
Dominik Jain
16ed5fd2a5
Initial high-level interfaces, demonstrated in mujoco_ppo_hl
2023-10-09 13:01:35 +02:00