Dominik Jain
|
367778d37f
|
Improve high-level policy parametrisation
Policy objects are now parametrised by converting the parameter
dataclass instances to kwargs, using some injectable conversions
along the way
|
2023-10-18 20:44:16 +02:00 |
|
Dominik Jain
|
37dc07e487
|
Add high-level experiment builder interface
|
2023-10-18 20:44:05 +02:00 |
|
Dominik Jain
|
3fd60f9e70
|
Unify PPO configuration objects, use experiment-specific configuration
in mujoco_ppo_hl
|
2023-10-09 13:02:29 +02:00 |
|
Dominik Jain
|
8ec42009cb
|
Move RLSamplingConfig to separate module config, fixing cyclic import
|
2023-10-09 13:02:23 +02:00 |
|
Dominik Jain
|
d26b8cb40c
|
Use experiment-specific config in mujoco_sac_hl, adding auto-alpha
|
2023-10-09 13:02:18 +02:00 |
|
Dominik Jain
|
adc324038a
|
Remove LoggerConfig
|
2023-10-09 13:02:13 +02:00 |
|
Dominik Jain
|
997b520580
|
Refactoring, dropping package config
|
2023-10-09 13:02:07 +02:00 |
|
Dominik Jain
|
316eb3c579
|
Add SAC high-level interface
|
2023-10-09 13:02:01 +02:00 |
|