Dominik Jain
73a6d15eee
Log Environments
2023-10-18 20:44:17 +02:00
Dominik Jain
a161a9cf58
Improve type annotations, fix type issues and add checks
2023-10-18 20:44:17 +02:00
Dominik Jain
6b6d9ea609
Add support for discrete PPO
...
* Refactor module `module` (split into submodules)
* Basic support for discrete environments
* Implement Atari environment factory
* Implement DQN-based actor factory
* Implement notion of reusing agent preprocessing network for critic
* Add example atari_ppo_hl
2023-10-18 20:44:16 +02:00
Michael Panchenko
5bcf514c55
Add alternative functional interface for environment creation
...
where a persistable configuration object is passed as an
argument, as this can help to ensure persistability (making the
requirement explicit)
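
For illustration only, a minimal sketch of what such a functional interface could look like. All names here (`EnvConfig`, `create_envs`, the fields) are hypothetical and not taken from the commit; the stand-in function returns a plain description instead of real environments, and JSON is assumed as the persistence format.

```python
import json
from dataclasses import asdict, dataclass


# Hypothetical configuration object: plain, serialisable fields only,
# so the persistability requirement is explicit in the type.
@dataclass(frozen=True)
class EnvConfig:
    task: str
    seed: int = 0
    num_train_envs: int = 1


def create_envs(config: EnvConfig) -> dict:
    """Hypothetical functional entry point: everything needed to build the
    environments comes from the persistable config argument."""
    # Stand-in for real environment construction.
    return {
        "task": config.task,
        "seeds": [config.seed + i for i in range(config.num_train_envs)],
    }


config = EnvConfig(task="Ant-v4", seed=42, num_train_envs=2)

# Round-trip through JSON to show that the config can be stored and restored.
restored = EnvConfig(**json.loads(json.dumps(asdict(config))))
envs = create_envs(restored)
```

Because the function depends only on its config argument, storing the config is enough to recreate the same environments later.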
2023-10-18 20:44:16 +02:00
Dominik Jain
e993425aa1
Add high-level API support for TD3
...
* Create mixins for agent factories to reduce code duplication
* Further factorise params & mixins for experiment factories
* Additional parameter abstractions
* Implement high-level MuJoCo TD3 example
2023-10-18 20:44:16 +02:00
Dominik Jain
367778d37f
Improve high-level policy parametrisation
...
Policy objects are now parametrised by converting the parameter
dataclass instances to kwargs, using some injectable conversions
along the way
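
A minimal sketch of the general idea, under stated assumptions: the names `PolicyParams`, `rename_lr`, `params_to_kwargs` and `Policy` are hypothetical and do not come from the commit, and the actual conversions in the code base may work differently.

```python
from dataclasses import asdict, dataclass
from typing import Any, Callable, Dict, List

# Hypothetical type: a conversion mutates the kwargs dict in place.
Conversion = Callable[[Dict[str, Any]], None]


@dataclass
class PolicyParams:
    """Hypothetical parameter dataclass."""
    lr: float = 1e-3
    hidden_sizes: tuple = (64, 64)


def rename_lr(kwargs: Dict[str, Any]) -> None:
    """Example conversion: map the dataclass field name to the
    keyword name expected by the policy constructor."""
    kwargs["learning_rate"] = kwargs.pop("lr")


def params_to_kwargs(params: Any, conversions: List[Conversion]) -> Dict[str, Any]:
    """Convert a dataclass instance to kwargs, applying the injected
    conversions in order."""
    kwargs = asdict(params)
    for convert in conversions:
        convert(kwargs)
    return kwargs


class Policy:
    """Stand-in for a policy class that takes keyword arguments."""

    def __init__(self, learning_rate: float, hidden_sizes: tuple) -> None:
        self.learning_rate = learning_rate
        self.hidden_sizes = hidden_sizes


kwargs = params_to_kwargs(PolicyParams(lr=3e-4), [rename_lr])
policy = Policy(**kwargs)
```

Passing the conversions in as a list keeps the dataclass free of constructor-specific details; a different policy class can reuse the same parameters with a different set of conversions.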
2023-10-18 20:44:16 +02:00
Dominik Jain
37dc07e487
Add high-level experiment builder interface
2023-10-18 20:44:05 +02:00
Dominik Jain
316eb3c579
Add SAC high-level interface
2023-10-09 13:02:01 +02:00
Dominik Jain
16ed5fd2a5
Initial high-level interfaces, demonstrated in mujoco_ppo_hl
2023-10-09 13:01:35 +02:00