Dominik Jain
|
6bb3abb2f0
|
Support PG/Reinforce in high-level API
* Add example mujoco_reinforce_hl
* Extend ActorFactory to support creation of ModuleOpt
|
2023-10-18 20:44:17 +02:00 |
|
Dominik Jain
|
a161a9cf58
|
Improve type annotations, fix type issues and add checks
|
2023-10-18 20:44:17 +02:00 |
|
Dominik Jain
|
9f0a410bb1
|
Log full experiment configuration, adding string representations to relevant classes
|
2023-10-18 20:44:16 +02:00 |
|
Dominik Jain
|
6b6d9ea609
|
Add support for discrete PPO
* Refactor module `module` (split into submodules)
* Basic support for discrete environments
* Implement Atari environment factory
* Implement DQN-based actor factory
* Implement notion of reusing agent preprocessing network for critic
* Add example atari_ppo_hl
|
2023-10-18 20:44:16 +02:00 |
|