Dominik Jain
dae4000cd2
Revert "Depend on sensAI instead of copying its utils (logging, string)"
...
This reverts commit fdb0eba93d81fa5e698770b4f7088c87fc1238da.
2023-11-08 19:11:39 +01:00
Dominik Jain
fdb0eba93d
Depend on sensAI instead of copying its utils (logging, string)
2023-10-27 20:15:58 +02:00
Dominik Jain
d684dae6cd
Change default number of environments (train=#CPUs, test=1)
2023-10-26 12:50:08 +02:00
Dominik Jain
e63d8d4147
Use ToStringMixin in dataclasses to detect recurring objects in larger object trees
2023-10-18 20:44:18 +02:00
Dominik Jain
d269063e6a
Remove 'RL' prefix from class names
2023-10-18 20:44:17 +02:00
Dominik Jain
1cba589bd4
Add DQN support in high-level API
...
* Allow to specify trainer callbacks (train_fn, test_fn, stop_fn)
in high-level API, adding the necessary abstractions and pass-on
mechanisms
* Add example atari_dqn_hl
2023-10-18 20:44:16 +02:00
Dominik Jain
2671580c6c
Add DDPG high-level API and MuJoCo example
2023-10-18 20:44:16 +02:00
Dominik Jain
6b6d9ea609
Add support for discrete PPO
...
* Refactored module `module` (split into submodules)
* Basic support for discrete environments
* Implement Atari env. factory
* Implement DQN-based actor factory
* Implement notion of reusing agent preprocessing network for critic
* Add example atari_ppo_hl
2023-10-18 20:44:16 +02:00
Dominik Jain
e993425aa1
Add high-level API support for TD3
...
* Created mixins for agent factories to reduce code duplication
* Further factorised params & mixins for experiment factories
* Additional parameter abstractions
* Implement high-level MuJoCo TD3 example
2023-10-18 20:44:16 +02:00
Dominik Jain
8ec42009cb
Move RLSamplingConfig to separate module config, fixing cyclic import
2023-10-09 13:02:23 +02:00