3691ed2abcSupport obs_rms persistence for MuJoCo by adding a general mechanism for attaching persistence to Environments instances
Dominik Jain
2023-10-12 15:01:49 +02:00
d269063e6aRemove 'RL' prefix from class names
Dominik Jain
2023-10-06 13:50:23 +02:00
50ac385321Add some basic tests for high-level experiment builder API
Dominik Jain
2023-10-05 19:22:04 +02:00
b54fcd12cbChange high-level DQN interface to expect an actor instead of a critic, because that is what is functionally required
Dominik Jain
2023-10-05 19:21:08 +02:00
1cba589bd4Add DQN support in high-level API
Dominik Jain
2023-10-05 15:39:32 +02:00
358978c65dAdd ToStringMixin to further high-level parameter classes
Dominik Jain
2023-10-05 13:15:24 +02:00
de70147752Add string module from sensAI
Dominik Jain
2023-10-03 20:40:49 +02:00
2671580c6cAdd DDPG high-level API and MuJoCo example
Dominik Jain
2023-10-03 20:26:39 +02:00
6b6d9ea609Add support for discrete PPO
Dominik Jain
2023-09-28 20:07:52 +02:00
e0e7349b0aAdd base class BaseActor with method get_preprocess_net for high-level API
Dominik Jain
2023-09-28 20:08:55 +02:00
cd79cf8661Add A2C high-level API
Dominik Jain
2023-09-28 14:28:03 +02:00
acd89fa3b0Remove parameter transformers from config object state, composing the list dynamically instead
Dominik Jain
2023-09-27 18:20:49 +02:00
78b6dd1f49Adapt class naming scheme
Dominik Jain
2023-09-27 17:20:35 +02:00
5bcf514c55Add alternative functional interface for environment creation where a persistable configuration object is passed as an argument, as this can help to ensure persistability (making the requirement explicit)
Michael Panchenko
2023-09-27 14:10:45 +02:00
d4e604b46eMove parameter transformation directly into parameter objects, achieving greater separation of concerns and improved maintainability
Dominik Jain
2023-09-26 17:43:16 +02:00
38cf982034Disable Ruff rule D205 (blank-line-after-summary) because it disallows, in particular, class docstrings that consist only of a summary line
Dominik Jain
2023-09-26 17:40:21 +02:00
e993425aa1Add high-level API support for TD3
Dominik Jain
2023-09-26 15:35:18 +02:00
6a739384eeWandbLogger: Use less restrictive type annotation for config
Dominik Jain
2023-09-26 10:48:13 +02:00
Minor dep update (#961)
Michael Panchenko
2023-10-09 22:10:09 +02:00
4d53d345d6Ignore Ruff rule RET505, because it sacrifices visual discernability of control flow paths for brevity (regarding return statements)
Dominik Jain
2023-09-25 11:18:17 +02:00
3fd60f9e70Unify PPO configuration objects, use experiment-specific configuration in mujoco_ppo_hl
Dominik Jain
2023-09-20 15:45:09 +02:00
8ec42009cbMove RLSamplingConfig to separate module config, fixing cyclic import
Dominik Jain
2023-09-20 15:28:33 +02:00