Tianshou

Author	SHA1	Message	Date
Dominik Jain	6cbee188b8	Change interface of EnvFactory to ensure that configuration of number of environments in SamplingConfig is used (values are now passed to factory method) This is clearer and removes the need to pass otherwise unnecessary configuration to environment factories at construction	2023-10-19 11:37:20 +02:00
Dominik Jain	d84e936430	Apply centrally defined callbacks	2023-10-18 20:44:18 +02:00
Dominik Jain	ae4850692f	DQNExperimentBuilder: Use IntermediateModuleFactory instead of ActorFactory (similar to IQN implementation)	2023-10-18 20:44:18 +02:00
Dominik Jain	83048788a1	Add generalised DQN network representation, adding specialised class for feature_only=True	2023-10-18 20:44:18 +02:00
Dominik Jain	76e870207d	Improve persistence handling * Add persistence/restoration of Experiment instance * Add file logging in experiment * Allow all persistence/logging to be disabled * Disable persistence in tests	2023-10-18 20:44:18 +02:00
Dominik Jain	a8a367c42d	Support IQN in high-level API * Add example atari_iqn_hl * Factor out trainer callbacks to new module atari_callbacks * Extract base class for DQN-based agent factories * Improved module factory interface design, achieving higher generality	2023-10-18 20:44:17 +02:00
Dominik Jain	a161a9cf58	Improve type annotations, fix type issues and add checks	2023-10-18 20:44:17 +02:00
Dominik Jain	837ff13c04	Reorder ExperimentBuilder args (EnvFactory first)	2023-10-18 20:44:17 +02:00
Dominik Jain	d269063e6a	Remove 'RL' prefix from class names	2023-10-18 20:44:17 +02:00
Dominik Jain	b54fcd12cb	Change high-level DQN interface to expect an actor instead of a critic, because that is what is functionally required	2023-10-18 20:44:16 +02:00
Dominik Jain	1cba589bd4	Add DQN support in high-level API * Allow to specify trainer callbacks (train_fn, test_fn, stop_fn) in high-level API, adding the necessary abstractions and pass-on mechanisms * Add example atari_dqn_hl	2023-10-18 20:44:16 +02:00

11 Commits