Tianshou

Author	SHA1	Message	Date
Dominik Jain	1e5ebc2a2d	Improve naming of callback classes and related methods/attributes Add EpochStopCallbackRewardThreshold	2024-01-12 17:13:42 +01:00
Dominik Jain	ff398beed9	Move callbacks for setting DQN epsilon values to the library	2024-01-12 17:13:42 +01:00
Dominik Jain	19a98c3b2a	Fix models using scale_obs not being persistable (due to locally defined class)	2024-01-12 17:13:42 +01:00
Dominik Jain	dae4000cd2	Revert "Depend on sensAI instead of copying its utils (logging, string)" This reverts commit fdb0eba93d81fa5e698770b4f7088c87fc1238da.	2023-11-08 19:11:39 +01:00
Dominik Jain	fdb0eba93d	Depend on sensAI instead of copying its utils (logging, string)	2023-10-27 20:15:58 +02:00
Dominik Jain	da2194eff6	Force kwargs in PolicyWrapperFactoryIntrinsicCuriosity init	2023-10-26 10:43:59 +02:00
Dominik Jain	7437131d79	Fix tianshou.highlevel depending on jsonargparse (should be dev dependency only) by introducing a new place where jsonargparse can be configured: logging.run_cli, which is also slightly more convenient	2023-10-19 11:40:49 +02:00
Dominik Jain	6cbee188b8	Change interface of EnvFactory to ensure that configuration of number of environments in SamplingConfig is used (values are now passed to factory method) This is clearer and removes the need to pass otherwise unnecessary configuration to environment factories at construction	2023-10-19 11:37:20 +02:00
Dominik Jain	d84e936430	Apply centrally defined callbacks	2023-10-18 20:44:18 +02:00
Dominik Jain	ae4850692f	DQNExperimentBuilder: Use IntermediateModuleFactory instead of ActorFactory (similar to IQN implementation)	2023-10-18 20:44:18 +02:00
Dominik Jain	83048788a1	Add generalised DQN network representation, adding specialised class for feature_only=True	2023-10-18 20:44:18 +02:00
Dominik Jain	76e870207d	Improve persistence handling * Add persistence/restoration of Experiment instance * Add file logging in experiment * Allow all persistence/logging to be disabled * Disable persistence in tests	2023-10-18 20:44:18 +02:00
Dominik Jain	a8a367c42d	Support IQN in high-level API * Add example atari_iqn_hl * Factor out trainer callbacks to new module atari_callbacks * Extract base class for DQN-based agent factories * Improved module factory interface design, achieving higher generality	2023-10-18 20:44:17 +02:00
Dominik Jain	a161a9cf58	Improve type annotations, fix type issues and add checks	2023-10-18 20:44:17 +02:00
Dominik Jain	837ff13c04	Reorder ExperimentBuilder args (EnvFactory first)	2023-10-18 20:44:17 +02:00
Dominik Jain	d269063e6a	Remove 'RL' prefix from class names	2023-10-18 20:44:17 +02:00
Dominik Jain	b54fcd12cb	Change high-level DQN interface to expect an actor instead of a critic, because that is what is functionally required	2023-10-18 20:44:16 +02:00
Dominik Jain	1cba589bd4	Add DQN support in high-level API * Allow to specify trainer callbacks (train_fn, test_fn, stop_fn) in high-level API, adding the necessary abstractions and pass-on mechanisms * Add example atari_dqn_hl	2023-10-18 20:44:16 +02:00

18 Commits