Tianshou

Author	SHA1	Message	Date
Dominik Jain	dae4000cd2	Revert "Depend on sensAI instead of copying its utils (logging, string)". This reverts commit fdb0eba93d81fa5e698770b4f7088c87fc1238da.	2023-11-08 19:11:39 +01:00
Dominik Jain	fdb0eba93d	Depend on sensAI instead of copying its utils (logging, string)	2023-10-27 20:15:58 +02:00
Dominik Jain	da2194eff6	Force kwargs in PolicyWrapperFactoryIntrinsicCuriosity init	2023-10-26 10:43:59 +02:00
Dominik Jain	4b270eaa2d	Add documentation, improve structure of 'module' package	2023-10-18 20:44:18 +02:00
Dominik Jain	90eaacb606	PolicyWrapperFactory: Remove unnecessary input type variable	2023-10-18 20:44:18 +02:00
Dominik Jain	a8a367c42d	Support IQN in high-level API: Add example atari_iqn_hl; Factor out trainer callbacks to new module atari_callbacks; Extract base class for DQN-based agent factories; Improved module factory interface design, achieving higher generality	2023-10-18 20:44:17 +02:00
Dominik Jain	a8ea6808c3	Fix ruff type comparison complaint	2023-10-18 20:44:17 +02:00
Dominik Jain	a161a9cf58	Improve type annotations, fix type issues and add checks	2023-10-18 20:44:17 +02:00
Dominik Jain	358978c65d	Add ToStringMixin to further high-level parameter classes	2023-10-18 20:44:16 +02:00
Dominik Jain	2671580c6c	Add DDPG high-level API and MuJoCo example	2023-10-18 20:44:16 +02:00
Dominik Jain	6b6d9ea609	Add support for discrete PPO: Refactored module `module` (split into submodules); Basic support for discrete environments; Implement Atari env. factory; Implement DQN-based actor factory; Implement notion of reusing agent preprocessing network for critic; Add example atari_ppo_hl	2023-10-18 20:44:16 +02:00

11 Commits