Tianshou

Author	SHA1	Message	Date
Jiayi Weng	f0afdeaf6a	update version to 0.5.0 (#826 )	2023-03-12 22:07:16 -07:00
Jose Antonio Martin H	10d919052b	Add Trainers as generators (#559 ) The new proposed feature is to have trainers as generators. The usage pattern is: ```python trainer = OnPolicyTrainer(...) for epoch, epoch_stat, info in trainer: print(f"Epoch: {epoch}") print(epoch_stat) print(info) do_something_with_policy() query_something_about_policy() make_a_plot_with(epoch_stat) display(info) ``` - epoch int: the epoch number - epoch_stat dict: a large collection of metrics of the current epoch, including stat - info dict: the usual dict out of the non-generator version of the trainer You can even iterate on several different trainers at the same time: ```python trainer1 = OnPolicyTrainer(...) trainer2 = OnPolicyTrainer(...) for result1, result2, ... in zip(trainer1, trainer2, ...): compare_results(result1, result2, ...) ``` Co-authored-by: Jiayi Weng <trinkle23897@gmail.com>	2022-03-18 00:26:14 +08:00
n+e	825da9bc53	add cross-platform test and release 0.4.1 (#331 ) * bump to 0.4.1 * add cross-platform test	2021-03-31 15:14:22 +08:00
n+e	bd9c3c7f8d	docs fix and v0.2.5 (#156 ) * pre * update docs * update docs * $ in bash * size -> hidden_layer_size * doctest * doctest again * filter a warning * fix bug * fix examples * test fail * test succ	2020-07-22 14:42:08 +08:00
Trinkle23897	9b26137cd2	add type annotation	2020-05-12 11:31:47 +08:00
Trinkle23897	6b96f124ae	fix pdqn	2020-04-26 15:11:20 +08:00
Trinkle23897	befdfb07e8	polish docs	2020-04-11 19:29:46 +08:00
Trinkle23897	044aae4355	add baseline and rlpyt result	2020-03-27 16:24:07 +08:00

8 Commits