Tianshou

History

Dominik Jain ca4dad1139 BaseTrainer: Refactoring

New method training_step, which
    * collects training data (method _collect_training_data)
    * performs "test in train" (method _test_in_train)
    * performs policy update
  The old method named train_step performed only the first two points
  and was now split into two separate methods

2024-05-03 10:12:35 +02:00

data

Dosctring add return [skip-ci]

2024-04-26 18:14:20 +02:00

env

Deleted long deprecated functionality, removed unused warning module

2024-04-26 14:42:44 +02:00

evaluation

Feature/algo eval (#1074 )

2024-04-20 23:25:33 +00:00

exploration

Remove kwargs in policy init (#950 )