Dominik Jain ca4dad1139 BaseTrainer: Refactoring
New method training_step, which
    * collects training data (method _collect_training_data)
    * performs "test in train" (method _test_in_train)
    * performs policy update
  The old method named train_step performed only the first two points
  and was now split into two separate methods
2024-05-03 10:12:35 +02:00
..
2024-04-26 18:14:20 +02:00
2024-04-20 23:25:33 +00:00
2024-05-03 10:12:35 +02:00