n+e 140b1c2cab
Improve PER (#159)
- rewrite the previous PrioReplayBuffer code with a segment tree and add a test

- enable all Q-learning algorithms to use PER
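
For context, the segment-tree idea behind prioritized experience replay (PER) can be sketched as below. This is a minimal illustration, not the code from this commit: the class name SumSegmentTree and the methods update, total and find_prefix_index are hypothetical. Each leaf stores one transition's priority and each internal node stores the sum of its children, so updating a priority and locating the transition that covers a given prefix sum both take O(log n).

```python
class SumSegmentTree:
    """Illustrative sum tree; names are hypothetical, not taken from the commit."""

    def __init__(self, size):
        # Round the capacity up to a power of two so all leaves sit on one level.
        self.capacity = 1
        while self.capacity < size:
            self.capacity *= 2
        # Node 1 is the root; leaves occupy [capacity, 2 * capacity).
        self.tree = [0.0] * (2 * self.capacity)

    def update(self, index, priority):
        """Set the priority of one leaf and refresh the sums on the path to the root."""
        pos = index + self.capacity
        self.tree[pos] = priority
        pos //= 2
        while pos >= 1:
            self.tree[pos] = self.tree[2 * pos] + self.tree[2 * pos + 1]
            pos //= 2

    def total(self):
        """Return the sum of all priorities."""
        return self.tree[1]

    def find_prefix_index(self, value):
        """Return the smallest index i whose cumulative priority sum exceeds value."""
        pos = 1
        while pos < self.capacity:
            left = 2 * pos
            if value < self.tree[left]:
                pos = left
            else:
                # Skip the whole left subtree and continue in the right one.
                value -= self.tree[left]
                pos = left + 1
        return pos - self.capacity
```

Under these assumptions, drawing a value uniformly from [0, total()) and passing it to find_prefix_index selects index i with probability proportional to its priority.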
2020-08-06 10:26:24 +08:00