Training FPS improvement (base commit is 94bfb32):
test_pdqn: 1660 (without numba) -> 1930
discrete/test_ppo: 5100 -> 5170

Since nstep has little impact on overall performance, the unit test results are:
GAE: 4.1s -> 0.057s
nstep: 0.3s -> 0.15s (little improvement)

Others:
- fix a bug in ttt set_eps
- keep only sumtree in segment tree implementation
- dirty fix for asyncVenv check_id test
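
For context on the GAE timing above, here is a minimal sketch of the kind of backward loop that benefits from numba compilation. It is not the code from this commit: the function name `gae_advantage`, its argument names, and the pure-Python fallback for `njit` are assumptions made for illustration only.

```python
import numpy as np

try:
    from numba import njit  # optional: compiles the loop below when installed
except ImportError:
    def njit(func):  # assumed fallback: run as plain Python without numba
        return func


@njit
def gae_advantage(rew, v_s, v_s_next, done, gamma, gae_lambda):
    """Generalized advantage estimation over one flat trajectory buffer.

    All array arguments are 1-D float arrays of equal length;
    done[t] is 1.0 where the episode ends at step t, else 0.0.
    """
    # One-step TD residuals; bootstrapping is cut at episode ends.
    delta = rew + gamma * v_s_next * (1.0 - done) - v_s
    adv = np.zeros_like(delta)
    gae = 0.0
    # Accumulate discounted residuals backwards in time.
    for t in range(len(rew) - 1, -1, -1):
        gae = delta[t] + gamma * gae_lambda * (1.0 - done[t]) * gae
        adv[t] = gae
    return adv
```

The loop is sequential in `t`, so it cannot be vectorized with plain NumPy; that is the usual reason a JIT compiler helps here. The first call includes compilation time when numba is present, so timings should be taken on later calls.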
gym
tqdm
torch
numba
tensorboard
sphinxcontrib-bibtex