gym
numba
numpy>=1.20
sphinx<4
sphinxcontrib-bibtex
tensorboard
torch
tqdm