Tianshou

History

This is the PR for the QR-DQN algorithm: https://arxiv.org/abs/1710.10044

1. add QR-DQN policy in tianshou/policy/modelfree/qrdqn.py.
2. add QR-DQN net in examples/atari/atari_network.py.
3. add QR-DQN atari example in examples/atari/atari_qrdqn.py.
4. add QR-DQN statement in tianshou/policy/__init__.py.
5. add QR-DQN unit test in test/discrete/test_qrdqn.py.
6. add QR-DQN atari results in examples/atari/results/qrdqn/.
7. add compute_q_value in DQNPolicy and C51Policy to simplify the forward function.
8. move `with torch.no_grad():` from `_target_q` to BasePolicy
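
To illustrate what items 1 and 7 refer to, here is a minimal NumPy sketch of the two core pieces of QR-DQN as described in the linked paper: reducing per-action quantile estimates to Q-values, and the quantile Huber loss. This is not the code from tianshou/policy/modelfree/qrdqn.py. The name `compute_q_value` is borrowed from the list above, but its signature here, the helper `quantile_huber_loss`, and the array shapes are assumptions made for illustration.

```python
import numpy as np


def compute_q_value(quantiles: np.ndarray) -> np.ndarray:
    """Reduce quantile estimates to Q-values (assumed signature).

    quantiles has shape (batch, n_actions, n_quantiles); the Q-value of an
    action is the mean of its quantile estimates.
    """
    return quantiles.mean(axis=2)


def quantile_huber_loss(
    curr: np.ndarray, target: np.ndarray, kappa: float = 1.0
) -> float:
    """Quantile Huber loss from the QR-DQN paper (hypothetical helper).

    curr and target have shape (batch, n_quantiles) and hold the quantiles
    of the taken action and of the Bellman target, respectively.
    """
    n = curr.shape[1]
    # Quantile midpoints tau_hat_i = (2i + 1) / (2N), shaped (1, N, 1).
    tau_hat = ((np.arange(n) + 0.5) / n).reshape(1, n, 1)
    # Pairwise TD errors u[b, i, j] = target[b, j] - curr[b, i].
    u = target[:, None, :] - curr[:, :, None]
    abs_u = np.abs(u)
    # Huber loss: quadratic within kappa, linear outside.
    huber = np.where(abs_u <= kappa, 0.5 * u**2, kappa * (abs_u - 0.5 * kappa))
    # Asymmetric quantile weight |tau_hat_i - 1{u < 0}|.
    weight = np.abs(tau_hat - (u < 0).astype(float))
    # Average over target quantiles j, sum over current quantiles i,
    # then average over the batch.
    return float((weight * huber).mean(axis=2).sum(axis=1).mean())


# One state, two actions, three quantiles per action.
quantiles = np.array([[[0.0, 1.0, 2.0], [2.0, 3.0, 4.0]]])
q_values = compute_q_value(quantiles)   # [[1.0, 3.0]]
greedy_action = q_values.argmax(axis=1)  # [1]
```

With Q-values defined as the quantile mean, greedy action selection is the same argmax used in plain DQN, which is presumably why a shared `compute_q_value` hook simplifies the forward function.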

Running `python3 atari_qrdqn.py --task "PongNoFrameskip-v4" --batch-size 64` gives `'best_result': '19.8 ± 0.40'` in epoch 8.

2021-01-28 09:27:05 +08:00

_static

sac mujoco result (#246)

2020-11-09 16:43:55 +08:00

api

code refactor for venv (#179)

2020-08-19 15:00:24 +08:00

tutorials

Add offline trainer and discrete BCQ algorithm (#263)

2021-01-20 18:13:04 +08:00

bibtex.json

Saving and loading replay buffer with HDF5 (#261)