- add RainbowPolicy
- add `set_beta` method in prio_buffer
- add NoisyLinear in utils/network

Things changed in this PR:

- various docs update, add TOC
- split buffer into several files
- fix venv action_space randomness
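
For context on the `set_beta` item, here is a minimal sketch of how a beta setter is typically used to anneal the importance-sampling exponent during training. `linear_beta` and `ToyPrioBuffer` are hypothetical stand-ins written for illustration and are not the code added in this PR; the start value, final value, and step counts are assumptions.

```python
def linear_beta(step, total_steps, beta_start=0.4, beta_final=1.0):
    """Linearly interpolate beta from beta_start to beta_final (hypothetical helper)."""
    # Clamp progress to [0, 1] so beta never overshoots beta_final.
    frac = min(max(step / total_steps, 0.0), 1.0)
    return beta_start + frac * (beta_final - beta_start)


class ToyPrioBuffer:
    """Hypothetical stand-in for a prioritized buffer; it only stores beta."""

    def __init__(self, beta=0.4):
        self._beta = beta

    def set_beta(self, beta):
        # Update the importance-sampling exponent used for later samples.
        self._beta = beta


buf = ToyPrioBuffer()
total_steps = 100_000  # assumed training length
for step in (0, 50_000, 100_000):
    buf.set_beta(linear_beta(step, total_steps))
```

Exposing a setter lets the training loop own the schedule, so the buffer does not need to know how many steps the run will take.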