Change the behavior of to_numpy and to_torch: from now on, dict is automatically converted to Batch and list is automatically converted to np.ndarray (if an error occurs, raise the exception instead of converting each element in the list).
Things changed in this PR: - various docs update, add TOC - split buffer into several files - fix venv action_space randomness