of number of environments in SamplingConfig is used (values are now passed to factory method) This is clearer and removes the need to pass otherwise unnecessary configuration to environment factories at construction
* Set ReLU as default in all actor and critic factories * Configure non-default in applicable MuJoCo examples
* Add example mujoco_reinforce_hl * Extended functionality of ActorFactory to support creation of ModuleOpt