of number of environments in SamplingConfig is used (values are now passed to factory method) This is clearer and removes the need to pass otherwise unnecessary configuration to environment factories at construction
* Set ReLU as default in all actor and critic factories * Configure non-default in applicable MuJoCo examples