Every algorithm should have performance tests on at least two environments

To check the reliability of our algorithms with confidence, we should test each one in at least two or three environments. At least one of these should be more complicated than FetchReach-v2.
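
One possible shape for such a check, as a rough sketch only: the environment table, the thresholds, and the `train_and_evaluate` callable below are all hypothetical placeholders and not existing project code. `FetchPush-v2` is used here merely as an example of a harder task than FetchReach-v2.

```python
from typing import Callable, Dict

# Hypothetical table: environment ID -> minimum acceptable success rate.
# Both the choice of environments and the thresholds are assumptions.
PERFORMANCE_ENVS: Dict[str, float] = {
    "FetchReach-v2": 0.9,  # simple baseline task
    "FetchPush-v2": 0.5,   # assumed to be the "more complicated" task
}


def check_performance(
    train_and_evaluate: Callable[[str], float],
    envs: Dict[str, float] = PERFORMANCE_ENVS,
) -> Dict[str, float]:
    """Run an algorithm on every environment and collect the failures.

    `train_and_evaluate` is a hypothetical callable that trains the
    algorithm on the given environment ID and returns its success rate.
    Returns a dict of environment ID -> score for every environment
    whose score fell below its threshold (empty if all passed).
    """
    if len(envs) < 2:
        raise ValueError("need at least two environments for a performance test")

    failures: Dict[str, float] = {}
    for env_id, threshold in envs.items():
        score = train_and_evaluate(env_id)
        if score < threshold:
            failures[env_id] = score
    return failures
```

A test for a given algorithm could then assert that `check_performance(...)` returns an empty dict, which keeps the list of environments in one place and makes it easy to add a third one later.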