Handle crashing experiments due to unstable SB3 algos for hyperopt
With the wrong hyperparams, SB3 algorithms can become unstable and crash. Catch the corresponding ValueError
(nan in tensors) and return a hyperopt score of 0.
related to #67 (closed)