Clip q-values produced by critic
As far as I remember, this is what HAC does. In our current implementation we do not clip the q-values. This is probably not realizable with our current scheme where we want to fully re-use the implemented algorithms. It requires to alter the implementation of the algorithms.