A new version of cleanppo

Thilo Fryen requested to merge cleanerppo into devel

This version of cleanppo is more performant than the old one and has a different origin. The old version was a slightly modified copy of the CleanRL implementation of PPO. This version is the Stable-Baselines3 implementation of PPO, condensed into a single file.

