Quark: Controllable Text Generation with Reinforced Unlearning - 42Papers