Accelerating the Convergence of Human-in-the-Loop Reinforcement Learning with Counterfactual Explanations