Reinforcement learning for extremal combinatorics and graph theory - 42Papers