A Bayesian Perspective on Generalization and Stochastic Gradient Descent - 42Papers