Sparse Updates for Gradient-Based Deep Neural Network Training - 42Papers