Variance Aware Training for Deep Neural Networks - 42Papers