A New Class of Gradient Communication Mechanisms for Communication-efficient Training - 42Papers