Do Transformer Modifications Transfer Across Implementations and Applications? - 42Papers