Mesa: A Resource-efficient Training Framework for High-Performance Transformers - 42Papers