Alignment-Aware Acoustic-Text Pretraining for Speech Representation Learning - 42Papers