Contrastive Language-Audio Pretraining for Multimodal Representation Learning - 42Papers