Text-Only Training for Image Captioning using Noise-Injected CLIP