DDCap: A diffusion-based text decoding framework for image captioning - 42Papers