ReCo: Region-Controlled Text-to-Image Generation - 42Papers