Visual Representations Learned in Text-Only Language Models - 42Papers