Aligned 3D Representation by Pre-training Visual, Text, and 3D Language Models - 42Papers