Swin Transformer: Scaling up to 3 billion parameters and making it capable of training with images of up to resolution - 42Papers