textitUser: An Efficient Sparse Transformer for Long Sequence Modeling - 42Papers