A Bidirectional Gated State-Space Model for NLP Pretraining