Hierarchical Pruning for Transformer Model Deployment - 42Papers