Pruning self-attentions into convolutional layers in single path

Publication
IEEE transactions on pattern analysis and machine intelligence