Scalable vision transformers with hierarchical pooling

Publication
Proceedings of the IEEE/CVF International Conference on Computer Vision