(Selected Publications. * equal contribution, # corresponding author)
Preprint
FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion
Akide Liu*, Zeyu Zhang*, Zhexin Li, Xuehai Bai, Yizeng Han, Jiasheng Tang, Yuanjie Xing, Jichao Wu, Mingyang Yang, Weihua Chen, Jiahao He, Yuanyu He, Fan Wang#, Gholamreza Haffari, Bohan Zhuang#
[Paper][Project]ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS
Weijie Wang#, Donny Y. Chen#, Zeyu Zhang, Duochao Shi, Akide Liu, Bohan Zhuang
[Paper][Project]ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models
Jing Liu, Ruihao Gong, Mingyang Zhang, Yefei He, Jianfei Cai, Bohan Zhuang#
[Paper]Enhancing Perception Capabilities of Multimodal LLMs with Training-Free Fusion
Zhuokun Chen, Jinwu Hu, Zeshuai Deng, Yufeng Wang, Bohan Zhuang#, Mingkui Tan#
[Paper]Evaluating and Advancing Multimodal Large Language Models in Ability Lens
Feng Chen, Chenhui Gou, Jing Liu, Yang Yang, Zhaoyang Li, Jiyuan Zhang, Zhenbang Sun, Bohan Zhuang, Qi Wu
[Paper]Motion Anything: Any to Motion Generation
Zeyu Zhang, Yiran Wang, Wei Mao, Danning Li, Rui Zhao, Biao Wu, Zirui Song, Bohan Zhuang, Ian Reid, Richard Hartley
[Paper]
2025
Neighboring Autoregressive Modeling for Efficient Visual Generation
Yefei He*, Yuanyu He*, Shaoxuan He*, Feng Chen*, Hong Zhou, Kaipeng Zhang, Bohan Zhuang#
[Paper][Code][Project] ICCV 2025ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification
Yefei He, Feng Chen, Jing Liu, Wenqi Shao, Hong Zhou, Kaipeng Zhang, Bohan Zhuang
[Paper] ICCV 2025Frequency-Aware Autoregressive Modeling for Efficient High-Resolution Image Synthesis
Zhuokun Chen, Jugang Fan, Zhuowei Yu, Bohan Zhuang#, Mingkui Tan#
[Paper][Code] ICCV 2025ZipAR: Parallel Autoregressive Image Generation through Spatial Locality
Yefei He, Feng Chen, Yuanyu He, Shaoxuan He, Hong Zhou, Kaipeng Zhang, Bohan Zhuang
[Paper][Code] ICML 2025T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching
Zizheng Pan, Bohan Zhuang#, De-An Huang, Weili Nie, Zhiding Yu, Chaowei Xiao, Jianfei Cai, Anima Anandkumar
[Paper][Code][Project] ICLR 2025Are Large Vision Language Models Good Game Players?
Xinyu Wang, Bohan Zhuang, Qi Wu
[Paper][Code] ICLR 2025Channel Merging: Preserving Specialization for Merged Experts
Mingyang Zhang, Jing Liu, Ganggui Ding, Xinyi Yu, Linlin Ou, Bohan Zhuang#
[Paper] AAAI 2025 (Oral)
2024
MiniCache: KV Cache Compression in Depth Dimension for Large Language Models
Akide Liu, Jing Liu, Zizheng Pan, Yefei He, Gholamreza Haffari, Bohan Zhuang#
[Paper][Code] NeurIPS 2024ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
Yefei He, Luoming Zhang, Weijia Wu, Jing Liu, Hong Zhou, Bohan Zhuang#
[Paper][Code] NeurIPS 2024MVSplat360: Feed Forward 360° Scene Synthesis from Sparse Views
Yuedong Chen, Chuanxia Zheng, Haofei Xu, Bohan Zhuang, Andrea Vedaldi, Tat-Jen Cham, Jianfei Cai
[Paper][Homepage] NeurIPS 2024QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models
Jing Liu, Ruihao Gong, Xiuying Wei, Zhiwei Dong, Jianfei Cai, Bohan Zhuang#
[Paper][Code] ICLR 2024Object-Aware Inversion and Reassembly for Image Editing
Zhen Yang, Ganggui Ding, Wen Wang, Hao Chen#, Bohan Zhuang#, Chunhua Shen
[Paper][Homepage] ICLR 2024EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models
Hefei He, Jing Liu, Weijia Wu, Hong Zhou, Bohan Zhuang#
[Paper][Code] ICLR 2024 (Spotlight)GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI
Pengcheng Chen, Jin Ye#, Guoan Wang*, Yanjun Li, Zhongying Deng, Wei Li, Tianbin Li, Haodong Duan, Ziyan Huang, Yanzhou Su, Benyou Wang, Shaoting Zhang, Bin Fu, Jianfei Cai, Bohan Zhuang, Eric J Seibel, Junjun He, Yu Qiao
[Paper][Homepage] NeurIPS 2024 Datasets and Benchmarks TrackLongVLM: Efficient Long Video Understanding via Large Language Models
Yuetian Weng, Mingfei Han, Haoyu He, Xiaojun Chang, Bohan Zhuang#
[Paper][Code] ECCV 2024 (Oral)MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
Yuedong Chen, Haofei Xu, Chuanxia Zheng, Bohan Zhuang, Marc Pollefeys, Andreas Geiger, Tat-Jen Cham, Jianfei Cai
[Paper][Homepage] ECCV 2024 (Oral)Stitched ViTs are Flexible Vision Backbones
Zizheng Pan, Jing Liu, Haoyu He, Jianfei Cai, Bohan Zhuang#
[Paper][Code] ECCV 2024Motion Mamba: Efficient and Long Sequence Motion Generation
Zeyu Zhang, Akide Liu, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang
[Paper][HomePage] ECCV 2024Efficient Stitchable Task Adaptation
Haoyu He, Zizheng Pan, Jing Liu, Jianfei Cai, Bohan Zhuang#
[Paper] CVPR 2024ModaVerse: Efficiently Transforming Modalities with LLMs
Xinyu Wang, Bohan Zhuang, Qi Wu
[Paper][Code] CVPR 2024LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning
Mingyang Zhang, Hao Chen, Chunhua Shen, Zhen Yang, Linlin Ou, Xinyi Yu, Bohan Zhuang
[Paper][Code] ACL Findings 2024SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation
Guoan Wang, Jin Ye, Junlong Cheng, Tianbin Li, Zhaolin Chen, Jianfei Cai, Junjun He, Bohan Zhuang#
[Paper] MICCAI 2024
2023
Stitchable Neural Networks
Zizheng Pan; Jianfei Cai; Bohan Zhuang#
[Paper][Homepage] CVPR 2023 (Highlight)PTQD: Accurate Post-Training Quantization for Diffusion Models
Yefei He, Luping Liu, Jing Liu, Weijia Wu, Hong Zhou#, Bohan Zhuang#
[Paper][Code] NeurIPS 2023Mask Propagation for Efficient Video Semantic Segmentation
Yuetian Weng, Mingfei Han, Haoyu He, Mingjie Li, Xiaojun Chang, Bohan Zhuang#
[Paper][Code] NeurIPS 2023Second-Order Degradation and Reconstruction for Test-Time Image Super-Resolution
Zeshuai Deng, Zhuokun Chen, Shuaicheng Niu, Thomas H. Li, Bohan Zhuang#, Mingkui Tan#
[Paper][Code] NeurIPS 2023Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning
Haoyu He, Jianfei Cai, Jing Zhang, Dacheng Tao, Bohan Zhuang#
[Paper][Code] ICCV 2023 (Oral)BiViT: Extremely Compressed Binary Vision Transformer
Yefei He, Zhenyu Lou, Luoming Zhang, Hong Zhou#, Bohan Zhuang#
[Paper] ICCV 2023Dynamic Focus-aware Positional Queries for Semantic Segmentation
Haoyu He, Jianfei Cai, Zizheng Pan, Jing Liu, Jing Zhang, Dacheng Tao, Bohan Zhuang#
[Paper][Code] CVPR 2023End-to-end One-shot Human Parsing
Haoyu He, Jing Zhang, Bohan Zhuang#, Jianfei Cai, Dacheng Tao
[Paper][Code] TPAMI 2023Single-path Bit Sharing for Automatic Loss-aware Model Compression
Jing Liu, Bohan Zhuang, Peng Chen, Yong Guo, Chunhua Shen, Jianfei Cai, Mingkui Tan
[Paper] TPAMI 2023Pruning Self-attentions into Convolutional Layers in Single Path
Haoyu He, Jing Liu, Zizheng Pan, Jianfei Cai, Jing Zhang, Dacheng Tao, Bohan Zhuang#
[Paper][Code] TPAMI 2023A Survey on Efficient Training of Transformers
Bohan Zhuang#, Jing Liu, Zizheng Pan, Haoyu He, Yuetian Weng, Chunhua Shen
[Paper] IJCAI 2023
2022
EcoFormer: Energy-Saving Attention with Linear Complexity
Jing Liu, Zizheng Pan, Haoyu He, Jianfei Cai, Bohan Zhuang#
[Paper][Code] NeurIPS 2022 (Spotlight)Fast Vision Transformers with HiLo Attention
Zizheng Pan, Jianfei Cai, Bohan Zhuang#
[Paper][Code] NeurIPS 2022 (Spotlight)Automated Progressive Learning for Efficient Training of Vision Transformers
Changlin Li, Bohan Zhuang#, Guangrun Wang, Xiaodan Liang, Xiaojun Chang, Yi Yang
[Paper] CVPR 2022Less is More: Pay Less Attention in Vision Transformers
Zizheng Pan, Bohan Zhuang#, Haoyu He, Jing Liu, Jianfei Cai
[Paper][Code] AAAI 2022An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
Yuetian Weng, Zizheng Pan, Mingfei Han, Xiaojun Chang, Bohan Zhuang#
[Paper] ECCV 2022Structured Binary Neural Networks for Image Recognition
Bohan Zhuang, Chunhua Shen, Mingkui Tan, Peng Chen, Lingqiao Liu, Ian Reid
[Paper] IJCV 2022
2021
Mesa: A Memory-saving Training Framework for Transformers
Zizheng Pan, Peng Chen, Haoyu He, Jing Liu, Jianfei Cai, Bohan Zhuang#
[Paper][Code]Sharpness-aware Quantization for Deep Neural Networks
Jing Liu, Jianfei Cai, Bohan Zhuang#
[Paper][Code]Scalable Visual Transformers with Hierarchical Pooling
Zizheng Pan, Bohan Zhuang#, Jing Liu, Haoyu He, Jianfei Cai
[Paper][Code] ICCV 2021FATNN: Fast and Accurate Ternary Neural Networks
Peng Chen, Bohan Zhuang*, Chunhua Shen
[Paper][Code] ICCV 2021Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations
Bohan Zhuang, Mingkui Tan, Jing Liu, Lingqiao Liu, Ian Reid, Chunhua Shen
[Paper][Code] TPAMI 2021Discrimination-aware Network Pruning for Deep Model Compression
Jing Liu, Bohan Zhuang, Zhuangwei Zhuang*, Yong Guo, Junzhou Huang, Jinhui Zhu, Mingkui Tan
[Paper][Code] TPAMI 2021AQD: Towards Accurate Quantized Object Detection
Peng Chen, Jing Liu, Bohan Zhuang#, Mingkui Tan, Chunhua Shen
[Paper][Code] CVPR 2021 (Oral)