📢 News

  • 2025/01/10 [Preprint] We released GlobalCom2, a “global-to-local” approach for training-free acceleration of high-resolution MLLMs that use the AnyRes strategy. Code is available!
  • 2024/12/13 [Preprint] We released Score and Distribution Matching Policy, which transforms diffusion-based policies into single-step generators through a two-stage optimization process: score matching ensures alignment with true action distributions, and distribution matching minimizes KL divergence for consistency. The project page is available.
  • 2024/12/10 [Preprint] We released CARP, Coarse-to-fine AutoRegressive Prediction for visuomotor policy learning. The approach produces highly accurate and smooth robot actions, achieving up to a 10% improvement in success rate and 10x faster inference compared to state-of-the-art policies. The project page with cool videos is available. Code will be available soon!
  • 2024/12/10 [AAAI’25] Cobra, the first Mamba-based MLLM for efficient inference, got accepted for AAAI 2025! See the project page.
  • 2024/11/27 [Preprint] We released a new work on token reduction for MLLM inference acceleration. It proposes a unified paradigm that demystifies popular existing methods and guides future designs, and it offers FiCoCo, a suite of methods grounded in that paradigm. The project page and code are available!
  • 2024/09/09 [New Start] Joined Alibaba DAMO Academy as an Algorithm Expert!
  • 2024/07/16 [MM’24] One paper (ProFD) got accepted for ACM MM 2024. Congratulations to all collaborators!
  • 2024/07/09 [Scholar’24] Google Scholar released its 2024 Scholar Metrics. Our paper “DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting” ranked 7th among CIKM 2019 papers by citations, and 13th within five years.
  • 2024/07/01 [ECCV’24] Two papers (PiTe and QUAR-VLA) got accepted for ECCV 2024. Update 2024/08/12: PiTe got accepted as an Oral paper!
  • 2024/06/04 [Graduation] I successfully defended my dissertation. Many thanks to my Ph.D. committee (Prof. Xiaogang Jin, Prof. Mai Xu, Prof. Changxin Gao, Prof. Fajie Yuan, Prof. Peidong Liu, Prof. Xiaofei Li) and my advisor!
  • 2024/03/29 [VALSE’24] Troika got accepted as a VALSE 2024 Poster! Update 2024/05/05: Our Cobra was selected for VALSE 2024 Annual Progress Representation. Thanks to the whole committee for the approval!
  • 2024/03/13 [ICME’24] One paper (DARA) about parameter-efficient tuning for visual grounding got accepted for ICME 2024 (Oral).
  • 2024/02/27 [Award] Named a Zhejiang University 2024 Outstanding Graduate!
  • 2024/02/27 [CVPR’24] Three papers (ADI, Troika, SimM) as first/co-first author got accepted for CVPR 2024. Congratulations to all collaborators!
  • 2023/12/13 [ICASSP’24] One paper (VGDiffZero) on diffusion model-based zero-shot visual grounding got accepted for ICASSP 2024. Congratulations to all collaborators!
  • 2023/12/09 [AAAI’24] One paper on VLM-based unsupervised domain adaptation got accepted for AAAI 2024.
  • 2023/04/02 [ICMR’23] One paper (RL-CZSL) about reference-limited compositional learning got accepted for ICMR 2023. Congratulations to all collaborators!
  • 2023/02/28 [CVPR’23] One paper (VoP) about parameter-efficient text-video retrieval got accepted for CVPR 2023. Congratulations to all collaborators!