👋 Hi! I am Siteng Huang (黄思腾 in Chinese). I work at
DAMO Academy, Alibaba Group, as an Algorithm Expert through the AliStar program. I received my Ph.D. degree from
Zhejiang University in June 2024, affiliated with a joint program with
Westlake University at Machine Intelligence Laboratory (MiLAB) and advised by Prof. Donglin Wang. In my Ph.D. study, I also spent wonderful internship time at
TongYi Lab, Alibaba Group. Before that, I received my B.Eng. Degree from School of Computer Science,
Wuhan University in June 2019.
🔬 My research has centered on the perception, understanding, reasoning, and generation of multimodal (including images, videos, language, dynamics, etc.) data from both the internet and the physical world. I also focus on efficientAI (in terms of data, time, parameters, memory, etc.) for multimodal applications. I have published 30+ papers on the above topics at the top-tier international AI conferences and journals. Recently, I devote myself to the development of multi-modal generative, embodied, and unified foundation models.