Hi there
Welcome to my homepage!
I am an undergraduate (2022-2026) at the University of Science and Technology Beijing, focusing on Multimodal Learning and Robot Manipulation.
I'm also an incoming PhD student at the Institute of Automation, Chinese Academy of Sciences, under the supervision of Prof. Yan Huang and Prof. Liang Wang.
Currently, I conduct research on multimodal agents at Microsoft Research.
News
- BridgeVLA was accepted to NeurIPS 2025 🔥
Experience



University of Science and Technology Beijing
Sep 2022 - July 2026
Rank 1/115, National Scholarship x 2
B.E at AE
Publications

BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models
Peiyan Li, Yixiang Chen, Hongtao Wu, Xiao Ma, Xiangnan Wu, Yan Huang, Liang Wang, Tao Kong, Tieniu Tan
BridgeVLA enables efficient 3D robot manipulation by aligning 3D inputs and action outputs within a consistent 2D image space, leveraging pre-trained vision-language models.
NeurIPS 2025 [arXiv] [Project Website]

EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Yuan Xu, Jiabing Yang, Xiaofeng Wang, Yixiang Chen, Zheng Zhu, Bowen Fang, Guan Huang, Xinze Chen, Yun Ye, Qiang Zhang, Peiyan Li, Xiangnan Wu, Kai Wang, Bing Zhan, Shuo Lu, Jing Liu, Nianfeng Liu, Yan Huang, Liang Wang
The paper proposes EgoDemoGen, which generates novel egocentric demonstrations by retargeting actions and synthesizing corresponding videos using the EgoViewTransfer model.
Preprint [Project Website]
Awards
- National Scholarship 2025
- Beijing “San Hao” Student 2024
- National Scholarship 2024