I am an Master’s student at Peking University, advised by Prof. Shanghang Zhang.
I completed my Bachelor’s degree in Computer Science from The University of Hong Kong (HKU). I have worked as an AI Developer at Intact Lab since July 2022, developing full-stack AI systems with a focus on machine learning infrastructure and cloud-based solutions.
I explore how World Models and Vision-Language-Action (VLA) models can revolutionize robotic perception and decision-making. My work focuses on:
Developing Vision-Language-Action (VLA) foundation models that integrate visual perception, natural language understanding, and robotic control for embodied intelligence
Building world models that enable robots to simulate and predict environmental dynamics, supporting improved planning and decision-making in complex scenarios
Creating physical benchmarks for foundation models to evaluate and validate robotic capabilities in real-world manipulation and navigation tasks, ensuring that generated video outputs align with real-world physics
In my free time, I enjoy competitive programming 💻, playing snooker 🎱, running 🏃, and table tennis 🏓. I also like playing guitar 🎸 to relax.