Welcome to Yuheng Zha’s page!
Feel free to reach out if you are interested in working with me.
Bio
I’m a PhD candidate at UC San Diego, advised by Professor Zhiting Hu. I got my Bachelor’s degree (with Honors) from Zhejiang University.
Research Interests
My research interest lies in VLMs, World Models and Agentic systems. Currently, I’m actively doing research in
- Reasoning with Vision-Language Models: Enhancing the reasoning capabilities of vision-language models through reinforcement learning with various reward signals.
- Agentic Systems: Building autonomous agents that can perceive, reason, and act in complex environments (e.g., real websites) using multimodal inputs.
My previous research also includes
- Evaluating the factual consistency of generative language models and building a unified model for many NLP tasks
- Training video world models with natural language actions and video states
- Visual Concept Learning
