Zh1hao Wang

Reality · Robotics · RL

I am currently a second-year master's student at School of Advanced Manufacturing and Robotics, Peking University, advised by Professor Junzhi Yu.

I am also a research intern at the Institute for AI Research (AIR), Tsinghua University, advised by Professor Xianyuan Zhan.

Prior to that, I earned my B.Eng. degree in Intelligent Automotive Engineering from the Harbin Institute of Technology (HIT).

My research interests focus on Embodied Intelligence, including VLA, VLM, Embodied Agents, and related areas.

Email CV Scholar GitHub Linkedin WeChat Zhihu

News

10/2025: X-VLA won the 🏆 Championship at AgiBot World Challenge @ IROS 2025!
05/2025: Our paper LBP (Latent space Backward Planning) has been accepted by ICML 2025!
04/2025: Our paper Novel ViDAR with RL-based Active SLAM has been accepted by TII 2025!
02/2025: Our paper UniAct has been accepted by CVPR 2025!
01/2025: Our paper Robo_MUTUAL has been accepted by ICRA 2025!

Thanks to all collaborators and supporters, looking forward to more exciting research ahead!

Research

My long-term vision is to build the BRIDGE between artificial intelligence and the physical world — shaping truly embodied intelligence. 🤖🧠

X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model

Jinliang Zheng^*, Jianxiong Li^*, me, Dongxiu Liu, Xirui Kang, Yuchun Feng, Yinan Zheng, Jiayin Zou, Yilun Chen, Jia Zeng, Ya-Qin Zhang, Jiangmiao Pang, Jingjing Liu, Tai Wang, Xianyuan Zhan

🏆 Champion @ AgiBot World Challenge @ IROS 2025 Preprint

project page arXiv

A highly scalable cross-embodiment VLA model that achieves remarkable performance with a tiny model size.

PhysiAgent: An Embodied Agent Framework in Physical World

me^*, Jianxiong Li^*, Jinliang Zheng^*, Wencong Zhang, Dongxiu Liu, Yinan Zheng, Haoyi Niu, Junzhi Yu, Xianyuan Zhan

New In ML @ ICML 2025 Workshop

arXiv

An autonomous scaffolding framework to seamlessly integrate VLA and VLM into real-world embodied agents.

Efficient Robotic Policy Learning via Latent Space Backward Planning

Dongxiu Liu^*, Haoyi Niu^*, Zh1hao Wang, Jinliang Zheng, Yinan Zheng, Zhonghong Ou, Jianming Hu, Jianxiong Li, Xianyuan Zhan

ICML 2025 Robot Learning @ ICLR 2025 Workshop

project page arXiv

A lightweight visuomotor policy controls robots with latent, backward, recursive subgoals.

A Novel ViDAR Device With Visual Inertial Encoder Odometry and RL-Based Active SLAM Method

Zhanhua Xin, Zh1hao Wang, Shenghao Zhang, Wanchao Chi, Yan Meng, Shihan Kong, Yan Xiong, Chong Zhang, Yuzhen Liu, Junzhi Yu

IEEE Transactions on Industrial Informatics 2025

arXiv

A novel ViDAR device with reinforcement learning-based active SLAM method.

UniAct: Universal Actions for Enhanced Embodied Foundation Models

Jinliang Zheng^*, Jianxiong Li^*, Dongxiu Liu^*, Yinan Zheng, Zh1hao Wang, Zhonghong Ou, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan

CVPR 2025 Robot Learning @ ICLR 2025 Workshop

project page arXiv

A new embodied foundation modeling framework operating in the Universal Action Space.

Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning

Jianxiong Li^*, Zh1hao Wang^*, Jinliang Zheng^*, Xiaoai Zhou, Guanming Wang, Guanglu Song, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Junzhi Yu, Xianyuan Zhan

ICRA 2025 Open-World Agents @ NeurIPS 2024 Workshop

project page arXiv

An approach for training a multimodal robotic policy via only unimodal datasets.

^* Equal contribution.

Awards & Achievements

[2025] Peking University Graduate Student Special Academic Scholarship
[2024] 1st at AIR Summer 2024 Projects Competition
[2024] Shandong Outstanding Graduates
[2023] Undergraduate National Scholarship
[2023] Huawei Intelligent Foundation Scholarship
[2022] 1st at Formula Student Autonomous China Competition

Academic Service

Reviewer at conferences: ICLR 2026, AAAI 2026
Reviewer at workshops: EWM@NeurIPS 2025, WRL@ICLR 2025