
PhysiAgent: An Embodied Agent Framework in Physical World
An autonomous scaffolding framework to seamlessly integrate VLA and VLM into real-world embodied agents.
RL · VLA · Embodied Agents.
I am currently a second-year master's student at the School of Mechanics and Engineering Science, Peking University, advised by Professor Junzhi Yu (IEEE Fellow).
I am also a research intern at the Institute for AI Research (AIR), Tsinghua University, advised by Professor Xianyuan Zhan.
Prior to that, I earned my B.Eng. degree in Intelligent Automotive Engineering from the Harbin Institute of Technology (HIT).
My research interests focus on Embodied Intelligence, including VLA, VLM, Embodied Agents, and related areas.
My long-term goal is building the BRIDGE between AI and the physical world.
An autonomous scaffolding framework to seamlessly integrate VLA and VLM into real-world embodied agents.
A lightweight visuomotor policy controls robots with latent, backward, recursive subgoals.
A novel ViDAR device with reinforcement learning-based active SLAM method.
A new embodied foundation modeling framework operating in the Universal Action Space.
An approach for training a multimodal robotic policy via only unimodal datasets.
* Equal contribution.