Embodied AI

Embodied AI bridges intelligence and the physical world through real-world perception, action, and learning. Focusing on humanoids, manipulation, and dexterous hands, we aim to uncover robot scaling laws, develop general world models, and unlock reinforcement learning for general-purpose embodied agents. For a complete list of publications, please see here.

2026.02.11 RISE: Self-Improving Robot Policy with Compositional World Model
The first study on leveraging world models as an effective learning environment for challenging real-world manipulation, bootstrapping performance on tasks requiring high dynamics, dexterity, and precision.
Paper | Page | Video
2026.02.10 EgoHumanoid: Unlocking In-the-Wild Loco-Manipulation with Robot-Free Egocentric Demonstration
The first endorsement of human-to-humanoid transfer for whole-body loco-manipulation.
Paper | Page
2026.02.05 Sparse Video Generation Propels Real-World Beyond-the-View Vision-Language Navigation
We investigate the beyond-the-view navigation task in the real world by introducing a video generation model to this field for the first time, pioneering this capability in challenging night scenarios.
Paper | Page | GitHub | Video
2025.12.24 χ0: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies
"Veni, Vidi, Vici" - I came, I saw, I conquered. We aim to conquer the "Mount Everest" of robotics: 100% reliability in real-world garment manipulation.
Paper | Page | GitHub | Community
2025.12.11 WholeBodyVLA: Towards Unified Latent VLA for Whole-body Loco-manipulation Control
A unified VLA framework enabling large-space humanoid loco-manipulation via unified latent learning and loco-manipulation-oriented RL.
Paper | Page | GitHub
2025.12.03 Intelligent Robot Manipulation Requires Self-Directed Learning
Paper | Cite
2025.11.21 Agility Meets Stability: Versatile Humanoid Control with Heterogeneous Data
A unified whole-body control policy for humanoid robots that enables zero-shot execution of diverse motions, including Ip Man's squat, dancing, running, and real-time teleoperation.
Paper | Page | Video | GitHub
2025.07.08 GO-1-Pro: Is Diversity All You Need for Scalable Robotic Manipulation?
The first comprehensive analysis of data diversity principles revealing optimal scaling strategies for large-scale robotic manipulation training.
Paper | Blog | GitHub | Hugging Face
2025.06.02 FreeTacMan: Robot-free Visuo-Tactile Data Collection System for Contact-rich Manipulation
A human-centric and robot-free visuo-tactile data collection system for high-quality and efficient robot manipulation.
Paper | GitHub | Page | Dataset | Hardware Guide | Video
2025.05.09 UniVLA: Learning to Act Anywhere with Task-centric Latent Actions
A unified vision-language-action framework that enables policy learning across different environments.
Paper | GitHub
2025.03.10 AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
A novel generalist policy that leverages latent action representations to maximize data utilization, demonstrating predictable performance scaling with increased data volume.
Paper | GitHub | Blog | Hugging Face | Video | Challenge
2024.10.10 Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation
Our objective is to develop a synergistic dual-system framework that supplements the generalizability of a large-scale pre-trained generalist with the efficient, task-specific adaptation of a specialist.
Paper | GitHub | Page
2024.09.13 Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
CLOVER employs a text-conditioned video diffusion model to generate visual plans as reference inputs; these sub-goals then guide the feedback-driven policy to generate actions with an error measurement strategy.
Paper | GitHub | Video
2024.06.01 Learning Manipulation by Predicting Interaction
We propose a general pre-training pipeline that learns Manipulation by Predicting the Interaction (MPI).
Paper | Page | GitHub