网站背景

AdaMimic

Towards Adaptable Humanoid Control via Adaptive Motion Tracking

Towards Adaptable Humanoid Control via
Adaptive Motion Tracking

1Shanghai AI Laboratory 2Shanghai Jiao Tong University Equal Advising

Overview

Humanoid robots are envisioned to adapt demonstrated motions to diverse real-world conditions while accurately preserving motion patterns. Existing motion prior approaches enable well adaptability with a few motions but often sacrifice imitation accuracy, whereas motion tracking methods achieve accurate imitation yet require many training motions and a test-time target motion to adapt. To combine their strengths, we introduce AdaMimic, a novel motion tracking algorithm that enables adaptable humanoid control from a single reference motion. To reduce data dependence while ensuring adaptability, our method first creates an augmented dataset by sparsifying the single reference motion into keyframes and applying light editing with minimal physical assumptions. A policy is then initialized by tracking these sparse keyframes to generate dense intermediate motions, and adapters are subsequently trained to adjust tracking speed and refine low-level actions based on the adjustment, enabling flexible time warping that further improves imitation accuracy and adaptability. We validate these significant improvements in our approach in both simulation and the real-world Unitree G1 humanoid robot in multiple tasks across a wide range of adaptation conditions.


Adaptation Performance

Jump Up Step

Tennis Hit

Far Jump

Jump Down Step

Badminton Hit

Triple Jump


Baseline Comparision

Tennis Hit

High Jump

Far Jump

Jump Up Step


Method

Interpolate start reference image.

Our method learns adaptive motion tracking from a single reference motion via the following key design choices:

  • Motion Processing: The retargeted reference motion is sparsified and edited to obtain the augmented motions for adaptation.
  • Training Stage 1: The augmented motions initialize a tracking policy with sparse rewards in global coordinates and dense rewards in local coordinates, which are optimized by double critics separately.
  • Training Stage 2: Two adapters are trained to enable effective time warping for improved tracking performance.
  • Hardware Deployment: The policy can be directly deployed on Unitree G1 robot with lidar odometry.
Interpolate start reference image.

Related Links

Many excellent works inspire the design of AdaMimic.

  • The designs of motion adaptation and keyframing are inspired by Motion Path Editing and RobotKeyframing.
  • The design of time warping (phase adapter) is inspired by a very related work. Differently, we explore keyframe-based adapation.
  • The design of two-stage framework is partially inspired by ASAP. We found 1-stage training struggles at optimization.
  • The design of double-critic module is inspired by RobotKeyframing, Embrace Collisions, and BeamDojo.
  • The design of hardware PD controller is inspired by BeyondMimic. We found it significantly improves hardware stability, smoothness, and safety.
  • The ideas of baselines DeepMimic and AMP are fundamental.

BibTeX


  @article{huang2025adaptive,
    title={Towards Adaptable Humanoid Control via Adaptive Motion Tracking},
    author={Huang, Tao and Wang, Huayi and Ren, Junli and Yin, Kangning and Wang, Zirui and Chen, Xiao and Jia, Feiyu and Zhang, Wentao and Long, Jungfeng and Wang, Jingbo and Pang, Jiangmiao},
    year={2025}
  }