Skip to content

[Model] DIAL #29

Description

@skt0725

ID (slug)

dial

Name

DIAL

Organization

The University of Hong Kong / XPENG Robotics

Year

2026

Description (English)

DIAL is a dual-system VLA framework that decouples high-level intent from low-level action via a differentiable latent bottleneck. A VLM-based System-2 predicts future visual states as latent intent, which a lightweight DiT-based System-1 decodes into robot actions through flow matching. Built on GR00T N1.5 with Qwen2.5-VL, DIAL achieves state-of-the-art performance with 10x fewer demonstrations and robust zero-shot generalization on a humanoid robot.

Description (Korean)

DIAL은 고수준 의도와 저수준 행동을 미분 가능한 잠재 병목으로 분리하는 이중 시스템 VLA 프레임워크입니다. VLM 기반 System-2가 미래 시각 상태를 잠재 의도로 예측하고, 경량 DiT 기반 System-1이 플로우 매칭을 통해 로봇 행동으로 디코딩합니다. GR00T N1.5와 Qwen2.5-VL 기반으로, 기존 대비 10배 적은 시연으로 SOTA를 달성하며 휴머노이드 로봇에서 강건한 제로샷 일반화 능력을 보여줍니다.

GitHub URL

https://github.com/xpeng-robotics/DIAL

Paper URL (arXiv)

https://arxiv.org/abs/2603.29844

HuggingFace URL

https://huggingface.co/xpeng-robotics/DIAL_checkpoints

Project Page URL

https://xpeng-robotics.github.io/dial/

Categories

  • manipulation
  • locomotion
  • navigation
  • dexterous
  • whole-body
  • aerial

Hardware Targets

  • manipulator
  • humanoid
  • quadruped
  • biped
  • mobile
  • drone
  • hand

Learning Methods

  • VLA
  • IL
  • RL
  • diffusion
  • world_model
  • sim2real

Framework

  • pytorch
  • jax
  • tensorflow
  • other

Communication

  • ros2
  • grpc
  • lcm
  • zenoh

Tags (optional)

VLA, dual-system, latent-world-model, humanoid, flow-matching, GR00T-N1.5

Checklist

  • The model is open-source (code or weights publicly available)
  • At least one URL (GitHub, paper, or HuggingFace) is provided
  • I have read the contribution guidelines

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions