EBench: Elemental Diagnosis of Generalist Mobile Manipulation Policies

EBench: Elemental Diagnosis of Generalist Mobile Manipulation Policies

EBench is a comprehensive simulation benchmark for evaluating generalist mobile manipulation policies across diverse tasks and dimensions, revealing distinct capability profiles an…

Hugging Face · Daily Papers ·Ning Gao, Jinliang Zheng · ·▲ 13 upvotes

Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.

Autores: Ning Gao, Jinliang Zheng, Xing Gao, Haoxiang Ma, Hanqing Wang, Yukai Wang

  • 13 upvotes da comunidade
  • Temas: generalist manipulation policies, simulation benchmark, capability dimensions, generalization dimensions, success-rate scalar, manipulation tasks

Resumo

Resumo original (em inglês), extraído do paper:

EBench is a comprehensive simulation benchmark for evaluating generalist mobile manipulation policies across diverse tasks and dimensions, revealing distinct capability profiles and generalization patterns among state-of-the-art models.

Ler o paper completo no Hugging Face →

compartilhar: