EBench: Elemental Diagnosis of Generalist Mobile Manipulation Policies
EBench is a comprehensive simulation benchmark for evaluating generalist mobile manipulation policies across diverse tasks and dimensions, revealing distinct capability profiles an…
Hugging Face · Daily Papers
·Ning Gao, Jinliang Zheng
·
·▲ 13 upvotes
Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.
Autores: Ning Gao, Jinliang Zheng, Xing Gao, Haoxiang Ma, Hanqing Wang, Yukai Wang
- 13 upvotes da comunidade
- Temas: generalist manipulation policies, simulation benchmark, capability dimensions, generalization dimensions, success-rate scalar, manipulation tasks
Resumo
Resumo original (em inglês), extraído do paper:
EBench is a comprehensive simulation benchmark for evaluating generalist mobile manipulation policies across diverse tasks and dimensions, revealing distinct capability profiles and generalization patterns among state-of-the-art models.