Robusto-2: Benchmarking Humans & VLMs for Autonomous Driving in Lima & New York City

Robusto-2: Benchmarking Humans & VLMs for Autonomous Driving in Lima & New York City

Research examines how self-driving car systems and humans perform on visual question answering tasks across different geographic locations, revealing that both human and AI respons…

Hugging Face · Daily Papers ·Adrian Cespedes, Marcelo Chincha · ·▲ 1 upvotes

Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.

Autores: Adrian Cespedes, Marcelo Chincha, Dunant Cusipuma, Victor Flores-Benites, David Ortega, Arturo Deza

  • 1 upvotes da comunidade
  • Temas: Visual Question Answering, VLMs, out-of-distribution, multi-modal systems, Self-Driving Cars, dashcam footage

Resumo

Resumo original (em inglês), extraído do paper:

Research examines how self-driving car systems and humans perform on visual question answering tasks across different geographic locations, revealing that both human and AI responses diverge based on question types but show similar performance regardless of location.

Ler o paper completo no Hugging Face →

compartilhar: