S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

S-Agent is a spatial reasoning framework that enhances visual language models with temporal memory and hierarchical spatial tools to enable continuous 3D world understanding from m…

Hugging Face · Daily Papers ·Yalun Dai, Hao Li · ·▲ 38 upvotes

Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.

Autores: Yalun Dai, Hao Li, Shulin Tian, Runmao Yao, Yuhao Dong, Fangzhou Hong

  • 38 upvotes da comunidade
  • Temas: spatial reasoning, visual language models, temporal memory mechanism, scene memory, agent memory, spatial tools

Resumo

Resumo original (em inglês), extraído do paper:

S-Agent is a spatial reasoning framework that enhances visual language models with temporal memory and hierarchical spatial tools to enable continuous 3D world understanding from multi-view imagery.

Ler o paper completo no Hugging Face →

compartilhar: