Paper LLMs & Texto Dados & Embeddings

AGORA: An Archive-Grounded Benchmark for Agentic Workplace Document Reasoning

Large language models face challenges in archive-grounded reasoning tasks involving evidence retrieval and synthesis across diverse document collections, with performance varying s…

Hugging Face · Daily Papers ·Honglin Guo, Qi Zhang · 23 de janeiro de 2026

Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.

Autores: Honglin Guo, Qi Zhang, Yu Zhang, Weijie Li, Rui Zheng, Zhikai Lei

0 upvotes da comunidade
Temas: large language models, archive-grounded reasoning, evidence retrieval, cross-document task synthesis, leakage-preventing obfuscation, difficulty filtering

Resumo

Resumo original (em inglês), extraído do paper:

Large language models face challenges in archive-grounded reasoning tasks involving evidence retrieval and synthesis across diverse document collections, with performance varying significantly across domains.

Ler o paper completo no Hugging Face →

Ver no Hugging Face

// relacionados

AGORA: An Archive-Grounded Benchmark for Agentic Workplace Document Reasoning

Resumo

Leia também

Europe is pushing back on Washington’s chip war

Comfy-Org/Krea-2

Cerebras stock plunges after earnings as CEO says margin outlook was misunderstood

OpenAI and Broadcom announce chip designed for LLM inference at scale