Can Language Models Actually Retrieve In-Context? Drowning in Documents at Million Token Scale

arXiv:2607.01538v1 Announce Type: new Abstract: Language models (LMs) raise an intriguing alternative to vector-based retrieval: conditioning on an in-context corpus and directly generating a relevant answer. However, prior work has largely focused on proprietary systems or the smaller-scale reranking task, leaving corpus-scale in-context retrieval largely unexplored. In this work, we present the first systematic study of in-context retrieval on two scales practical retrievers demand: million-to...

arXiv cs.CL ·Siddharth Gollapudi, Nilesh Gupta, Prasann Singhal, Sewon Min ·
compartilhar: