Memory-Managed Long-Context Attention: A Preliminary Study of Editable Request-Local Memory

arXiv:2606.28876v1 Announce Type: new Abstract: Long-context language models often conflate two different goals: compressing history into an efficient state, and maintaining reliable long-term memory. Linear, recurrent, and sparse attention reduce the cost of processing long sequences, but they do not by themselves specify when a fact should be written, overwritten, protected from distractors, or discarded. We study memory-managed long-context attention, a research route that separates a fast re...

arXiv cs.CL ·Junyi Zou, Avrova Donz ·
compartilhar: