Blog LLMs & Texto

NLL-Guided Full-Attention Layer Selection for Training-Free Sliding-Window Adaptation

arXiv:2606.27791v1 Announce Type: new Abstract: Hybrid attention models that mix full and sliding-window attention across layers offer a promising approach to efficient long-context inference, but the critical question of \emph{which layers} should retain full attention remains unsolved. Existing methods use either fixed periodic patterns or attention-based heuristics that may not capture what matters for downstream accuracy. We propose NLL-guided layer selection, a training-free method that dir...

arXiv cs.CL ·Qiong Tang, Xiangkun Hu, Xiangyang Liu, Yiran Chen, Yunfan Shao · 29 de janeiro de 2026

Ver no Hugging Face

// relacionados

NLL-Guided Full-Attention Layer Selection for Training-Free Sliding-Window Adaptation

Leia também

The US military used AI to pick thousands of targets but missed a note saying one was a school

HP accelerates enterprise workflows with OpenAI Frontier

O fantasma do Fable 5: banido, o modelo vive nos datasets que o destilam

MultiHashFormer: e se cada palavra fosse uma impressão digital?