Position Bias Correction is Insufficient for One-Pass Attention Sorting

arXiv:2606.27793v1 Announce Type: new Abstract: Long-context language models suffer from position bias, where information in middle positions is underutilized. Attention Sorting addresses this by iteratively reordering documents based on attention patterns, but its multiple sort-and-generate cycles increase deployment cost. We hypothesize that position bias is the primary bottleneck and propose Debiased One-Pass Attention Sorting, which estimates a per-prompt position-bias curve from the low-att...

arXiv cs.CL ·Qiong Tang, Xiangkun Hu, Xiangyang Liu, Yiran Chen, Yunfan Shao ·
compartilhar: