Progressive Pixel-Neighborhood Deformable Cross-Attention for Multispectral Object Detection

arXiv:2606.24092v1. Abstract: Effective cross-modal feature alignment and interaction are central challenges in multispectral object detection. Although global cross-attention provides strong long-range modeling ability, its quadratic complexity with respect to feature size limits deployment on resource-constrained platforms. We therefore propose Progressive Pixel-Neighborhood Deformable Cross-Attention for multispectral feature fusion, termed PNAFusion. The proposed framework ...

arXiv cs.CV · Tian Qiu, Jifeng Shen, Xin Zuo