Is Our Benchmark Enough? An Analysis of Continual Learning for MLLMs

arXiv:2606.20961v1 Announce Type: new Abstract: Continual adaptation is essential for multimodal large language models (MLLMs) deployed across evolving domains, but the state-of-the-art MR-LoRA method highly relies on the assumption that a MLLM-based router is necessary to process complex multimodal inputs. This paper revisits this claim on the MLLM-CL benchmark and argues for two claims. \textbf{First}, routing does not require an MLLM: a simple training-free, replay-free ptotypical routing met...

arXiv cs.LG ·Van-Tuan Tran, Shruthi Gowda, Merim Dzaferagic, Marco Ruffini ·
compartilhar: