LocalNav: Distilling Frontier VLMs and Embodied RL for On-Device Object Goal Navigation
arXiv:2606.27871v1 Announce Type: new Abstract: Vision Language Models (VLMs) have emerged in the robotic domain as a powerful tool that enables environmental perception with language context, serving as a catalyst for open-vocabulary tasks like ObjectNav. Yet, their computational footprint typically confines them to cloud execution, hindering low-latency inference with local deployment on resource-constrained robots. To address this challenge, we present a distillation strategy that transfers c...
arXiv cs.RO
·Nicolas Baumann, Liam Boyle, Pu Deng, Edoardo Ghignone, Boyang Sun, Marc Pollefeys, Luca Benini, Michele Magno
·
// relacionados
Leia também
Blog
HP accelerates enterprise workflows with OpenAI Frontier
Blog
Open Models, Closed Environments: Palantir Brings Secure AI to US Agencies With NVIDIA Nemotron
Blog
Claude Code runs a GitHub repo's hidden malware without verification, giving attackers full control
Blog