VOCA: Visual Odometry with Codec Awareness
arXiv:2607.00189v1 Announce Type: new Abstract: Camera pose estimation from image streams is a critical component of spatial world models that integrate perception into planning and decision-making. Nearly all Visual Odometry (VO) and Simultaneous Localization and Mapping (V-SLAM) systems have focused on datasets containing raw, uncompressed videos. Many working systems instead use ubiquitous hardware units to efficiently compress and decode video streams, saving orders of magnitude in storage a...
arXiv cs.CV
·Nouri Alexander Hilscher, Mateo de Mayo, Dominik Muhle, Christoph Otten genannt Hermes, Daniel Cremers
·
// relacionados
Leia também
Blog
A Filtered Mixture-of-Generators for Fully Synthetic Survival Training
Blog
PHREEQC-MCQ-200: A Diagnostic Benchmark for Tool-Augmented Scientific Simulator Agents
Blog
Device Passport: Enabling Spatio-Temporal Pretrained Models to Generalize Across Input Layouts
Blog