VLM-Aware Meta-Optic Front-End Design for Frozen Vision-Language Models
arXiv:2606.27646v1. Abstract: Conventional machine-vision pipelines typically rely on high-quality optics that produce clean, human-interpretable images, and optical design has therefore been driven by image-level criteria such as resolution, aberration correction, and pixel fidelity. However, such optics are often impractical for size-, cost-, or form-factor-constrained applications, where compact meta-optics offer an attractive alternative but operate under strict physical ef...
arXiv cs.CV
Chanik Kang, Raphaël Pestourie, Haejun Chung