VLM-AR3L: Vision-Language Models for Absolute and Relative Rewards in Reinforcement Learning

arXiv:2607.00483v1 Announce Type: new Abstract: Designing effective reward functions remains a major challenge in reinforcement learning (RL), particularly in open-ended environments where task goals are abstract and difficult to quantify. In this work, we present VLM-AR3L, a framework that leverages Vision-Language Models (VLMs) to provide both absolute and relative rewards for RL. VLM-AR3L interprets an agent's visual observations in the context of a natural language task goal, and learns both...

arXiv cs.RO ·Kuan-Chen Chen, Winston Chen, Wei-Fang Sun, Min-Chun Hu ·
compartilhar: