ELASTIC: Efficiently Learning to Adaptively Scale Test-Time Compute for Generative Control Policies

arXiv:2606.31132v1 Announce Type: new Abstract: Generative control policies (GCPs), such as diffusion policies and flow-based vision-language-action models, enable test-time scaling in robot control. Test-time compute can be allocated along two axes: sequential scaling, which increases denoising steps to refine actions, and parallel scaling, which samples multiple candidate actions to search across modes of the policy distribution. However, the optimal allocation of sequential and parallel compu...

arXiv cs.RO ·Andrew Zou Li, Gokul Swamy, Yonatan Bisk, Andrea Bajcsy ·
compartilhar: