Learning to Fold: prizewinning solution at LeHome Challenge 2026 (1st place online, 2nd offline)

Learning to Fold: prizewinning solution at LeHome Challenge 2026 (1st place online, 2nd offline)

A vision-language-action policy improved with reinforcement learning uses shared network predictions for success estimation and advantage calculation in bimanual garment folding, e…

Hugging Face · Daily Papers ·Ilia Larchenko · ·▲ 4 upvotes

Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.

Autores: Ilia Larchenko

  • 4 upvotes da comunidade
  • Temas: vision-language-action policy, reinforcement-learning loop, value function, advantage estimation, live failure detection, candidate selection

Resumo

Resumo original (em inglês), extraído do paper:

A vision-language-action policy improved with reinforcement learning uses shared network predictions for success estimation and advantage calculation in bimanual garment folding, employing established RL techniques with novel optimization and deployment strategies.

Onde ler

compartilhar: