DiPS: Dialogue Policy Selection for High-Stakes Persuasion Agents

arXiv:2607.01557v1. Abstract: Large Language Models (LLMs) often struggle with persuasion in high-stakes scenarios. People's individual personalities and concerns require tailored strategies rather than a one-size-fits-all approach. To address this challenge, we take as our high-stakes persuasion domain a fire-rescue scenario in which an operator must persuade a resident to evacuate, and we propose Dialogue Policy Selection (DiPS), a Q-learning framework to dynamically select per...
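For readers unfamiliar with Q-learning, here is a minimal tabular sketch of the general idea of learning which dialogue policy to pick in a given conversational state. It is not the DiPS implementation: the abstract is cut off before it describes the method's details, so the state names, the policy names in `POLICIES`, the reward value, and the hyperparameters below are all hypothetical and chosen only for illustration. Only the update rule itself is the standard Q-learning formula.

```python
import random
from collections import defaultdict

# Hypothetical candidate dialogue policies (illustrative, not from the paper).
POLICIES = ["empathize", "give_facts", "urge_action"]


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one standard tabular Q-learning update.

    Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
    """
    best_next = max(q[(next_state, a)] for a in POLICIES)
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


def select_policy(q, state, epsilon=0.1, rng=random):
    """Epsilon-greedy selection: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.choice(POLICIES)
    return max(POLICIES, key=lambda a: q[(state, a)])


# Q-table with every unseen (state, policy) pair defaulting to 0.0.
q = defaultdict(float)

# One hypothetical transition: empathizing with a hesitant resident
# led to agreement and earned a reward of 1.0.
q_update(q, "resident_hesitant", "empathize", reward=1.0,
         next_state="resident_agrees")

# With exploration turned off, the agent picks the highest-valued policy.
chosen = select_policy(q, "resident_hesitant", epsilon=0.0)
```

After this single update, the value for empathizing in the hesitant state rises above the untouched alternatives, so the greedy choice in that state becomes `"empathize"`. A real system would need a learned or engineered state representation of the dialogue and a reward signal tied to the persuasion outcome, neither of which this sketch attempts.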

arXiv cs.CL · Tianyi Zhang, Mousumi Das, Abrar Anwar, Jesse Thomason, David Traum · January 3, 2026
