DiPS: Dialogue Policy Selection for High-Stakes Persuasion Agents

arXiv:2607.01557v1 Announce Type: new Abstract: Large Language Models (LLMs) often struggle with persuasion in high-stakes scenarios. People's individual personalities and concerns require tailored strategies rather than a one-size-fits-all approach. To address this challenge, we focus on a fire-rescue scenario in which an operator must persuade a resident to evacuate as a high-stakes persuasion domain and propose Dialogue Policy Selection (DiPS), a Q-learning framework to dynamically select per...

arXiv cs.CL ·Tianyi Zhang, Mousumi Das, Abrar Anwar, Jesse Thomason, David Traum ·
compartilhar: