Blog
LLMs & Texto
Towards Spec Learning: Inference-Time Alignment from Preference Pairs
arXiv:2606.24004v1 Announce Type: new Abstract: Steering a large language model (LLM) toward a desired behavior typically relies on an iterative process of hand-crafting a prompt based on a careful inspection of the model's responses. This is an involved, brittle, and error-prone process. Preference-based fine-tuning is a more rigorous but often prohibitively expensive solution. We propose spec learning, a framework that relies on a brief user instruction and a small set of preference judgments....
arXiv cs.CL
·Dhriti Krishnan, Tejas Goyal, Jaromir Savelka
·