Improving Text-to-Music Generation with Human Preference Rewards

Improving Text-to-Music Generation with Human Preference Rewards

A text-to-music generation system uses reward conditioning, expert iteration, and preference tuning to improve audio quality while maintaining efficiency within a 120M-parameter mo…

Hugging Face · Daily Papers ·Yonghyun Kim, Junwon Lee ·

Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.

Autores: Yonghyun Kim, Junwon Lee, Haiwen Xia, Yinghao Ma, Chris Donahue

  • 0 upvotes da comunidade
  • Temas: FluxAudio-S, FAD-CLAP, CLAP score, TuneJury, twin pairwise ranker, training-time reward conditioning

Resumo

Resumo original (em inglês), extraído do paper:

A text-to-music generation system uses reward conditioning, expert iteration, and preference tuning to improve audio quality while maintaining efficiency within a 120M-parameter model framework.

Ler o paper completo no Hugging Face →

compartilhar: