Semantic Browsing: Controllable Diversity for Image Generation

Hugging Face · Daily Papers · Sara Dorfman, Maya Vishnevsky · ▲ 13 upvotes

This paper is featured in Hugging Face's Daily Papers selection, curated by the AI research community.

Authors: Sara Dorfman, Maya Vishnevsky, Omer Dahary, Or Patashnik, Daniel Cohen-Or

  • 13 community upvotes
  • Topics: text-to-image models, semantic browsing, controlled diversity, Vision Language Model, agentic workflow, semantic decision-making

Abstract

Original abstract, as extracted from the paper:

Text-to-image models are enhanced with controlled diversity through semantic browsing capabilities that enable structured navigation of image variations based on meaningful semantic decisions.

