Semantic Browsing: Controllable Diversity for Image Generation
Text-to-image models are enhanced with controlled diversity through semantic browsing capabilities that enable structured navigation of image variations based on meaningful semanti…
Hugging Face · Daily Papers
·Sara Dorfman, Maya Vishnevsky
·
·▲ 13 upvotes
Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.
Autores: Sara Dorfman, Maya Vishnevsky, Omer Dahary, Or Patashnik, Daniel Cohen-Or
- 13 upvotes da comunidade
- Temas: text-to-image models, semantic browsing, controlled diversity, Vision Language Model, agentic workflow, semantic decision-making
Resumo
Resumo original (em inglês), extraído do paper:
Text-to-image models are enhanced with controlled diversity through semantic browsing capabilities that enable structured navigation of image variations based on meaningful semantic decisions.