Characterizing Narrative Content in Web-scale LLM Pretraining Data
Hugging Face · Daily Papers
This paper is featured in Hugging Face's Daily Papers selection, curated by the AI research community.
Authors: Teagan Johnson, Elliott Ash, Andrew Piper, Maria Antoniak
- 2 community upvotes
- Topics: NarraBERT, RoBERTa, Dolma, NarraDolma, narrative theory, agency
Abstract
Original abstract, extracted from the paper:
A comprehensive analysis of narrative structures in large-scale language model training data reveals measurable, multidimensional narrative patterns that vary across different content sources and topics.