The Tatoxa System for Text Detoxification in Low-Resource Languages: The Case of Tatar
Hugging Face · Daily Papers
Ilseyar Alimova, Bogdan Monogov, et al. · ▲ 7 upvotes
This paper is featured in the Hugging Face Daily Papers selection, curated by the AI research community.
Authors: Ilseyar Alimova, Bogdan Monogov, Artyom Mazur, Daniil Antonov, Vsevolod Karimov, Vitaliy Egorov
- 7 community upvotes
- Topics: text detoxification, low-resource languages, Tatar language, comparative experiments, cross-lingual transfer, fine-tuning
Abstract
Original abstract, extracted from the paper:
Tatoxa is a state-of-the-art text detoxification system for the Tatar language that demonstrates superior performance over existing LLMs and highlights the challenges of cross-lingual transfer in low-resource settings.