Dataset Áudio & Voz

sbintuitions/joyo-kanji-yomi-benchmark

Dataset com 10 mil – 100 mil exemplos — 34 downloads no Hugging Face. Joyo Kanji Yomi Benchmark A kanji-level pronunciation evaluation benchmark for Japanese TTS, covering all 2,136 Joyo kanji and their 4,378 readings wi…

Hugging Face · Datasets ·sbintuitions · ·↓ 34 ·♥ 8

O dataset sbintuitions/joyo-kanji-yomi-benchmark está entre os destaques do Hugging Face — dados que alimentam o treinamento e a avaliação dos modelos do momento.

Ficha do dataset

  • Tamanho: 10 mil – 100 mil exemplos
  • Tarefas: síntese de voz
  • Idiomas: japonês
  • Licença: MIT
  • Downloads: 34 · Curtidas: 8

Sobre o dataset

Joyo Kanji Yomi Benchmark A kanji-level pronunciation evaluation benchmark for Japanese TTS, covering all 2,136 Joyo kanji and their 4,378 readings with 13,095 native-speaker-verified test sentences.

Como carregar

Use a biblioteca datasets do Hugging Face:

pip install -U datasets

from datasets import load_dataset

ds = load_dataset("sbintuitions/joyo-kanji-yomi-benchmark")
print(ds)
print(ds["train"][0])

Tags

text-to-speech tts-evaluation japanese kanji chinese-character pronunciation

Explorar o dataset no Hugging Face →

compartilhar: