Dataset · Audio & Voice

sbintuitions/joyo-kanji-yomi-benchmark

Dataset with 10,000 to 100,000 examples and 34 downloads on Hugging Face.

Hugging Face · Datasets · sbintuitions · January 29, 2026 · ↓ 34 · ♥ 8

The sbintuitions/joyo-kanji-yomi-benchmark dataset is among the featured datasets on Hugging Face, the data used to train and evaluate current models.

Dataset card

Size: 10,000 to 100,000 examples
Tasks: speech synthesis (TTS)
Languages: Japanese
License: MIT
Downloads: 34 · Likes: 8

About the dataset

Joyo Kanji Yomi Benchmark: a kanji-level pronunciation evaluation benchmark for Japanese TTS, covering all 2,136 Joyo kanji and their 4,378 readings with 13,095 native-speaker-verified test sentences.
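To get a feel for the coverage these counts imply, here is a quick arithmetic check using only the three figures quoted in the description. The variable names are illustrative, and the results are plain averages: how sentences are actually distributed across readings is not stated here.

```python
# Figures quoted in the dataset description.
NUM_KANJI = 2136
NUM_READINGS = 4378
NUM_SENTENCES = 13095

# Simple averages; the real per-kanji and per-reading distribution may be uneven.
readings_per_kanji = NUM_READINGS / NUM_KANJI
sentences_per_reading = NUM_SENTENCES / NUM_READINGS

print(f"readings per kanji:    {readings_per_kanji:.2f}")   # 2.05
print(f"sentences per reading: {sentences_per_reading:.2f}")  # 2.99
```

On average, that is about two readings per kanji and about three test sentences per reading.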

How to load

Use the Hugging Face datasets library:

pip install -U datasets

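from datasets import load_dataset

# Downloads the dataset from the Hugging Face Hub (cached after the first call).
ds = load_dataset("sbintuitions/joyo-kanji-yomi-benchmark")
print(ds)

# The split names are not listed on this page, so take the first available
# split instead of assuming one called "train".
first_split = next(iter(ds))
print(ds[first_split][0])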
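Since this page lists neither the split names nor the columns, it may help to inspect the loaded object before writing any evaluation code. The sketch below is one way to do that. `summarize_splits` is a hypothetical helper, not part of the datasets library. The stand-in data, including the split name `test` and the columns `kanji`, `reading`, and `sentence`, is invented for illustration and does not reflect the benchmark's real schema.

```python
from collections.abc import Mapping, Sequence


def summarize_splits(ds: Mapping[str, Sequence]) -> dict[str, dict]:
    """Return row count and column names for every split, without assuming split names."""
    summary = {}
    for name in ds:
        split = ds[name]
        n_rows = len(split)
        # Read the columns from the first row so any container of dict rows works.
        columns = list(split[0].keys()) if n_rows else []
        summary[name] = {"rows": n_rows, "columns": columns}
    return summary


# Hypothetical stand-in for the loaded dataset: the split name, column names,
# and rows below are invented and are NOT taken from the benchmark.
fake_ds = {
    "test": [
        {"kanji": "山", "reading": "やま", "sentence": "山に登る。"},
        {"kanji": "山", "reading": "サン", "sentence": "山脈が続く。"},
    ]
}

for split_name, info in summarize_splits(fake_ds).items():
    print(split_name, info["rows"], info["columns"])
```

The helper relies only on `len()` and integer indexing, so the object returned by `load_dataset` should work in place of `fake_ds`.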
