Blog Áudio & Voz Dados & Embeddings

What Counts as an Error? Dual-Reference Benchmarking for Atypical ASR

arXiv:2606.31112v1 Announce Type: new Abstract: ASR systems have been often reported to underperform on atypical speech. An often conflated compounding factor is the existence of two valid transcription references: verbatim (actual produced speech, including repetitions/prolongations) and intended (the canonical form of the text with disfluencies removed) in atypical speech recognition depending on context and use-case. Most ASR evaluations conflate this duality into a single ground truth and re...

arXiv cs.CL ·Hawau Olamide Toyin, Srinivasan Umesh, Hanan Aldarmaki · 01 de janeiro de 2026

Ver no Hugging Face

// relacionados

What Counts as an Error? Dual-Reference Benchmarking for Atypical ASR

Leia também

SpaceX has an AI device prototype, and it sure sounds phone-ish

Ashton Kutcher leaving Sound Ventures to launch new VC firm with Morgan Beller

Building a Multimodal Dataset of Academic Paper for Keyword Extraction

Gated Multi-Graph Fusion via Graph Attention Networks for Alzheimer's Disease Detection