What Counts as an Error? Dual-Reference Benchmarking for Atypical ASR
arXiv:2606.31112v1. Abstract: ASR systems are often reported to underperform on atypical speech. A frequently overlooked confounding factor is that atypical speech admits two valid transcription references, depending on context and use case: verbatim (the speech as actually produced, including repetitions and prolongations) and intended (the canonical form of the text with disfluencies removed). Most ASR evaluations conflate this duality into a single ground truth and re...
arXiv cs.CL
Hawau Olamide Toyin, Srinivasan Umesh, Hanan Aldarmaki
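
To make the verbatim/intended distinction in the abstract concrete, here is a minimal sketch of scoring one ASR hypothesis against both references with word error rate (WER). It is an illustration only, not the authors' benchmark: the function names (`word_edit_distance`, `wer`, `dual_reference_wer`), the lowercase-and-whitespace-split normalization, and the example utterance are all assumptions made for this sketch.

```python
def word_edit_distance(ref, hyp):
    """Levenshtein distance between two token lists (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, ref_tok in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_tok in enumerate(hyp, start=1):
            cost = 0 if ref_tok == hyp_tok else 1
            curr.append(min(prev[j] + 1,          # deletion of a reference word
                            curr[j - 1] + 1,      # insertion of a hypothesis word
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: edit distance divided by the reference length."""
    # Assumed normalization: lowercase and split on whitespace only.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return word_edit_distance(ref, hyp) / len(ref)


def dual_reference_wer(verbatim, intended, hypothesis):
    """Score the same hypothesis against both references (hypothetical helper)."""
    return {
        "verbatim": wer(verbatim, hypothesis),
        "intended": wer(intended, hypothesis),
    }


# Made-up example: the speaker repeats the first word twice.
verbatim_ref = "i i i want to go home"
intended_ref = "i want to go home"
asr_output = "i want to go home"

scores = dual_reference_wer(verbatim_ref, intended_ref, asr_output)
print(scores)
```

In this made-up example the hypothesis matches the intended reference exactly (WER 0) but is charged two deletions out of seven words against the verbatim reference (WER of about 0.29), so the same output counts as either perfect or erroneous depending on which reference is treated as ground truth.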