Blog
Audio & Voice
Layer-wise Probing of wav2vec 2.0 and Whisper for Consonant Cluster Reduction in African American English
arXiv:2606.23948v1. Abstract: Self-supervised and supervised speech models are increasingly used to investigate which linguistic information their internal representations encode, and at what level of abstraction they encode it. One underexplored phenomenon is consonant cluster reduction (CCR) in African American English (AAE), a widespread phonological process and a source of automatic speech recognition (ASR) disparity. To examine how CCR is represented, we conduct speaker-in...
arXiv cs.CL
Hamid Mojarad, Kevin Tang