Blog Áudio & Voz

Layer-wise Probing of wav2vec 2.0 and Whisper for Consonant Cluster Reduction in African American English

arXiv:2606.23948v1 Announce Type: new Abstract: Self-supervised and supervised speech models are increasingly used to investigate which linguistic information their internal representations encode, and at what level of abstraction they encode it. One underexplored phenomenon is consonant cluster reduction (CCR) in African American English (AAE), a widespread phonological process and a source of automatic speech recognition (ASR) disparity. To examine how CCR is represented, we conduct speaker-in...

arXiv cs.CL ·Hamid Mojarad, Kevin Tang · 24 de janeiro de 2026

Ver no Hugging Face

// relacionados

Layer-wise Probing of wav2vec 2.0 and Whisper for Consonant Cluster Reduction in African American English

Leia também

bosonai/higgs-tts-v3-4b

Gradium Launches stt-translate and s2s-translate, Real-Time Speech Translation Models Beating gpt-realtime-translate on Accuracy and Latency

Data Scale, Not Latency, Shapes Cross-Lingual Encoder Transfer in Streaming ASR

Introducing the FFASR Leaderboard: Benchmarking ASR in the Real World