Duration Aware Scheduling for ASR Serving Under Workload Drift
Duration-aware scheduling policies improve ASR serving latency by leveraging audio length as a predictor for processing time, with SJF and HRRN algorithms showing significant media…
Hugging Face · Daily Papers
·Darshan Makwana, Yash Jogi
·
·▲ 3 upvotes
Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.
Autores: Darshan Makwana, Yash Jogi, Harsh Kotta, Aayush Kubba
- 3 upvotes da comunidade
- Temas: Automatic Speech Recognition, scheduling policies, end-to-end latency, first-come-first-served, Shortest Job First, Highest Response Ratio Next
Resumo
Resumo original (em inglês), extraído do paper:
Duration-aware scheduling policies improve ASR serving latency by leveraging audio length as a predictor for processing time, with SJF and HRRN algorithms showing significant median latency reductions while maintaining throughput.
// relacionados
Leia também
Blog
How to burst the AI bubble: Strike at its roots
Blog
MindAlign: Decoding Inner Speech from fMRI Signals via Multimodal Embedding Alignment under Limited Data
Blog
EmoInstruct-TTS: Dual-Path Instruction-Guided Emotional Speech Synthesis
Blog