Duration Aware Scheduling for ASR Serving Under Workload Drift

Duration Aware Scheduling for ASR Serving Under Workload Drift

Duration-aware scheduling policies improve ASR serving latency by leveraging audio length as a predictor for processing time, with SJF and HRRN algorithms showing significant media…

Hugging Face · Daily Papers ·Darshan Makwana, Yash Jogi · ·▲ 3 upvotes

Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.

Autores: Darshan Makwana, Yash Jogi, Harsh Kotta, Aayush Kubba

  • 3 upvotes da comunidade
  • Temas: Automatic Speech Recognition, scheduling policies, end-to-end latency, first-come-first-served, Shortest Job First, Highest Response Ratio Next

Resumo

Resumo original (em inglês), extraído do paper:

Duration-aware scheduling policies improve ASR serving latency by leveraging audio length as a predictor for processing time, with SJF and HRRN algorithms showing significant median latency reductions while maintaining throughput.

Ler o paper completo no Hugging Face →

compartilhar: