Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

Hugging Face · Daily Papers · Maria Ivanova, Pavel Zadorozhny · ▲ 56 upvotes

This paper is featured in Hugging Face's Daily Papers selection, curated by the AI research community.

Authors: Maria Ivanova, Pavel Zadorozhny, Rodion Levichev, Ivan Petrov, Adamenko Pavel, Ivan Lopatin

  • 56 community upvotes
  • Topics: large language models, code-generation tasks, competitive programming problems, contamination-aware evaluation, cross-language code generation, multilingual performance

Abstract

Original abstract, taken from the paper:

Multi-LCB addresses the limitation of LiveCodeBench by providing a multi-language benchmark for evaluating LLMs across twelve programming languages while maintaining contamination controls and evaluation protocols.
