Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages
Multi-LCB addresses the limitation of LiveCodeBench by providing a multi-language benchmark for evaluating LLMs across twelve programming languages while maintaining contamination…
Hugging Face · Daily Papers
·Maria Ivanova, Pavel Zadorozhny
·
·▲ 56 upvotes
Este artigo está em destaque na seleção diária de papers do Hugging Face, curada pela comunidade de pesquisa em IA.
Autores: Maria Ivanova, Pavel Zadorozhny, Rodion Levichev, Ivan Petrov, Adamenko Pavel, Ivan Lopatin
- 56 upvotes da comunidade
- Temas: large language models, code-generation tasks, competitive programming problems, contamination-aware evaluation, cross-language code generation, multilingual performance
Resumo
Resumo original (em inglês), extraído do paper:
Multi-LCB addresses the limitation of LiveCodeBench by providing a multi-language benchmark for evaluating LLMs across twelve programming languages while maintaining contamination controls and evaluation protocols.
// relacionados
Leia também
Blog
How Businesses Are Building Specialized AI They Can Trust
Blog
Fika Jobs raises $4M to build a video-first hiring platform where AI agents interview candidates
Blog
Build real agentic apps using CUGA: two dozen working examples on a lightweight harness
Blog