Building better AI benchmarks: How many raters are enough?

Algorithms & Theory

Google Research ·
compartilhar: