allenai/olmOCR-bench

Featured dataset on Hugging Face, with 5.9k downloads. olmOCR-bench is a dataset of 1,403 PDF files, plus 7,010 unit test cases that capture properties of the output that a good OCR system sho…

Hugging Face · Datasets · allenai · ↓ 5929 · ♥ 243

The allenai/olmOCR-bench dataset is among the featured datasets on Hugging Face: data used to train and evaluate current models.

  • 5.9k downloads
  • 243 likes

About the dataset

olmOCR-bench is a dataset of 1,403 PDF files, plus 7,010 unit test cases that capture properties of the output that a good OCR system should have.
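
To make the idea of a unit test on OCR output concrete, here is a minimal sketch in Python. It is illustrative only: the class names (PresenceCheck, OrderCheck), the pass_rate helper, and the matching rules are assumptions made for this example and do not reflect the benchmark's actual test schema or scoring code.

```python
from dataclasses import dataclass


def _normalize(s: str) -> str:
    # Collapse runs of whitespace and lowercase, so line wrapping
    # and casing differences do not cause false failures.
    return " ".join(s.split()).lower()


@dataclass
class PresenceCheck:
    """Hypothetical test case: a phrase that must appear in the OCR output."""
    phrase: str

    def passes(self, ocr_text: str) -> bool:
        return _normalize(self.phrase) in _normalize(ocr_text)


@dataclass
class OrderCheck:
    """Hypothetical test case: one phrase must appear before another."""
    first: str
    second: str

    def passes(self, ocr_text: str) -> bool:
        text = _normalize(ocr_text)
        i = text.find(_normalize(self.first))
        j = text.find(_normalize(self.second))
        return i != -1 and j != -1 and i < j


def pass_rate(checks, ocr_text: str) -> float:
    """Fraction of checks that pass for one document's OCR output."""
    if not checks:
        return 0.0
    return sum(1 for c in checks if c.passes(ocr_text)) / len(checks)
```

Each check tests a single property of the output text and returns pass or fail, so a system can be scored by the fraction of checks it satisfies rather than by an edit distance against a full reference transcription.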
