Anthropic/hh-rlhf

Dataset em destaque no Hugging Face — 28.5 mil downloads. Dataset Card for HH-RLHF Dataset Summary This repository provides access to two different kinds of data: Human preference data about helpfulness and h…

Hugging Face · Datasets ·Anthropic · ·↓ 28526 ·♥ 1806

O dataset Anthropic/hh-rlhf está entre os destaques do Hugging Face — dados que alimentam o treinamento e a avaliação dos modelos do momento.

Ficha do dataset

  • Licença: MIT
  • Downloads: 28.5 mil · Curtidas: 1.8 mil

Sobre o dataset

Dataset Card for HH-RLHF Dataset Summary This repository provides access to two different kinds of data: Human preference data about helpfulness and harmlessness from Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.

Como carregar

Use a biblioteca datasets do Hugging Face:

pip install -U datasets

from datasets import load_dataset

ds = load_dataset("Anthropic/hh-rlhf")
print(ds)
print(ds["train"][0])

Tags

human-feedback

Explorar o dataset no Hugging Face →

compartilhar: