Anthropic/hh-rlhf
Dataset em destaque no Hugging Face — 28.5 mil downloads. Dataset Card for HH-RLHF Dataset Summary This repository provides access to two different kinds of data: Human preference data about helpfulness and h…
Hugging Face · Datasets
·Anthropic
·
·↓ 28526
·♥ 1806
O dataset Anthropic/hh-rlhf está entre os destaques do Hugging Face — dados que alimentam o treinamento e a avaliação dos modelos do momento.
Ficha do dataset
- Licença: MIT
- Downloads: 28.5 mil · Curtidas: 1.8 mil
Sobre o dataset
Dataset Card for HH-RLHF Dataset Summary This repository provides access to two different kinds of data: Human preference data about helpfulness and harmlessness from Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.
Como carregar
Use a biblioteca datasets do Hugging Face:
pip install -U datasets
from datasets import load_dataset
ds = load_dataset("Anthropic/hh-rlhf")
print(ds)
print(ds["train"][0])Tags
human-feedback
// relacionados