Dataset Robótica & RL LLMs & Texto Dados & Embeddings

Anthropic/hh-rlhf

Dataset em destaque no Hugging Face — 28.5 mil downloads. Dataset Card for HH-RLHF Dataset Summary This repository provides access to two different kinds of data: Human preference data about helpfulness and h…

Hugging Face · Datasets ·Anthropic · 26 de janeiro de 2023 ·↓ 28526 ·♥ 1806

O dataset Anthropic/hh-rlhf está entre os destaques do Hugging Face — dados que alimentam o treinamento e a avaliação dos modelos do momento.

Ficha do dataset

Licença: MIT
Downloads: 28.5 mil · Curtidas: 1.8 mil

Sobre o dataset

Dataset Card for HH-RLHF Dataset Summary This repository provides access to two different kinds of data: Human preference data about helpfulness and harmlessness from Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.

Como carregar

Use a biblioteca datasets do Hugging Face:

pip install -U datasets

from datasets import load_dataset

ds = load_dataset("Anthropic/hh-rlhf")
print(ds)
print(ds["train"][0])

Anthropic/hh-rlhf

Ficha do dataset

Sobre o dataset

Como carregar

Tags

Leia também

Um único exemplo basta: o truque de aritmética que reensina um robô

The Google Health API Got a CLI: ghealth is an Open-Source Tool for Your Fitbit Air Data

Optimal any-angle path planning in static and dynamic environments

Stop Pretending Social Robots Are Inevitable