Goku: A Million-Scale Universal Dataset and Benchmark for Instruction-Based Video Editing

A large-scale video editing dataset and model are introduced that support multi-task and structural manipulations through advanced data synthesis and network architectures.

Hugging Face · Daily Papers · Sen Liang, Cong Wang · January 30, 2026 · ▲ 1 upvote

This paper is featured in Hugging Face's Daily Papers selection, curated by the AI research community.

Authors: Sen Liang, Cong Wang, Zhentao Yu, Fengbin Guan, Zhengguang Zhou, Teng Hu

1 community upvote
Topics: instruction-aligned video editing pairs, data synthesis pipeline, progressive filtering system, MLLM, decoupled dual-branch design, mask branch

Summary

Original abstract, extracted from the paper:

A large-scale video editing dataset and model are introduced that support multi-task and structural manipulations through advanced data synthesis and network architectures.

Where to read

View on Hugging Face

