MiG-NJU/OmniVideo-100K
Featured dataset on Hugging Face, with 1.9K downloads. OmniVideo-100K: Official repository for OmniVideo-100K, an instruction-tuning dataset introduced in our paper: "OmniVideo-100K: A Dataset for Audio-Vis…
Papers, models, and datasets trending on Hugging Face, plus the official blog, with editorial commentary in Portuguese.
Crawlee for Python: Build a Web Crawling Pipeline with Robots Handling, ...
In this tutorial, we build a complete Crawlee for Python workflow from setup to AI-ready output. We generate a local demo website, then crawl it with BeautifulSoupCrawler, ParselCrawler, and PlaywrightCrawler. We extract titles, metadata, product fields, and JavaScript-rendered cards, and capture full-page screenshots. We then normalize the data, build a link graph, and export JSON, CSV, and RAG-ready JSONL chunks.
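The final export step mentioned above (RAG-ready JSONL chunks) can be illustrated with a minimal standard-library sketch. This is not the tutorial's code: the function names `chunk_text` and `to_jsonl_chunks`, the record fields (`id`, `url`, `title`, `text`), and the word-window sizes are all assumptions chosen for illustration, and the input is assumed to be a list of already-crawled pages as plain dictionaries.

```python
import json
from typing import Iterator


def chunk_text(text: str, max_words: int = 50, overlap: int = 10) -> Iterator[str]:
    """Yield overlapping word windows from text (hypothetical helper)."""
    if overlap >= max_words:
        raise ValueError("overlap must be smaller than max_words")
    words = text.split()
    step = max_words - overlap
    for start in range(0, len(words), step):
        yield " ".join(words[start:start + max_words])
        # Stop once a window has reached the end of the text.
        if start + max_words >= len(words):
            break


def to_jsonl_chunks(pages: list[dict], max_words: int = 50, overlap: int = 10) -> list[str]:
    """Turn crawled pages into one JSON line per chunk (assumed record layout)."""
    lines = []
    for page in pages:
        for i, chunk in enumerate(chunk_text(page["text"], max_words, overlap)):
            record = {
                "id": f"{page['url']}#chunk-{i}",  # assumed ID scheme
                "url": page["url"],
                "title": page.get("title", ""),
                "text": chunk,
            }
            lines.append(json.dumps(record, ensure_ascii=False))
    return lines
```

Writing the result to disk is then a matter of joining the returned lines with newlines. The overlap between windows is one common way to avoid cutting a sentence's context at a chunk boundary; other chunking strategies (by heading, by sentence) would fit the same record layout.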
Trending model on Hugging Face, with 363 downloads and 43 community likes.
Trending model on Hugging Face, with 409 downloads and 42 community likes.
Model with 1.0K downloads and 140 likes on Hugging Face.
Trending model on Hugging Face, with 445 downloads and 42 community likes.
Text-generation model · 12B parameters, with 7.5K downloads and 142 likes on Hugging Face.
Multimodal Chain-of-Thought reasoning shows selective effectiveness across different tasks, with limitations in maintaining visual introspection during reasoning processes.
Interleaved speech-text language models exhibit an implicit transcription phase where text tokens become decodable in intermediate layers, followed by text-based prediction before…
PolicyTrim is a reinforcement learning-based framework that enhances VLA model efficiency by extending reliable action chunk lengths and reducing redundant physical steps through d…
Libretto provides a structured framework for symbolic music generation and revision using LLM-native grammar and statistical evaluation across musical dimensions.
PlanBench-XL evaluates large language model agents' ability to plan and adapt in complex tool-rich environments with limited visibility and dynamic disruptions.