// radar de ia

O que está acontecendo agora

Papers, modelos e datasets em alta no Hugging Face, além do blog oficial — com leitura editorial em português.

todas LLMs & Texto Geração de Imagem Visão Computacional Áudio & Voz Multimodal Dados & Embeddings Robótica & RL

MiG-NJU/OmniVideo-100K

Dataset em destaque no Hugging Face — 1.9 mil downloads. OmniVideo-100K Official repository for OmniVideo-100K, an instruction-tuning dataset introduced in our paper: "OmniVideo-100K: A Dataset for Audio-Vis…

21.06.2026 ·↓ 1879

Blog Robótica & RL

Crawlee for Python: Build a Web Crawling Pipeline with Robots Handling, Link Graphs, and RAG Chunk Export

In this tutorial, we build a complete Crawlee for Python workflow from setup to AI-ready output. We generate a local demo website, then crawl it with BeautifulSoupCrawler, ParselCrawler, and PlaywrightCrawler. We extract titles, metadata, product fields, and JavaScript-rendered cards, and capture full-page screenshots. We then normalize the data, build a link graph, and export JSON, CSV, and RAG-ready JSONL chunks. The post Crawlee for Python: Build a Web Crawling Pipeline with Robots Handling, ...

21.06.2026

Modelo LLMs & Texto

Boogu/Boogu-Image-0.1-Turbo-fp8

Modelo de modelo em alta no Hugging Face — 363 downloads e 43 curtidas da comunidade.

21.06.2026 ·↓ 363

Modelo LLMs & Texto

Boogu/Boogu-Image-0.1-Turbo

Modelo de modelo em alta no Hugging Face — 409 downloads e 42 curtidas da comunidade.

21.06.2026 ·↓ 409

Modelo LLMs & Texto

Boogu/Boogu-Image-0.1-Edit

Modelo de modelo — 1.0 mil downloads e 140 curtidas no Hugging Face.

21.06.2026 ·↓ 1017

Modelo LLMs & Texto

Boogu/Boogu-Image-0.1-Base

Modelo de modelo em alta no Hugging Face — 445 downloads e 42 curtidas da comunidade.

21.06.2026 ·↓ 445

Modelo LLMs & Texto

huihui-ai/Huihui-gemma-4-12B-coder-fable5-composer2.5-v1-abliterated

Modelo de geração de texto · 12 B de parâmetros — 7.5 mil downloads e 142 curtidas no Hugging Face.

21.06.2026 ·↓ 7543

Paper LLMs & Texto

Look Light, Think Heavy: What Multimodal Chain-of-Thought Reasoning Can and Cannot Do

Multimodal Chain-of-Thought reasoning shows selective effectiveness across different tasks, with limitations in maintaining visual introspection during reasoning processes.

21.06.2026 ·▲ 7

Paper Áudio & Voz

Interleaved Speech Language Models Latently Work In Text

Interleaved speech-text language models exhibit an implicit transcription phase where text tokens become decodable in intermediate layers, followed by text-based prediction before…

21.06.2026 ·▲ 10

Paper Robótica & RL

PolicyTrim: Boosting Intrinsic Policy Efficiency of Vision-Language-Action Models

PolicyTrim is a reinforcement learning-based framework that enhances VLA model efficiency by extending reliable action chunk lengths and reducing redundant physical steps through d…

21.06.2026 ·▲ 2

Paper LLMs & Texto

Libretto: Giving LLM Agents a Sense of Musical Structure

Libretto provides a structured framework for symbolic music generation and revision using LLM-native grammar and statistical evaluation across musical dimensions.

21.06.2026 ·▲ 1

Paper LLMs & Texto

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

PlanBench-XL evaluates large language model agents' ability to plan and adapt in complex tool-rich environments with limited visibility and dynamic disruptions.

21.06.2026 ·▲ 63

← anterior 139 / 178 próxima →

2126 itens no radar