// AI radar

What's happening now

Trending papers, models, and datasets on Hugging Face, plus the official blog, with editorial commentary in Portuguese.

OpenThoughts-Agent: Data Recipes for Agentic Models

An open-source data curation pipeline for training agentic language models is presented, demonstrating superior performance through systematic experimentation and scalable training…

23.06.2026 ·▲ 27

Paper LLMs & Text

RoPE-Aware Bit Allocation for KV-Cache Quantization

Block-GTQ introduces a RoPE-aware bit allocation method for key-cache quantization that improves attention accuracy and downstream performance through adaptive bit distribution and…

23.06.2026 ·▲ 4

Paper LLMs & Text

Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models

Wan-Streamer is a unified, end-to-end multimodal model that enables real-time audio-visual interaction through causal attention mechanisms and integrated processing of visual, audi…

23.06.2026 ·▲ 34

Paper Image Generation

FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation

Video diffusion models are adapted to decode explicit surface primitives directly from latent space, enabling high-quality 3D scene generation with improved geometric accuracy and…

23.06.2026 ·▲ 13

Paper LLMs & Text

Are We Ready For An Agent-Native Memory System?

Large language model agents' memory systems have evolved into complex data management frameworks requiring systematic evaluation across multiple modules and workloads to understand…

23.06.2026 ·▲ 61

Blog LLMs & Text

How Omio is building the future of conversational travel

Discover how Omio uses OpenAI to power conversational travel experiences, accelerate product development, and transform into an AI-native company.

23.06.2026

Paper Multimodal

InSight: Self-Guided Skill Acquisition via Steerable VLAs

InSight enables autonomous skill acquisition for vision-language-action models through primitive-action level steerability and automated demonstration generation.

23.06.2026

Paper LLMs & Text

Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methods

A large-scale synthetic dataset and specialized model architecture are introduced to address the challenges of artistic text recognition by improving data diversity and model flexi…

23.06.2026 ·▲ 6

Paper LLMs & Text

CAVEWOMAN: How Large Language Models Behave Under Linguistic Input and Output Compression

Two-channel evaluation shows output compression reduces costs while input compression increases costs and degrades accuracy across models and datasets.

23.06.2026 ·▲ 4

Paper LLMs & Text

IV-CoT: Implicit Visual Chain-of-Thought for Structure-Aware Text-to-Image Generation

Implicit Visual Chain-of-Thought decomposes visual conditioning into structural and semantic cascades for improved structure-aware image generation with sketch supervision.

23.06.2026 ·▲ 13

Paper LLMs & Text

Qwen-AgentWorld: Language World Models for General Agents

Language-based world models enable agentic environment simulation across multiple domains and enhance general agent performance through scalable simulation and improved downstream…

23.06.2026 ·▲ 88

Paper LLMs & Text

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

NatureBench presents a cross-disciplinary benchmark of 90 scientific tasks derived from Nature publications to assess AI coding agents' ability to achieve discovery rather than jus…

23.06.2026 ·▲ 48

1439 items on the radar