// radar de ia

O que está acontecendo agora

Papers, modelos e datasets em alta no Hugging Face, além do blog oficial — com leitura editorial em português.

todas LLMs & Texto Geração de Imagem Visão Computacional Áudio & Voz Multimodal Dados & Embeddings Robótica & RL

AGORA: An Archive-Grounded Benchmark for Agentic Workplace Document Reasoning

Large language models face challenges in archive-grounded reasoning tasks involving evidence retrieval and synthesis across diverse document collections, with performance varying s…

23.06.2026

Paper LLMs & Texto

MEMPROBE: Probing Long-Term Agent Memory via Hidden User-State Recovery

Long-term memory in LLM agents should be evaluated as an auditable post-interaction artifact by reconstructing structured user state from the agent's memory, as demonstrated by MEM…

23.06.2026 ·▲ 1

Paper LLMs & Texto

Escaping the Self-Confirmation Trap: An Execute-Distill-Verify Paradigm for Agentic Experience Learning

EDV is a three-stage framework that uses multiple heterogeneous agents to collaboratively construct reliable experiences for LLM agents, preventing self-confirmatory errors through…

23.06.2026 ·▲ 8

Blog LLMs & Texto

Shipping huggingface_hub every week with AI, open tools, and a human in the loop

23.06.2026

Paper LLMs & Texto

OpenThoughts-Agent: Data Recipes for Agentic Models

An open-source data curation pipeline for training agentic language models is presented, demonstrating superior performance through systematic experimentation and scalable training…

23.06.2026 ·▲ 27

Paper LLMs & Texto

RoPE-Aware Bit Allocation for KV-Cache Quantization

Block-GTQ introduces a RoPE-aware bit allocation method for key-cache quantization that improves attention accuracy and downstream performance through adaptive bit distribution and…

23.06.2026 ·▲ 4

Paper LLMs & Texto

Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models

Wan-Streamer is a unified, end-to-end multimodal model that enables real-time audio-visual interaction through causal attention mechanisms and integrated processing of visual, audi…

23.06.2026 ·▲ 34

Paper Geração de Imagem

FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation

Video diffusion models are adapted to decode explicit surface primitives directly from latent space, enabling high-quality 3D scene generation with improved geometric accuracy and…

23.06.2026 ·▲ 13

Paper LLMs & Texto

Are We Ready For An Agent-Native Memory System?

Large language model agents' memory systems have evolved into complex data management frameworks requiring systematic evaluation across multiple modules and workloads to understand…

23.06.2026 ·▲ 61

Blog LLMs & Texto

How Omio is building the future of conversational travel

Discover how Omio uses OpenAI to power conversational travel experiences, accelerate product development, and transform into an AI-native company.

23.06.2026

Paper Multimodal

InSight: Self-Guided Skill Acquisition via Steerable VLAs

InSight enables autonomous skill acquisition for vision-language-action models through primitive-action level steerability and automated demonstration generation.

23.06.2026

Paper LLMs & Texto

Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methods

A large-scale synthetic dataset and specialized model architecture are introduced to address the challenges of artistic text recognition by improving data diversity and model flexi…

23.06.2026 ·▲ 6

← anterior 48 / 94 próxima →

1118 itens no radar