// AI radar

What's happening now

Trending papers, models, and datasets on Hugging Face, plus the official blog, with editorial commentary in Portuguese.


Infatoshi/kernelbench-hard-traces

Featured dataset on Hugging Face (378 downloads). KernelBench-Hard agent traces: frontier coding agents writing optimized CUDA/Triton kernels (FP8 GEMM, paged attention, MoE, W4A16, …).

23.06.2026 ·↓ 378

Blog LLMs & Text

The running list: major tech layoffs in 2026 where employers cited AI

A running look — in reverse chronological order — at the bigger tech companies that have announced significant layoffs this year with AI as a stated factor.

23.06.2026

Blog LLMs & Text

OpenAI launches new initiative to help find and patch open source bugs

OpenAI is using AI to help the open source community better protect itself.

23.06.2026

Paper LLMs & Text

Do Thinking Tokens Help with Safety?

Research reveals that reasoning models' safety outcomes are predictable from early hidden representations, with deliberation appearing but not substantially influencing final respo…

23.06.2026 ·▲ 1

Paper LLMs & Text

ReMMD: Realistic Multilingual Multi-Image Agentic Verification for Multimodal Misinformation Detection

A comprehensive multimodal misinformation detection framework is introduced that handles complex, multilingual content with multiple images and diverse verification approaches, ach…

23.06.2026 ·▲ 2

Paper Image Generation

FLUX3D: High-Fidelity 3D Gaussian Generation with Diffusion-Aligned Sparse Representation

FLUX3D addresses limitations in image-to-3D Gaussian Splatting generation by improving representation learning and cross-modal alignment through specialized architectures and atten…

23.06.2026

Paper Multimodal

FlowR2A: Learning Reward-to-Action Distribution for Multimodal Driving Planning

FlowR2A addresses the tension in multimodal driving planning by combining dense reward supervision with dynamic proposal generation through a flow-matching decoder that learns rewa…

23.06.2026 ·▲ 1

Paper LLMs & Text

What Intermediate Layers Know: Detecting Jailbreaks from Entropy Dynamics

Jailbreak attacks expose vulnerabilities in aligned large language models, revealing that harmful intent is encoded in structured intermediate uncertainty dynamics rather than outp…

23.06.2026 ·▲ 3

Paper LLMs & Text

AGORA: An Archive-Grounded Benchmark for Agentic Workplace Document Reasoning

Large language models face challenges in archive-grounded reasoning tasks involving evidence retrieval and synthesis across diverse document collections, with performance varying s…

23.06.2026

Paper LLMs & Text

MEMPROBE: Probing Long-Term Agent Memory via Hidden User-State Recovery

Long-term memory in LLM agents should be evaluated as an auditable post-interaction artifact by reconstructing structured user state from the agent's memory, as demonstrated by MEM…

23.06.2026 ·▲ 1

Paper LLMs & Text

Escaping the Self-Confirmation Trap: An Execute-Distill-Verify Paradigm for Agentic Experience Learning

EDV is a three-stage framework that uses multiple heterogeneous agents to collaboratively construct reliable experiences for LLM agents, preventing self-confirmatory errors through…

23.06.2026 ·▲ 8

Blog LLMs & Text

Shipping huggingface_hub every week with AI, open tools, and a human in the loop

23.06.2026


1,439 items on the radar