// AI radar

LLMs & Text

Trending papers, models, and datasets on Hugging Face, plus the official blog, with editorial commentary in Portuguese.


Deep Research in Physical Sciences: A Multi-Agent Framework and Comprehensive Benchmark

PhySciBench benchmark reveals limited performance of current LLM agents in physical science research, leading to development of DelveAgent framework that improves accuracy through…

17.06.2026 ·▲ 10

Paper Image Generation

BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation

A 3D brain MRI generative model uses a masked-autoencoder tokenizer to create clinically informative embeddings that support both medical task performance and controlled image gene…

17.06.2026 ·▲ 7

Paper LLMs & Text

GateMem: Benchmarking Memory Governance in Multi-Principal Shared-Memory Agents

Current memory agents lack reliable shared institutional deployment due to challenges in balancing utility, access control, and forgetting across multiple principals with diverse a…

17.06.2026 ·▲ 13

Paper LLMs & Text

Learning from Your Own Mistakes: Constructing Learnable Micro-Reflective Trajectories for Self-Distillation

Trajectory-Augmented Policy Optimization (TAPO) enhances large language model reasoning by creating explicit corrective trajectories that preserve erroneous reasoning while incorpo…

17.06.2026 ·▲ 13

Paper LLMs & Text

FAPO: Fully Autonomous Prompt Optimization of Multi-Step LLM Pipelines

FAPO optimizes LLM pipelines by combining prompt editing with structural changes, demonstrating superior performance across multiple benchmarks and security tasks.

17.06.2026 ·▲ 10

Blog LLMs & Text

Agentic Resource Discovery: Let agents search

17.06.2026

Paper LLMs & Text

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

PerceptionDLM enables efficient parallel region perception in multimodal diffusion language models through structured attention masking and efficient prompting, achieving faster in…

17.06.2026 ·▲ 50

Paper LLMs & Text

OpenRath: Session-Centered Runtime State for Agent Systems

OpenRath introduces a PyTorch-like programming model for multi-agent systems using Session as a central runtime abstraction that enables explicit fork, merge, and replay operations…

17.06.2026 ·▲ 12

Paper LLMs & Text

Comparing Linear Probes with Mahalanobis Cosine Similarity

The Mahalanobis cosine similarity provides a theoretically grounded method for comparing linear probes that correlates strongly with out-of-distribution performance metrics.

17.06.2026 ·▲ 1

Paper Data & Embeddings

Configurable Clinical Information Extraction with Agentic RAG: What Works, What Breaks, and Why

ACIE, an agentic RAG system deployed in a clinical setting, demonstrates high accuracy in extracting medical information from complex patient contexts, achieving 96.

17.06.2026 ·▲ 5

Paper LLMs & Text

WorldLines: Benchmarking and Modeling Long-Horizon Stateful Embodied Agents

WorldLines benchmark evaluates long-term memory in embodied agents through household scenarios, while ObsMem framework addresses challenges in partial observability and memory tran…

17.06.2026 ·▲ 3

Blog LLMs & Text

Hands Free, AIs Forward: NVIDIA XR AI Brings Agents to AR Glasses

NVIDIA XR AI is now available in public beta, giving developers a framework for building multimodal AI agents for AR glasses and XR devices.

16.06.2026


513 items on the radar