// radar de ia

O que está acontecendo agora

Papers, modelos e datasets em alta no Hugging Face, além do blog oficial — com leitura editorial em português.

todas LLMs & Texto Geração de Imagem Visão Computacional Áudio & Voz Multimodal Dados & Embeddings Robótica & RL

Causal Discovery in the Era of Agents

Language models should assist causal discovery workflows by providing contextual support and explanations rather than generating causal conclusions, as demonstrated through a platf…

22.06.2026 ·▲ 5

Paper Robótica & RL

Foresight: Failure Detection for Long-Horizon Robotic Manipulation with Action-Conditioned World Model Latents

A failure detection framework for long-horizon robotic tasks uses action-conditioned world models and functional conformal prediction to monitor manipulation trajectories with only…

22.06.2026 ·▲ 7

Paper Visão Computacional

ShotcreteDepth: A Bi-modal Dataset for Robust Robotic Depth Perception in Shotcrete Construction Environments

A bi-modal construction domain dataset combining stereo RGB and LiDAR data under challenging environmental conditions is introduced for autonomous system perception research.

22.06.2026

Paper LLMs & Texto

AOHP: An Open-Source OS-Level Agent Harness for Personalized, Efficient and Secure Interaction

AOHP presents an Android-based operating system framework that treats AI agents as first-class entities, enhancing task completion rates and reducing execution costs through specia…

22.06.2026 ·▲ 25

Paper LLMs & Texto

Training Open Models for Agentic Phone Use

PhoneBuddy combines real and mock app environments to improve training of open models for phone use, demonstrating enhanced task success rates through mixed reinforcement learning…

22.06.2026 ·▲ 6

Paper Visão Computacional

UniverSat: Resolution- and Modality-Agnostic Transformers for Earth Observation

UniverSat introduces a Universal Patch Encoder for Vision Transformers that enables robust, sensor-agnostic spatial feature extraction across diverse Earth Observation data types.

22.06.2026 ·▲ 2

Paper Dados & Embeddings

GUI vs. CLI: Execution Bottlenecks in Screen-Only and Skill-Mediated Computer-Use Agents

Computer-use agents can execute software tasks through either graphical interfaces or programmatic command interfaces, but existing evaluations confound interaction modality with d…

22.06.2026 ·▲ 28

Paper LLMs & Texto

Plans Don't Persist: Why Context Management Is Load Bearing for LLM Agents

Standard LLM agents rely on plan content remaining in context rather than maintaining it as persistent state, with evidence shown through replay pairing diagnostics and compression…

22.06.2026 ·▲ 1

Paper LLMs & Texto

CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents

A principled synthesis engine generates high-quality terminal-agent tasks through multi-dimensional capability taxonomy and evidence-guided research, creating a distilled dataset t…

22.06.2026 ·▲ 24

Paper LLMs & Texto

Dense Reward for Multi-View 3D Reasoning with Global Maps and Local Views

DR-MV3D presents a map-grounded learning framework with dense rewards to improve multi-view 3D visual question answering through global map construction, view-trajectory planning,…

22.06.2026 ·▲ 4

Paper LLMs & Texto

Managing Procedural Memory in LLM Agents: Control, Adaptation, and Evaluation

Procedural memory enhances LLM agents on workplace tasks through skill transfer across roles and models, with varying generalization capabilities affecting deployment strategies.

22.06.2026 ·▲ 17

Paper LLMs & Texto

When Agents Commit Too Soon: Diagnosing Premature Commitment in LLM Agents

Pre premature commitment in long-horizon LLM agents leads to silent failures where agents defend early interpretations without considering alternatives, and hidden-state convergenc…

22.06.2026 ·▲ 1

← anterior 122 / 163 próxima →

1946 itens no radar