Radar de IA — Notícias, Modelos e Papers

R2HandoverSim: A Simulation Framework and Benchmark for Robot-to-Human Object Handovers

arXiv:2606.21011v1 Announce Type: new Abstract: We present R2HandoverSim, a simulation benchmark for robot-to-human (R2H) object handovers. Although R2H handover methods have advanced rapidly, the lack of standardized evaluation protocols impedes objective comparison. Our benchmark enables reproducible evaluation by systematically comparing four baselines on their predicted shared grasp poses. We conduct a user study with 30 participants, analyze baseline performance, and show that simulation re...

23.06.2026

Blog LLMs & Texto

SkillHarness: Harnessing Safe Skills for Computer-Use Agents

arXiv:2606.20636v1 Announce Type: new Abstract: Computer-Use Agents (CUAs) are increasingly deployed in dynamic interactive environments, creating a growing need for continual skill learning during interaction. Recent approaches address this challenge by learning reusable skills from successful trajectories. However, these skill learning methods largely assume static and safe environments, overlooking risks from adversarial interactions (e.g., prompt injections) and environmental dynamics (e.g.,...

23.06.2026

Blog LLMs & Texto

A Gated Graph Neural Network Approach to Fast-Convergent Dynamic Average Estimation

arXiv:2606.20955v1 Announce Type: new Abstract: Dynamic average estimation is a critical problem in multi-agent systems, enabling agents to collaboratively estimate time-varying signals using only local information exchange. Traditional model-based approaches often face challenges related to convergence speed and sensitivity to network topology changes. This paper introduces a novel learning-based solution leveraging Gated Graph Neural Networks (GGNNs) for fast-convergent dynamic average estimat...

23.06.2026

Blog LLMs & Texto

Hypothesis-Disciplined Multi-Agent Automated Formalization of Asymptotic Statistical Theory

arXiv:2606.20642v1 Announce Type: new Abstract: Asymptotic statistical theory is a challenging domain for AI-assisted formalization: its central results mix convergence statements, asymptotic expansions, functional analysis, and regularity conditions that have a large gap from existing infrastructure in Lean 4 formalization. To address these challenges, we propose a hypothesis-disciplined Lean 4 formalization pipeline built from multiple agents: a manager that coordinates seven specialist roles ...

23.06.2026

Blog Robótica & RL

Pose-Agnostic Robotic Functional Grasping via Observation-Action Canonicalization

arXiv:2606.21148v1 Announce Type: new Abstract: Functional robotic grasping requires a policy that generalizes across diverse object geometries and poses while maintaining task-specific contact precision. We study this challenge through mug-handle grasping, where thin handles, instance variation, and upright or inverted placements make both perception and control sensitive to object configuration. Grasp pose detection methods operate open-loop and are sensitive to estimation errors on thin handl...

23.06.2026

Blog LLMs & Texto

Peeking Inside LLMs: Leveraging Internal Artifacts of LLMs for Enhancing Reliability in Legal Classification

arXiv:2606.20929v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly being adopted in the legal domain. However, despite their strong performance, LLMs are prone to generating incorrect or hallucinated outputs, raising serious concerns about their reliability in high-stakes domains such as law. Detecting the correctness of responses of LLM-based systems is therefore a critical challenge. In this work, we explore the potential of leveraging internal artifacts of LLM to de...

23.06.2026

Blog Robótica & RL

Machine Learning Classification of Cryopathy Syndromes: A Comprehensive Comparative Study

arXiv:2606.20874v1 Announce Type: new Abstract: Cryopathy syndromes are difficult to classify because laboratory patterns often overlap across diagnostic categories, while some diagnoses are rare. This makes routine interpretation of cryoglobulin-related tests challenging and increases dependence on expert judgment. The aim of this study was to develop and compare machine learning approaches for automated classification of cryopathy syndromes from laboratory data and to identify a practical stra...

23.06.2026

Blog LLMs & Texto

The New Associationism: Lessons from Deep Learning

arXiv:2606.20600v1 Announce Type: new Abstract: What can the success of modern AI tell us about how humans learn? This paper argues that taking AI seriously as a model of human learning supports a modest but genuine associationism. The central finding is that supervised learning -- learning driven by evaluative feedback -- underlies a surprisingly wide range of contemporary AI systems, from large language models to game-playing agents, differing primarily in how much work is required to generate...

23.06.2026

Blog LLMs & Texto

Towards Robust Training in NNGPT AutoML Pipeline: A Loss-Optimizer Pairing Selection Study

arXiv:2606.20933v1 Announce Type: new Abstract: The choice of loss function and optimizer is an important decision, that shapes further model training. Yet automated architecture search pipelines (AutoML) benefits significantly more from the optimal pairing selection and vice versa. This paper investigates whether a single recipe is sufficient for heterogeneous architecture pools, or whether the optimal pairing varies across structurally diverse models. We conduct a systematic empirical study of...

23.06.2026

Blog LLMs & Texto

How Should a Robot Configure Its Laser Scanner for Inspection?

arXiv:2606.21093v1 Announce Type: new Abstract: Robotic inspection relies on accurate sensing to acquire high-fidelity geometric measurements for defect detection and metrology. While prior work has focused on robot motion and viewpoint planning, how to configure sensing parameters remains largely underexplored, despite their decisive impact on measurement quality. We propose SenseHD, a robotic sensing system that formulates scanner configuration as an instruction-conditioned sensing decision. I...

23.06.2026

Blog Dados & Embeddings

ELADO: Elliptic PDE Assessment Datasets for Operator Learning

arXiv:2606.20771v1 Announce Type: new Abstract: We introduce ELADO (Elliptic PDE Assessment Datasets for Operator Learning), a systematic benchmark suite constructed to show and quantify failure modes of neural operator architectures when learning solution operators of elliptic PDEs. While the benchmarks of existing datasets focus on average case performance, the ELADO datasets are constructed to highlight challenges that arise naturally in elliptic PDE problems. In particular, we construct seve...

23.06.2026

Blog LLMs & Texto

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

arXiv:2606.20945v1 Announce Type: new Abstract: Self-attention is central to Transformer performance and is often the most expensive part of the Transformer at long context lengths because its pairwise token interactions scale quadratically with sequence length. Standard dense attention also applies the same set of attention heads to every token regardless of token difficulty or information content. This uniform activation can waste compute, especially as sequences grow longer and attention cost...

23.06.2026

O que está acontecendo agora

R2HandoverSim: A Simulation Framework and Benchmark for Robot-to-Human Object Handovers

SkillHarness: Harnessing Safe Skills for Computer-Use Agents

A Gated Graph Neural Network Approach to Fast-Convergent Dynamic Average Estimation

Hypothesis-Disciplined Multi-Agent Automated Formalization of Asymptotic Statistical Theory

Pose-Agnostic Robotic Functional Grasping via Observation-Action Canonicalization

Peeking Inside LLMs: Leveraging Internal Artifacts of LLMs for Enhancing Reliability in Legal Classification

Machine Learning Classification of Cryopathy Syndromes: A Comprehensive Comparative Study

The New Associationism: Lessons from Deep Learning

Towards Robust Training in NNGPT AutoML Pipeline: A Loss-Optimizer Pairing Selection Study

How Should a Robot Configure Its Laser Scanner for Inspection?

ELADO: Elliptic PDE Assessment Datasets for Operator Learning

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention