Radar de IA — Notícias, Modelos e Papers

Learning-Based Modeling of Soft Robots via Cosserat Rod Theory

arXiv:2606.20958v1 Announce Type: new Abstract: Modeling soft robot dynamics is challenging due to their continuum structure and typically nonlinear dynamics. Creating models based on first-order principles is typically time-demanding, and their expressiveness is limited, whereas data-driven models lack interpretability and physical consistency. This work aims to overcome these challenges by introducing a port-Hamiltonian Gaussian Process Regression framework for learning and simulating the dyna...

23.06.2026

Blog Robótica & RL

Factor-Aware Mixture-of-Experts with Pretrained Encoder for Combinatorial Generalization

arXiv:2606.21100v1 Announce Type: new Abstract: The integration of pretrained encoders with diffusion policies has become a dominant paradigm for visual robotic manipulation. However, it still struggles to generalize across complex environments with varying factors such as lighting and surface textures. To address this, we propose FAME, a framework that integrates a factor-aware mixture-of-experts (MoE) with a pretrained encoder to enhance generalization to environmental variations. FAME follows...

23.06.2026

Blog LLMs & Texto

LLM-Based Multi-Reference Evaluation for Efficient and Robust Assessment of Phrase Break Annotations

arXiv:2606.21098v1 Announce Type: new Abstract: Reliable evaluation of phrase break annotations is crucial, as subtle variations in prosodic boundaries directly affect the clarity and naturalness of speech. However, existing approaches exhibit major limitations: single-reference evaluation assumes a unique gold phrasing for an utterance despite multiple valid phrasings, while human judgment, though flexible, is labor-intensive and unscalable. To address these, we propose LLM-based Multi-Referenc...

23.06.2026

Blog LLMs & Texto

In LLM Reasoning, there is Irrationality on top of Value Misalignment

arXiv:2606.20624v1 Announce Type: new Abstract: Significant progress has been made in aligning LLMs with target value functions. We argue that, even when an LLM has been well aligned in (post-)training, it may still fail to maximise the aligned value in reasoning. We mathematically formalise this gap as rational value risk: the utility discrepancy between a model's deployed reasoning strategy and its rational counterpart, which is defined to be the responses that maximise expected utility in the...

23.06.2026

Blog Multimodal

Beyond 'One Language, One Script': Quantifying Orthographic Bias in Multilingual VLMs with PuMVR

arXiv:2606.20770v1 Announce Type: new Abstract: Current Vision-Language Models (VLMs) are celebrated for their multilingual capabilities, yet they operate under a flawed assumption: that one language corresponds to a single writing system. This overlooks billions of users of multi-script languages like Punjabi, Serbian, Hindi-Urdu, Kurdish, among many others, for whom a model's capability may be fractured by orthographic bias. We introduce PuMVR (Punjabi Multimodal Visual Reasoning), the first b...

23.06.2026

Blog Multimodal

Evidential Fusion Network for Multimodal Survival Prediction under Missing Modalities

arXiv:2606.20757v1 Announce Type: new Abstract: Recent multimodal survival prediction models have demonstrated strong predictive performance by leveraging complementary information across modalities. However, such models generally assume data completeness and exhibit limited robustness toward missing modalities, which are frequently encountered in real-world clinical settings. We propose the Evidential Missing Modality Survival Fusion (EMMS) model for multimodal survival prediction under missing...

23.06.2026

Blog Dados & Embeddings

Understanding Latent Flow Models for Tabular Data Synthesis: Targets, Paths, and Sampling

arXiv:2606.20878v1 Announce Type: new Abstract: Synthetic tabular data enables microdata sharing in regulated domains, yet deploying continuous-time generative models requires balancing analytical utility, disclosure risk, and computational cost. Latent-space flow models are flexible, but theoretical equivalences across learning targets, probability paths, and sampling dynamics can translate into different behaviour under finite-step integration and explicit compute budgets. We present an empiri...

23.06.2026

Blog LLMs & Texto

GEOPHYS: The Geometry of Physical Plausibility

arXiv:2606.20707v1 Announce Type: new Abstract: While humans can identify physically implausible events within milliseconds, machine learning approaches addressing the same problem are extremely slow and expensive. They either rely on external multimodal-LLM judges or require ad-hoc modifications to the training procedure. In this work, we argue that indicators of physical plausibility are implicitly captured by five geometric properties of the per-frame embeddings produced by frozen image encod...

23.06.2026

Blog Robótica & RL

CIExplainer++: Generating Causal and Interpretable Explanations for Graph Neural Networks

arXiv:2606.20747v1 Announce Type: new Abstract: Explainable Artificial Intelligence aims to make black-box models more trustworthy by presenting, in a human-understandable manner, the elements that lead to the model's output. This involves both (i) identifying components and connections with genuine causal influence on outputs and (ii) translating such structures into an interpretable representation. For the former, we introduce CIExplainer, a novel perturbation-based method grounded in causal i...

23.06.2026

Blog LLMs & Texto

Confidence Laundering in Agent Systems: Why Uncertainty Needs a Latent Carrier

arXiv:2606.20662v1 Announce Type: new Abstract: Modern agent systems can turn uncertainty into overconfidence. Fragile upstream decisions are often exposed to downstream components as clean intermediate artifacts, while the uncertainty behind those decisions is lost at the interface. As a result, local ambiguity can become system-level error amplification. We argue that this reveals an interface bottleneck in agent uncertainty propagation: uncertainty does not propagate simply because a trajecto...

23.06.2026

Blog Robótica & RL

Vesta: A Generalist Embodied Reasoning Model

arXiv:2606.20905v1 Announce Type: new Abstract: Robots operating in open-world environments must seamlessly integrate localization, spatial reasoning, navigation, and long-horizon planning. While specialist models excel at individual tasks, deploying a multi-model stack is computationally expensive and prone to cascading errors. We present Vesta, a unified embodied generalist that consolidates these capabilities into a single foundation model. Our approach combines a diverse and massive curated ...

23.06.2026

Blog Robótica & RL

ReFPO: Reflow Regularization for Flow Matching Policy Gradients

arXiv:2606.21086v1 Announce Type: new Abstract: We present Reflow-regularized Flow Matching Policy Gradients (ReFPO), a simple online RL method that adds explicit Reflow regularization to FPO for efficient flow-based control. We uncover a key structural property: the gradient updates in Flow Matching Policy Gradients (FPO) can be interpreted as an implicit advantage-weighted Reflow process, providing a new geometric perspective on flow-based policy gradients. Building on this insight, ReFPO intr...

23.06.2026

O que está acontecendo agora

Learning-Based Modeling of Soft Robots via Cosserat Rod Theory

Factor-Aware Mixture-of-Experts with Pretrained Encoder for Combinatorial Generalization

LLM-Based Multi-Reference Evaluation for Efficient and Robust Assessment of Phrase Break Annotations

In LLM Reasoning, there is Irrationality on top of Value Misalignment

Beyond 'One Language, One Script': Quantifying Orthographic Bias in Multilingual VLMs with PuMVR

Evidential Fusion Network for Multimodal Survival Prediction under Missing Modalities

Understanding Latent Flow Models for Tabular Data Synthesis: Targets, Paths, and Sampling

GEOPHYS: The Geometry of Physical Plausibility

CIExplainer++: Generating Causal and Interpretable Explanations for Graph Neural Networks

Confidence Laundering in Agent Systems: Why Uncertainty Needs a Latent Carrier

Vesta: A Generalist Embodied Reasoning Model

ReFPO: Reflow Regularization for Flow Matching Policy Gradients