Radar de IA — Notícias, Modelos e Papers

Peeking Inside LLMs: Leveraging Internal Artifacts of LLMs for Enhancing Reliability in Legal Classification

arXiv:2606.20929v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly being adopted in the legal domain. However, despite their strong performance, LLMs are prone to generating incorrect or hallucinated outputs, raising serious concerns about their reliability in high-stakes domains such as law. Detecting the correctness of responses of LLM-based systems is therefore a critical challenge. In this work, we explore the potential of leveraging internal artifacts of LLM to de...

23.06.2026

Blog Robótica & RL

Machine Learning Classification of Cryopathy Syndromes: A Comprehensive Comparative Study

arXiv:2606.20874v1 Announce Type: new Abstract: Cryopathy syndromes are difficult to classify because laboratory patterns often overlap across diagnostic categories, while some diagnoses are rare. This makes routine interpretation of cryoglobulin-related tests challenging and increases dependence on expert judgment. The aim of this study was to develop and compare machine learning approaches for automated classification of cryopathy syndromes from laboratory data and to identify a practical stra...

23.06.2026

Blog LLMs & Texto

The New Associationism: Lessons from Deep Learning

arXiv:2606.20600v1 Announce Type: new Abstract: What can the success of modern AI tell us about how humans learn? This paper argues that taking AI seriously as a model of human learning supports a modest but genuine associationism. The central finding is that supervised learning -- learning driven by evaluative feedback -- underlies a surprisingly wide range of contemporary AI systems, from large language models to game-playing agents, differing primarily in how much work is required to generate...

23.06.2026

Blog LLMs & Texto

Towards Robust Training in NNGPT AutoML Pipeline: A Loss-Optimizer Pairing Selection Study

arXiv:2606.20933v1 Announce Type: new Abstract: The choice of loss function and optimizer is an important decision, that shapes further model training. Yet automated architecture search pipelines (AutoML) benefits significantly more from the optimal pairing selection and vice versa. This paper investigates whether a single recipe is sufficient for heterogeneous architecture pools, or whether the optimal pairing varies across structurally diverse models. We conduct a systematic empirical study of...

23.06.2026

Blog LLMs & Texto

How Should a Robot Configure Its Laser Scanner for Inspection?

arXiv:2606.21093v1 Announce Type: new Abstract: Robotic inspection relies on accurate sensing to acquire high-fidelity geometric measurements for defect detection and metrology. While prior work has focused on robot motion and viewpoint planning, how to configure sensing parameters remains largely underexplored, despite their decisive impact on measurement quality. We propose SenseHD, a robotic sensing system that formulates scanner configuration as an instruction-conditioned sensing decision. I...

23.06.2026

Blog Dados & Embeddings

ELADO: Elliptic PDE Assessment Datasets for Operator Learning

arXiv:2606.20771v1 Announce Type: new Abstract: We introduce ELADO (Elliptic PDE Assessment Datasets for Operator Learning), a systematic benchmark suite constructed to show and quantify failure modes of neural operator architectures when learning solution operators of elliptic PDEs. While the benchmarks of existing datasets focus on average case performance, the ELADO datasets are constructed to highlight challenges that arise naturally in elliptic PDE problems. In particular, we construct seve...

23.06.2026

Blog LLMs & Texto

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

arXiv:2606.20945v1 Announce Type: new Abstract: Self-attention is central to Transformer performance and is often the most expensive part of the Transformer at long context lengths because its pairwise token interactions scale quadratically with sequence length. Standard dense attention also applies the same set of attention heads to every token regardless of token difficulty or information content. This uniform activation can waste compute, especially as sequences grow longer and attention cost...

23.06.2026

Blog Robótica & RL

MemoryVAM: Integrating Memory into Video Action Model for Robot Manipulation

arXiv:2606.20679v1 Announce Type: new Abstract: Video-world-model policies learn action-relevant representations by predicting future observations. However, they condition on only a short observation window, which renders long-horizon manipulation non-Markovian when the correct action depends on earlier events that are no longer visible. We present MemoryVAM, an episodic memory mechanism for video-world-model policies. We employ a Recap-Cue (RC) module, in which a Perceiver-based Recap Compresso...

23.06.2026

Blog LLMs & Texto

A-Evolve-Training: Autonomous Post-Training of a 30B Model

arXiv:2606.20657v1 Announce Type: new Abstract: Post-training a frontier model is normally weeks of human work: proposing data and recipe changes, launching runs, reading evals, deciding what to keep. We report an autonomous system that runs this loop with no human in the loop, post-training a 30B Nemotron across four rounds over multiple weeks. The autonomously produced model reaches a held-out score of 0.86 against the top human submission's 0.87 on the public NVIDIA Nemotron-Reasoning Challen...

23.06.2026

Blog Multimodal

How Well Can Your Video Model Remember? Measuring Memory-Budget Trade-offs in Long Video Understanding

arXiv:2606.20726v1 Announce Type: new Abstract: We introduce a compact empirical model that quantifies how answer accuracy degrades as a function of frame budget B and temporal distance D in long video understanding -- analyzing performance when recalling content from D seconds in the past using a fraction B of total frames. Long-form models operate under strict budgets, yet no prior framework predicts how accuracy degrades as B shrinks and events recede. We fit a weighted least-squares model on...

23.06.2026

Blog LLMs & Texto

Shear-Free Viewport Magnification for 360-Degree via Spherical Mobius Boosts

arXiv:2606.20684v1 Announce Type: new Abstract: Viewport-adaptive 360-degree imaging seeks to allocate a fixed sampling budget to the region a viewer is likely to observe. Existing view-biased projections increase viewport resolution through non-conformal warps, which can introduce anisotropic stretching and shear. We formulate spherical Mobius boosts as exact conformal maps for fixed-budget viewport magnification. The continuous spherical warp has quasiconformal dilatation K = 1, reallocating s...

23.06.2026

Blog Robótica & RL

Evolutionary Discovery of Developmental Reward Schedules in Deep Reinforcement Learning

arXiv:2606.20858v1 Announce Type: new Abstract: The temporal structure of reward composition in reinforcement learning (RL) is typically hand-designed and held fixed throughout training, leaving the progression of motivational priorities largely unexplored. In this work, we propose an evolutionary framework for discovering developmental reward schedules, in which three distinct biologically inspired motivational components -- agency, novelty, and reactivity -- are combined through time-varying w...

23.06.2026

O que está acontecendo agora

Peeking Inside LLMs: Leveraging Internal Artifacts of LLMs for Enhancing Reliability in Legal Classification

Machine Learning Classification of Cryopathy Syndromes: A Comprehensive Comparative Study

The New Associationism: Lessons from Deep Learning

Towards Robust Training in NNGPT AutoML Pipeline: A Loss-Optimizer Pairing Selection Study

How Should a Robot Configure Its Laser Scanner for Inspection?

ELADO: Elliptic PDE Assessment Datasets for Operator Learning

Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention

MemoryVAM: Integrating Memory into Video Action Model for Robot Manipulation

A-Evolve-Training: Autonomous Post-Training of a 30B Model

How Well Can Your Video Model Remember? Measuring Memory-Budget Trade-offs in Long Video Understanding

Shear-Free Viewport Magnification for 360-Degree via Spherical Mobius Boosts

Evolutionary Discovery of Developmental Reward Schedules in Deep Reinforcement Learning