// AI radar

What's happening now

Trending papers, models, and datasets on Hugging Face, plus the official blog, with editorial commentary in Portuguese.


Blog LLMs & Text

Source: Elastic agrees to buy CRV-backed Deductive AI for up to $85M

Deductive AI, a startup that uses AI to catch and resolve bugs in software, was founded just three years ago.

19.06.2026

Paper LLMs & Text

Counsel: A Meta-Evaluation Dataset for Agentic Tasks

A large-scale dataset of human meta-evaluations of LLM critiques for agentic tasks is introduced to improve the calibration and reliability of automated evaluation methods.

19.06.2026 ·▲ 3

Paper LLMs & Text

CalVerT: Augmenting Agents with Calibrated Verifier Telemetry Improves Action and Learning in Knowledge-Intensive Tasks

Calibrated verifier telemetry enhances LLM agents in knowledge-intensive question answering by providing confidence scores and grounding verification, reducing both over-retrieval…

19.06.2026 ·▲ 2

Paper LLMs & Text

Speaker Identity in Non-Verbal Vocalizations: Conditional Distillation and Mixture of Experts Approach

A novel speaker verification framework combines frozen self-supervised features with ECAPA-TDNN and MoE modules to improve identity verification across both speech and non-verbal v…

19.06.2026

Paper Robotics & RL

Discretizing Reward Models

Reward models in reinforcement learning suffer from oversensitivity issues where they assign different scores to equally good responses, leading to poor policy learning, but this c…

19.06.2026 ·▲ 12

Paper Robotics & RL

PoLAR: Factorizing Extent and Mode in Latent Actions for Robot Policy Learning

PoLAR introduces a geometrically structured latent action representation in hyperbolic space that separates transition extent from transition mode, improving robotic policy learnin…

19.06.2026 ·▲ 7

Paper LLMs & Text

Demystifying Training-Time Augmentation for Data-Constrained Language Model Pretraining

Training-time data augmentation techniques help mitigate overfitting in autoregressive language model pretraining by delaying performance deterioration and improving final model qu…

19.06.2026 ·▲ 1

Paper LLMs & Text

DataClaw0: Agentic Tailoring Multimodal Data from Raw Streams

The Agentic Data Tailoring paradigm uses learnable data processing to structure high-entropy multimodal streams, with the DataClaw_0-9B model achieving robust alignment through SFT and GRP…

19.06.2026 ·▲ 60

Paper Audio & Voice

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

UnityShots is a memory-driven audio-video generation system that maintains consistent subject appearance and audio across video cuts using fixed-size long-term and short-term memor…

19.06.2026 ·▲ 14

Paper LLMs & Text

PrivacyAlign: Contextual Privacy Alignment for LLM Agents

Researchers develop a human-centered approach to align AI agents with privacy norms by creating a comprehensive dataset of privacy judgments and using annotation-conditioned reward…

19.06.2026 ·▲ 3

Paper Audio & Voice

Improving Text-to-Music Generation with Human Preference Rewards

A text-to-music generation system uses reward conditioning, expert iteration, and preference tuning to improve audio quality while maintaining efficiency within a 120M-parameter mo…

19.06.2026

Paper LLMs & Text

An Exploratory Case Study of LLM-Assisted Refactoring and Gameplay Feature Generation in an Endless Runner Game

Large language models demonstrate varying effectiveness in software development tasks, successfully completing localized refactoring but showing limitations in integrating new game…

19.06.2026


1494 items on the radar