// radar de ia

O que está acontecendo agora

Papers, modelos e datasets em alta no Hugging Face, além do blog oficial — com leitura editorial em português.

todas LLMs & Texto Geração de Imagem Visão Computacional Áudio & Voz Multimodal Dados & Embeddings Robótica & RL

Discretizing Reward Models

Reward models in reinforcement learning suffer from oversensitivity issues where they assign different scores to equally good responses, leading to poor policy learning, but this c…

19.06.2026 ·▲ 12

Paper Robótica & RL

PoLAR: Factorizing Extent and Mode in Latent Actions for Robot Policy Learning

PoLAR introduces a geometrically structured latent action representation in hyperbolic space that separates transition extent from transition mode, improving robotic policy learnin…

19.06.2026 ·▲ 7

Paper LLMs & Texto

Demystifying Training-Time Augmentation for Data-Constrained Language Model Pretraining

Training-time data augmentation techniques help mitigate overfitting in autoregressive language model pretraining by delaying performance deterioration and improving final model qu…

19.06.2026 ·▲ 1

Paper LLMs & Texto

DataClaw0: Agentic Tailoring Multimodal Data from Raw Streams

Agentic Data Tailoring paradigm uses learnable data processing to structure high-entropy multimodal streams, with DataClaw_0-9B model achieving robust alignment through SFT and GRP…

19.06.2026 ·▲ 60

Paper Áudio & Voz

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

UnityShots is a memory-driven audio-video generation system that maintains consistent subject appearance and audio across video cuts using fixed-size long-term and short-term memor…

19.06.2026 ·▲ 14

Paper LLMs & Texto

PrivacyAlign: Contextual Privacy Alignment for LLM Agents

Researchers develop a human-centered approach to align AI agents with privacy norms by creating a comprehensive dataset of privacy judgments and using annotation-conditioned reward…

19.06.2026 ·▲ 3

Paper Áudio & Voz

Improving Text-to-Music Generation with Human Preference Rewards

A text-to-music generation system uses reward conditioning, expert iteration, and preference tuning to improve audio quality while maintaining efficiency within a 120M-parameter mo…

19.06.2026

Paper LLMs & Texto

An Exploratory Case Study of LLM-Assisted Refactoring and Gameplay Feature Generation in an Endless Runner Game

Large language models demonstrate varying effectiveness in software development tasks, successfully completing localized refactoring but showing limitations in integrating new game…

19.06.2026

Paper Dados & Embeddings

EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

EvoEmbedding is a dynamic embedding model that generates adaptive representations by maintaining a continuously updated latent memory, enabling improved retrieval performance in lo…

19.06.2026 ·▲ 22

Paper LLMs & Texto

Toward Open Weight Models Without Risks: Separating Public and Private Capabilities in LLMs

Tiered Language Models (TLMs) provide a framework for releasing large language models with configurable capability levels through secret keys that modify computation graphs while m…

19.06.2026 ·▲ 1

Modelo LLMs & Texto

Mia-AiLab/Gemmable-4-12B-MTP-GGUF

Modelo de modelo em alta no Hugging Face — 7.2 mil downloads e 40 curtidas da comunidade.

18.06.2026 ·↓ 7248

Modelo LLMs & Texto

Mia-AiLab/Qwable-3.6-27b

Modelo de modelo em alta no Hugging Face — 24.0 mil downloads e 125 curtidas da comunidade.

18.06.2026 ·↓ 23958

← anterior 111 / 144 próxima →

1720 itens no radar