// radar de ia

Dados & Embeddings

Papers, modelos e datasets em alta no Hugging Face, além do blog oficial — com leitura editorial em português.

todas LLMs & Texto Geração de Imagem Visão Computacional Áudio & Voz Multimodal Dados & Embeddings Robótica & RL

WithinUsAI/claude_mythos_distilled_25k

Dataset em destaque no Hugging Face — 2.6 mil downloads. Claude Mythos Distilled 25K A high-quality synthetic supervised fine-tuning (SFT) dataset designed to train and fine-tune any LLM to mirror the capabi…

18.05.2026 ·↓ 2593

Blog Dados & Embeddings

Further Notes on Our Recent Research on AI Delegation and Long-Horizon Reliability

Our recent paper, “LLMs Corrupt Your Documents When You Delegate”, has generated discussion about the reliability of AI systems in delegated workflows. We appreciate the interest in this work and want to clarify several important points about what the paper does—and does not—claim. The research aims to develop robust evaluation methods for long-horizon delegated and […] The post Further Notes on Our Recent Research on AI Delegation and Long-Horizon Reliability appeared first on Microso...

15.05.2026

Blog Dados & Embeddings

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

14.05.2026

Blog LLMs & Texto

mimalloc: A new, high-performance, scalable memory allocator for the modern era

mimalloc is an open-source, modern, scalable memory allocator that is a drop-in replacement for malloc and free. It is relatively small (~12K lines), with clear internal data structures, and is easy to build and integrate into other projects. It provides bounded worst-case allocation times (up to OS primitives), bounded space overhead, low internal fragmentation, and minimal contention by relying almost exclusively on atomic operations. The post mimalloc: A new, high-performance, scalable memory...

13.05.2026

Blog Dados & Embeddings

GridSFM: A new, small foundation model for the electric grid

Introducing GridSFM, a small foundation model that can predict AC optimal power flow in milliseconds, boosting efficiency and unlocking cost savings. Learn how GridSFM gives grid operators direct visibility into congestion, stability, and system health. The post GridSFM: A new, small foundation model for the electric grid appeared first on Microsoft Research .

13.05.2026

Dataset LLMs & Texto

WithinUsAI/GPT_5.5_Distilled

Dataset em destaque no Hugging Face — 776 downloads.

12.05.2026 ·↓ 776

Dataset Dados & Embeddings

nvidia/PhysicalAI-Autonomous-Vehicles

Dataset em destaque no Hugging Face — 199.2 mil downloads. PHYSICAL AI AUTONOMOUS VEHICLES The PhysicalAI-Autonomous-Vehicles dataset provides one of the largest, geographically diverse collections of multi-se…

06.05.2026 ·↓ 199196

Blog LLMs & Texto

Designing synthetic datasets for the real world: Mechanism design and reasoning from first principles

Generative AI

16.04.2026

Blog Dados & Embeddings

Building better AI benchmarks: How many raters are enough?

Algorithms & Theory

31.03.2026

Dataset LLMs & Texto

ScaleAI/SWE-bench_Pro

Dataset em destaque no Hugging Face — 66.8 mil downloads. Dataset Summary SWE-Bench Pro is a challenging, enterprise-level dataset for testing agent ability on long-horizon software engineering tasks.

23.02.2026 ·↓ 66791

Modelo Visão Computacional

facebook/sam3

Modelo de mask generation em alta no Hugging Face — 1.7 mi downloads e 2.3 mil curtidas da comunidade.

20.11.2025 ·↓ 1745996

Blog LLMs & Texto

Which Agent Causes Task Failures and When?Researchers from PSU and Duke explores automated failure attribution of LLM Multi-Agent Systems

In recent years, LLM Multi-Agent systems have garnered widespread attention for their collaborative approach to solving complex problems. However, it's a common scenario for these systems to fail at a task despite a flurry of activity. The post Which Agent Causes Task Failures and When?Researchers from PSU and Duke explores automated failure attribution of LLM Multi-Agent Systems first appeared on Synced .

14.08.2025

← anterior 10 / 11 próxima →

125 itens no radar