ibm-research/AssetOpsBench
Dataset with fewer than a thousand examples, 869 downloads on Hugging Face. AssetOpsBench is a specialized benchmark designed for evaluating Large Language Models (LLMs) and Multi-Agent systems in industrial oper…
Trending papers, models, and datasets on Hugging Face, plus the official blog, with editorial commentary in Portuguese.
Data Formulator introduces AI-powered analytics for enterprise data workflows. Data teams can easily bring enterprise data into an AI-ready workspace where users can explore, analyze, and visualize data with AI agents to turn raw data into actionable insights. The post "Data Formulator 0.7: AI-powered data analytics for enterprise data" appeared first on Microsoft Research.
Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.
Featured dataset on Hugging Face, 90.3K downloads. Ultra-FineWeb-L3
Security, Privacy and Abuse Prevention
Understanding AI as an extension of human intelligence, not a replacement for it, offers a more grounded path for building trustworthy AI systems. The post "Extending Human Intelligence Through AI" appeared first on Microsoft Research.
Dataset with fewer than a thousand examples, 37.3K downloads on Hugging Face. ITBench-AA: Artificial Analysis' release of the public scenarios from IBM's ITBench benchmark, used for the ITBench-AA leaderboard.
Vision-language model · 27B parameters, 874.4K downloads and 869 likes on Hugging Face.
Featured dataset on Hugging Face, 215 downloads. Scarf (Self-Contained Application Refactoring) is a benchmark suite for evaluating AI agents' ability to migrate enterprise Java applications across J…