OpenThoughts-Agent: Data Recipes for Agentic Models
An open-source data curation pipeline for training agentic language models is presented, demonstrating superior performance through systematic experimentation and scalable training…
Trending papers, models, and datasets on Hugging Face, plus the official blog, with editorial commentary in Portuguese.
Block-GTQ introduces a RoPE-aware bit allocation method for key-cache quantization that improves attention accuracy and downstream performance through adaptive bit distribution and…
Wan-Streamer is a unified, end-to-end multimodal model that enables real-time audio-visual interaction through causal attention mechanisms and integrated processing of visual, audi…
Video diffusion models are adapted to decode explicit surface primitives directly from latent space, enabling high-quality 3D scene generation with improved geometric accuracy and…
Large language model agents' memory systems have evolved into complex data management frameworks requiring systematic evaluation across multiple modules and workloads to understand…
Discover how Omio uses OpenAI to power conversational travel experiences, accelerate product development, and transform into an AI-native company.
InSight enables autonomous skill acquisition for vision-language-action models through primitive-action level steerability and automated demonstration generation.
A large-scale synthetic dataset and specialized model architecture are introduced to address the challenges of artistic text recognition by improving data diversity and model flexi…
Two-channel evaluation shows output compression reduces costs while input compression increases costs and degrades accuracy across models and datasets.
Implicit Visual Chain-of-Thought decomposes visual conditioning into structural and semantic cascades for improved structure-aware image generation with sketch supervision.
Language-based world models enable agentic environment simulation across multiple domains and enhance general agent performance through scalable simulation and improved downstream…
NatureBench presents a cross-disciplinary benchmark of 90 scientific tasks derived from Nature publications to assess AI coding agents' ability to achieve discovery rather than jus…