Blog LLMs & Texto

A-Evolve-Training: Autonomous Post-Training of a 30B Model

arXiv:2606.20657v1 Announce Type: new Abstract: Post-training a frontier model is normally weeks of human work: proposing data and recipe changes, launching runs, reading evals, deciding what to keep. We report an autonomous system that runs this loop with no human in the loop, post-training a 30B Nemotron across four rounds over multiple weeks. The autonomously produced model reaches a held-out score of 0.86 against the top human submission's 0.87 on the public NVIDIA Nemotron-Reasoning Challen...

arXiv cs.AI ·Zhan Shi, Bing He, Yisi Sang, Hanqing Lu · 23 de janeiro de 2026

Ver no Hugging Face

// relacionados

A-Evolve-Training: Autonomous Post-Training of a 30B Model

Leia também

How Businesses Are Building Specialized AI They Can Trust

Fika Jobs raises $4M to build a video-first hiring platform where AI agents interview candidates

Build real agentic apps using CUGA: two dozen working examples on a lightweight harness

Cursor announces its own AI model, a new Git platform, and a mobile app