Blog Dados & Embeddings

Beyond Compilation: Evaluating Faithful Natural-Language-to-Lean Statement Formalization

arXiv:2606.31002v1 Announce Type: new Abstract: Theorem-proving benchmarks evaluate proof search against fixed formal statements, but natural-language-to-Lean formalization must generate the formal statement itself. In this setting, compilation is only a validity check: a Lean declaration may type-check while omitting hypotheses, changing domains, or expressing a vacuous claim. We study faithful statement formalization as both an evaluation problem and a bottleneck-attribution problem. On a 400-...

arXiv cs.AI ·Ke Zhang, Patricio Gallardo Candela, Sudhir Murthy, Yi Xie, Zhi Wang, Maziar Raissi · 01 de janeiro de 2026

Ver no Hugging Face

// relacionados

Beyond Compilation: Evaluating Faithful Natural-Language-to-Lean Statement Formalization

Leia também

Using Lift to Turn Research PDFs into Structured JSON with Controlled, Schema-Guided Field-Level Evaluation

SpaceX shows investors a slim AI smartphone prototype powered by xAI technology

google/tabfm-1.0.0-pytorch

Hidden code in Claude Code secretly flagged Chinese users