FirstPass: Grounding AI Scientific Judgment in Multi-Round Editorial Outcomes

arXiv:2606.20769v1 Announce Type: new Abstract: AI systems for peer review fail on three fronts: they train on Computer Science and Machine Learning venues alone, ignore the iterative dialogue that validates science, and evaluate on stylistic mimicry rather than real editorial judgment. We introduce FirstPass, a dataset and fine-tuned model that addresses all three. Curating 3,668 complete multi-round peer-review dialogues from Nature Communications across five scientific domains (biology, chemi...

arXiv cs.CL ·Prabhjot Singh, Somnath Luitel, Manmeet Singh, Josh Durkee ·
compartilhar: