VeriEvol: Scaling Multimodal Mathematical Reasoning via Verifiable Evol-Instruct
A novel framework called VeriEvol is introduced that addresses the challenge of scaling reinforcement learning for visual mathematical reasoning by ensuring reliable reward labels…