New Reasoning Architecture Achieves Human-Level Performance on Graduate-Level Mathematics
A team at MIT CSAIL has unveiled a neural architecture that combines chain-of-thought prompting with formal verification. The system scored 94% on the MATH benchmark, the highest score ever recorded by an AI system.