OpenAI launches FrontierScience, a benchmark to measure models' expert-level scientific reasoning with 700+ questions, finding GPT-5.2 is its strongest model (OpenAI)

OpenAI: We introduce FrontierScience, a new benchmark that evaluates AI capabilities for expert-level scientific reasoning across physics, chemistry, and biology.