
OpenAI:
OpenAI launches FrontierScience, a benchmark of 700+ questions that measures models' expert-level scientific reasoning, and finds that GPT-5.2 is its strongest model. "We introduce FrontierScience, a new benchmark that evaluates AI capabilities for expert-level scientific reasoning across physics, chemistry, and biology."

Source: TechMeme
Source Link: http://www.techmeme.com/251216/p21#a251216p21