National Cyber Warfare Foundation (NCWF)

OpenAI launches FrontierScience, a benchmark to measure models' expert-level scientific reasoning with 700+ questions, finding GPT-5.2 is its strongest model


2025-12-16 17:23:05
milo
Developers, Shooting the breeze / off-topic

OpenAI:

OpenAI launches FrontierScience, a benchmark to measure models' expert-level scientific reasoning with 700+ questions, finding GPT-5.2 is its strongest model  —  We introduce FrontierScience, a new benchmark that evaluates AI capabilities for expert-level scientific reasoning across physics, chemistry, and biology.







Source: TechMeme
Source Link: http://www.techmeme.com/251216/p21#a251216p21





Copyright 2012 through 2025 - National Cyber Warfare Foundation - All rights reserved worldwide.