Cristina Criddle / Financial Times:
Meta, OpenAI, Microsoft, and other AI companies create their own internal benchmarks as new models approach or exceed 90% accuracy on existing public tests — Rapidly advancing technology is surpassing current methods of evaluating and comparing large language models
Cristina Criddle / Financial Times:
Meta, OpenAI, Microsoft, and other AI companies create their own internal benchmarks as new models approach or exceed 90% accuracy on existing public tests — Rapidly advancing technology is surpassing current methods of evaluating and comparing large language models
Source: TechMeme
Source Link: http://www.techmeme.com/241110/p1#a241110p1