GPT-5 launched yesterday. 94.6% on AIME 2025. 74.9% on SWE-bench Verified.
As we approach the upper bounds of these benchmarks, they die.
What makes GPT-5 and the next generation of models revolutionary isn’t their knowledge. It’s knowing how to act. For GPT-5 this happens at two levels. First, in deciding which model to use. Second, and more importantly, in tool calling.
We’ve been living in an era where LLMs mastered knowledge retrieval and reassembly.