The company claims the models' ability to tackle complex, multistep problems paves the way for much more proficient AI agents. Anthropic has announced two new AI models that it claims represent a major step ...
What if the toughest problems humanity faces—those that stump our brightest minds and stretch the limits of human ingenuity—could be tackled by a single, purpose-built system? Enter Gemini Deep Think, ...
Researchers at Google Cloud and UCLA have proposed a new reinforcement learning framework that significantly improves the ability of language models to learn very challenging multi-step reasoning ...
Singapore-based AI startup Sapient Intelligence has developed a new AI architecture that can match, and in some cases vastly outperform, large language models (LLMs) on complex reasoning tasks, all ...
OpenAI published a new paper called "Monitoring Monitorability." It offers methods for detecting red flags in a model's reasoning. Those methods shouldn't be mistaken for silver-bullet solutions, though. In ...
From GPT-5 to GPT-5 Thinking, here's a simple guide to ChatGPT's current models and the tasks they're best suited for.
On Thursday, Anthropic released Claude Opus 4 and Claude Sonnet 4, marking the company’s return to larger model releases after primarily focusing on mid-range Sonnet variants since June of last year.