Tag Archives: SR models

New study shows why simulated reasoning AI models don’t yet live up to their billing

A screenshot of the 2025 USAMO Problem #1 and a solution, shown on the AoPSOnline website. Credit: AoPSOnline The US Math Olympiad (USAMO) serves as a qualifier for the International Math Olympiad and presents a much higher bar than tests like the American Invitational Mathematics Examination (AIME). While AIME problems are difficult, they require integer… Read More »

OpenAI releases new simulated reasoning models with full tool access

On Wednesday, OpenAI announced the release of two new models—o3 and o4-mini—that combine simulated reasoning capabilities with access to functions like web browsing and coding. These models mark the first time OpenAI’s reasoning-focused models can use every ChatGPT tool simultaneously, including visual analysis and image generation. OpenAI announced o3 in December, and until now, only… Read More »

Researchers concerned to find AI models hiding their true “reasoning” processes

Remember when teachers demanded that you “show your work” in school? Some fancy new AI models promise to do exactly that, but new research suggests that they sometimes hide their actual methods while fabricating elaborate explanations instead. New research from Anthropic—creator of the ChatGPT-like Claude AI assistant—examines simulated reasoning (SR) models like DeepSeek’s R1, and… Read More »

Cutting-edge Chinese “reasoning” model rivals OpenAI o1—and it’s free to download

Unlike conventional LLMs, these SR models take extra time to produce responses, and this extra time often increases performance on tasks involving math, physics, and science. And this latest open model is turning heads for apparently quickly catching up to OpenAI. For example, DeepSeek reports that R1 outperformed OpenAI’s o1 on several benchmarks and tests,… Read More »

Sam Altman says “we are now confident we know how to build AGI”

On Sunday, OpenAI CEO Sam Altman offered two eye-catching predictions about the near-future of artificial intelligence. In a post titled “Reflections” on his personal blog, Altman wrote, “We are now confident we know how to build AGI as we have traditionally understood it.” He added, “We believe that, in 2025, we may see the first… Read More »