The Register on MSN

AI models still suck at math

Just less than before, according to the ORCA test exclusive Current-day LLMs are prediction engines and, as such, they can ...
The Covid-19 pandemic has triggered a wave of severe economic disruption around the world, causing widespread chaos, profound changes in the business landscape and overwhelming operational challenges.
Tao: Today there are a lot of very tedious types of mathematics that we don’t like doing, so we look for clever ways to get ...
OpenAI’s unreleased model solved five of 10 unpublished research-level math problems and proposed a breakthrough physics formula, signaling a new era for AI in science.
There are, generally speaking, two types of people in the mathematical optimization software field: • Optimization solver developers: The technical experts who devise and implement the algorithms that ...
AI could soon spew out hundreds of mathematical proofs that look "right" but contain hidden flaws, or proofs so complex we can't verify them. How will we know if they're right?
Large language models struggle to solve research-level math questions. It takes a human to assess just how poorly they perform.
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
DeepSeek released DeepSeek-Math-V2, an AI model specialized for mathematical reasoning, on November 27, 2025. DeepSeek-Math-V2 focuses on theorem proving and self-verification capabilities, and ...