Are reasoning models fundamentally flawed?
A report from Apple has cast significant doubts on the efficacy of reasoning models, going as far as to suggest that when a problem is too complex, they simply give up
Sign up today and you will receive a free copy of our Future Focus 2025 report - the leading guidance on AI, cybersecurity and other IT challenges as per 700+ senior executives
You are now subscribed
Your newsletter sign-up was successful
A report from Apple has cast significant doubts on the efficacy of reasoning models, going as far as to suggest that when a problem is too complex, they simply give up.
AI reasoning models have emerged in the past year as a beacon of hope for large language models (LLMs), with AI developers such as OpenAI, Google, and Anthropic selling them as the go-to solution for solving the most complex business problems.
However, a new research paper by Apple has cast significant doubts on the efficacy of reasoning models, going as far as to suggest that when a problem is too complex, they simply give up. What's going on here? And does it mean reasoning models are fundamentally flawed?
In this episode, Rory Bathgate speaks to ITPro's news and analysis editor Ross Kelly to explain some of the report's key findings and what it means for the future of AI development.
Highlights
"It sounds a bit obvious, but at the same time, the way that this has been, framed by a lot of providers, makes it seem like they're the be-all and end-all. But once you reach a level of complexity, yeah, you start to encounter some serious problems."
"What's interesting for me, right off the bat, is, what Apple is showing is that that's actually the opposite of what these are good at, that they are maybe more performant, maybe they're slightly better at solving complex problems up to a point, but that if you actually try and give them really complex problems, they are completely ineffective."
Sign up today and you will receive a free copy of our Future Focus 2025 report - the leading guidance on AI, cybersecurity and other IT challenges as per 700+ senior executives
"Apple is claiming that OpenAI hasn't made a model that can think, to which my reaction was, did you think OpenAI had made a model that can think? "
- Subscribe to The IT Pro Podcast on Apple Podcasts
- Subscribe to The IT Pro Podcast on Spotify
- Subscribe to the IT Pro newsletter
- Join us on LinkedIn
Bobby Hellard is ITPro's Reviews Editor and has worked on CloudPro and ChannelPro since 2018. In his time at ITPro, Bobby has covered stories for all the major technology companies, such as Apple, Microsoft, Amazon and Facebook, and regularly attends industry-leading events such as AWS Re:Invent and Google Cloud Next.
Bobby mainly covers hardware reviews, but you will also recognize him as the face of many of our video reviews of laptops and smartphones.
-
Mistral CEO Arthur Mensch thinks 50% of SaaS solutions could be supplanted by AINews Mensch’s comments come amidst rising concerns about the impact of AI on traditional software
-
Westcon-Comstor and UiPath forge closer ties in EU growth driveNews The duo have announced a new pan-European distribution deal to drive services-led AI automation growth
-
The agentic identity crisisITPro Podcast With millions of AI agents being deployed, how can security teams maintain oversight and governance?
-
Building a secure payments strategySponsored Podcast Facilitating a smooth payments experience for iGaming can go hand-in-hand with building a more secure platform
-
Mitigating bad botsSponsored Podcast Web crawlers pose an immediate business risk, necessitating immediate action from IT leaders
-
Google’s Apple deal is a major seal of approval for Gemini – and a sure sign it's beginning to pull ahead of OpenAI in the AI raceAnalysis Apple opting for Google's models to underpin Siri and Apple Intelligence is a major seal of approval for the tech giant's Gemini range – and a sure sign it's pulling ahead in the AI race.
-
Microsoft quietly launches Fara-7B, a new 'agentic' small language model that lives on your PC — and it’s more powerful than GPT-4oNews The new Fara-7B model is designed to takeover your mouse and keyboard
-
Why do AI projects fail?ITPro Podcast Without a careful approach at the design phase, AI tools will fall apart in runtime
-
Is vibe coding the future?ITPro Podcast As developers use AI to speed up code generation, leaders must work to ensure the technology complements existing practices
-
September rundown: The UK becomes an AI playgroundITPro Podcast Big tech has announced tens of billions in infrastructure investments for the UK – how will it help?
