New Apple study challenges whether AI models truly think through problems

Earlier this month, Apple researchers published a study indicating that simulated reasoning (SR) models, including OpenAI’s o1 and o3, DeepSeek-R1, and Claude 3.7 Sonnet Thinking, generate responses that align with pattern-matching from their training data when tackling new problems that demand systematic reasoning.
Benj Edwards for Ars Technica:‎The researchers found similar results to a recent study by the United States of America Mathematical Olympiad (USAMO) in April, showing that