AI Can Write Code But Lacks Engineer's Instinct, OpenAI Study Finds

Leading AI models can fix broken code, but they're nowhere near ready to replace human software engineers, according to extensive testing by OpenAI researchers. The company's latest study put AI models and systems through their paces on real-world programming tasks, with even the most advanced models solving only a quarter of typical engineering challenges.
The research team created a test called SWE-Lancer, drawing from 1,488 actual software fixes made to Expensify's codebase, representin