A benchmark called OSWorld-Verified, designed to monitor AI's ability to navigate desktop environments, found that GPT-5.4 scored 75%, up from 47.3% with its GPT 5.2 model. That also beats the average ...
"In ChatGPT, GPT‑5.4 Thinking can now provide an upfront plan of its thinking, so you can adjust course mid-response while it ...
GPT-5.4 is out now on ChatGPT (where it goes by the name GPT-5.4 Thinking) as well as on the OpenAI API and OpenAI’s coding ...
OpenAI integrates Excel support and live financial data into ChatGPT, creating a powerful spreadsheet co-pilot for traders, analysts, and crypto investors.
OpenAI Group PBC today launched a new large language model that it says is more adept at automating work tasks than its earlier algorithms. GPT-5.4 is available in ChatGPT, the Codex programming tool ...
GPT-5.4 is also more reliable, producing 18% fewer errors and 33% fewer false claims than GPT-5.2, according to OpenAI.
The latest model comes with native computer use capabilities, allowing it to take on jobs across your device and applications ...
OpenAI released GPT-5.4 today with native computer use, a 1M-token context window, and new professional benchmarks. Find what's genuinely new.
OpenAI launches GPT-5.4, calling it its most capable and efficient AI model yet, with AI agents, computer control, improved reasoning, and a 1M-token context.
OpenAI debuted its most capable model yet under pressure from a mass user exodus tied to the company's controversial Pentagon ...
Experimental - This project is still in development, and not ready for the prime time. A minimal, secure Python interpreter written in Rust for use by AI. Monty avoids the cost, latency, complexity ...
When building AI, you change many things at once: code, data, prompts, models. After a few runs, it becomes unclear what actually caused results to improve or regress. LitLogger records every run as ...