A benchmark called OSWorld-Verified, designed to monitor AI's ability to navigate desktop environments, found that GPT-5.4 scored 75%, up from 47.3% with its GPT 5.2 model. That also beats the average ...
"In ChatGPT, GPT‑5.4 Thinking can now provide an upfront plan of its thinking, so you can adjust course mid-response while it ...
GPT-5.4 is out now on ChatGPT (where it goes by the name GPT-5.4 Thinking) as well as on the OpenAI API and OpenAI’s coding ...
OpenAI integrates Excel support and live financial data into ChatGPT, creating a powerful spreadsheet co-pilot for traders, analysts, and crypto investors.
OpenAI Group PBC today launched a new large language model that it says is more adept at automating work tasks than its earlier algorithms. GPT-5.4 is available in ChatGPT, the Codex programming tool ...
GPT-5.4 is also more reliable, producing 18% fewer errors and 33% fewer false claims than GPT-5.2, according to OpenAI.
The latest model comes with native computer use capabilities, allowing it to take on jobs across your device and applications ...
OpenAI released GPT-5.4 today with native computer use, a 1M-token context window, and new professional benchmarks. Find what's genuinely new.
OpenAI launches GPT-5.4, calling it its most capable and efficient AI model yet, with AI agents, computer control, improved reasoning, and a 1M-token context.
OpenAI debuted its most capable model yet under pressure from a mass user exodus tied to the company's controversial Pentagon ...
Experimental - This project is still in development, and not ready for the prime time. A minimal, secure Python interpreter written in Rust for use by AI. Monty avoids the cost, latency, complexity ...
When building AI, you change many things at once: code, data, prompts, models. After a few runs, it becomes unclear what actually caused results to improve or regress. LitLogger records every run as ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results