“GPT‑4.1 mini is a major leap in small mannequin efficiency, even beating GPT‑4o in lots of benchmarks. It matches or exceeds GPT‑4o in intelligence evals whereas decreasing latency by almost half and decreasing value by 83%,” the announcement mentioned. “For duties that demand low latency, GPT‑4.1 nano is our quickest and most cost-effective mannequin obtainable. It delivers distinctive efficiency at a small measurement with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding—even increased than GPT‑4o mini. It’s very best for duties like classification or autocompletion.”
These enhancements, OpenAI mentioned, mixed with primitives such because the Responses API, will permit builders to construct extra helpful and dependable brokers that can carry out advanced duties comparable to extracting insights from massive paperwork and resolving buyer requests “with minimal hand-holding.”
OpenAI additionally mentioned that GPT-4.1 is considerably higher than GPT-4o at duties comparable to agentically fixing coding duties, front-end coding, making fewer extraneous edits, following diff codecs reliably, making certain constant instrument utilization, and others.