OpenAI is launching a brand new common function AI agent in ChatGPT, which the corporate says can full all kinds of computer-based duties on behalf of customers. OpenAI says the agent can mechanically navigate a consumer’s calendar, generate editable displays and slideshows, and run code.
The software, known as ChatGPT agent, combines a number of capabilities from OpenAI’s earlier agentic instruments, together with Operator’s skill to click on round on web sites, in addition to Deep Analysis’s skill to synthesize info from dozens of internet sites right into a concise analysis report. OpenAI says customers will be capable to work together with the agent just by prompting ChatGPT in pure language.
On Thursday, OpenAI is rolling out ChatGPT agent for subscribers to its Professional, Plus, and Crew plans. To activate the software, customers can choose “agent mode” in ChatGPT’s dropdown menu of instruments.
The launch of ChatGPT agent represents OpenAI’s boldest try but to show ChatGPT into an agentic product that may take actions and offload duties for customers, quite than simply answering questions. Lately, Silicon Valley firms together with OpenAI, Google, and Perplexity have unveiled dozens of AI brokers which have promised to do exactly that. Nevertheless, these early model of AI brokers have confirmed to wrestle with advanced duties, and appear much less compelling as merchandise than the final word imaginative and prescient tech executives pitch round AI brokers.
That stated, OpenAI says ChatGPT agent is way extra succesful than its earlier choices.
OpenAI’s new agent can entry ChatGPT connectors, permitting customers to attach apps like Gmail and GitHub in order that the agent can discover related info to your prompts. Moreover, OpenAI says ChatGPT agent has entry to a terminal, and may use APIs to entry sure apps.
OpenAI means that customers can faucet ChatGPT agent to “plan and purchase elements to make Japanese breakfast for 4,” in addition to “analyze three rivals and create a slide deck.” These sorts of capabilities requires ChatGPT agent to parse by web sites, plan a plan of action, and use instruments — way more difficult duties than OpenAI has beforehand tried to deal with with brokers.
Techcrunch occasion
San Francisco
|
October 27-29, 2025
The mannequin underlying ChatGPT agent presents state-of-the-art efficiency on a number of benchmarks, in keeping with OpenAI.
The corporate says the ChatGPT agent mannequin scores 41.6% on Humanity’s Final Examination (cross@1), a troublesome check made up of 1000’s of questions throughout multiple hundred topics. That’s roughly double what OpenAI’s o3 and o4-mini scored on the check.
On FrontierMath, one of many hardest recognized math benchmarks, OpenAI says ChatGPT agent scores 27.4% when it has entry to instruments, comparable to a terminal for code execution. The earlier state-of-the-art rating comes from o4-mini, which scored simply 6.3%.
OpenAI notes that it developed ChatGPT agent with security in thoughts, largely as a result of the product presents some newfound capabilities that might make it extra harmful within the arms of a nasty actor. OpenAI has beforehand warned that agentic fashions may current extra harmful capabilities.
In a security report for ChatGPT agent, OpenAI says it’s designed the mannequin as “excessive functionality” in organic and chemical weapon domains, which is outlined in OpenAI’s Preparedness Framework as a mannequin with the power to “amplify current pathways to extreme hurt.” OpenAI notes that it doesn’t have direct proof of this, however it’s determined to take a precautionary strategy and activate new safeguards to mitigate these dangers.
Among the many new safeguards for ChatGPT agent embody a monitor that works in real-time as customers work together with the product. OpenAI says it runs a classifier throughout each immediate entered into ChatGPT agent, figuring out whether or not the request is said to biology. If that’s the case, OpenAI runs the ChatGPT’s brokers response by a second monitor, that determines whether or not the content material could possibly be used to evoke a organic risk.
Whereas ChatGPT agent sounds spectacular, it it stays to be seen how succesful OpenAI’s new agent actually is in the actual world. Till now, agent expertise has confirmed comparatively brittle when interacting with the actual world. That stated, OpenAI believes it’s developed a extra succesful mannequin that’s capable of ship on the promise of AI brokers.