OpenAI launched ChatGPT Agent on Thursday, its latest effort in the industry-wide pursuit to turn AI into a profitable enterprise—not just one that eats investors’ billions. In its announcement blog, OpenAI says its Agent “can now do work for you using its own computer,” but CEO Sam Altman warns that the rollout presents unpredictable risks.
[…]
OpenAI research lead Lisa Fulford told Wired that she used Agent to order “a lot of cupcakes,” which took the tool about an hour, because she was very specific about the cupcakes.
Okay but that’s not what easier means.
Easier would be to call the bakery or spending 10 minutes browsing their website, asking to cast, and checking out.
I don’t want to spend an hour on tasks that would normally take 10 minutes. My executive dysfunctions already make me good at doing that.
This might be a revolutionary idea, but what if they helped me do that take an hour in 10 minutes?
I’m just putting that idea out there totally for free in case any AI companies want to jump on that opportunity.
I don’t get it, do you think she spent an hour talking to ChatGPT to try and get it to order doughnuts?
It’s a starting point
I use agents a lot and have written several MCP servers now, the tasks I automate aren’t things like order cupcakes, it’s mainly the glue between complex things.
I still can’t get Claude to nicely open a JIRA ticket for me, but I can get it to read through a sequence of connected documents and filter that into.
I don’t think agents are ready for the main event and these are some poor examples of their power.
I’m not saying they won’t improve, but using the right tool for the right job is critical. An hour to order cupcakes is silly even for an llm.
yes in the wired article one of them says they would like to find out where it got stuck taking an hour with an agent replay feature
It’s examples for the common guy in the streets who don’t know what an mcp server is.