{"id":107197,"date":"2025-09-01T15:54:14","date_gmt":"2025-09-01T15:54:14","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/107197\/"},"modified":"2025-09-01T15:54:14","modified_gmt":"2025-09-01T15:54:14","slug":"ai-agents-are-science-fiction-not-yet-ready-for-primetime","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/107197\/","title":{"rendered":"AI agents are science fiction not yet ready for primetime"},"content":{"rendered":"<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">This is <a href=\"https:\/\/www.theverge.com\/the-stepback-newsletter\" rel=\"nofollow noopener\" target=\"_blank\">The Stepback<\/a>, a weekly newsletter breaking down one essential story from the tech world. For more on all things AI, <a href=\"https:\/\/www.theverge.com\/authors\/hayden-field\" rel=\"nofollow noopener\" target=\"_blank\">follow Hayden Field<\/a>. The Stepback arrives in our subscribers\u2019 inboxes at 8AM ET. Opt in for The Stepback <a href=\"https:\/\/www.theverge.com\/newsletters\" rel=\"nofollow noopener\" target=\"_blank\">here<\/a>.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">It all started with J.A.R.V.I.S. Yes, that J.A.R.V.I.S. The one from the Marvel movies.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">Well, maybe it didn\u2019t start with Iron Man\u2019s AI assistant, but the fictional system definitely helped the concept of an AI agent along. Whenever I\u2019ve interviewed AI industry folks about agentic AI, they often point to J.A.R.V.I.S. as an example of the ideal AI tool in many ways \u2014 one that knows what you need done before you even ask, can analyze and find insights in large swaths of data, and can offer strategic advice or run point on certain aspects of your business. People sometimes disagree on the exact definition of an AI agent, but at its core, it\u2019s a step beyond chatbots in that it\u2019s a system that can perform multistep, complex tasks on your behalf without constantly needing back-and-forth communication with you. It essentially makes its own to-do list of subtasks it needs to complete in order to get to your preferred end goal. That fantasy is closer to being a reality in many ways, but when it comes to actual usefulness for the everyday user, there are a lot of things that don\u2019t work \u2014 and maybe will never work.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">The term \u201cAI agent\u201d has been around for a long time, but it especially started trending in the tech industry in 2023. That was the year of the concept of AI agents; the term was on everyone\u2019s lips as people tried to suss out the idea and how to make it a reality, but you didn\u2019t see many successful use cases. The next year, 2024, was the year of deployment \u2014 people were really putting the code out into the field and seeing what it could do. (The answer, at the time, was\u2026 not much. And filled with a bunch of error messages.)<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">I can pinpoint the hype around AI agents becoming widespread to one specific announcement: In February 2024, Klarna, a fintech company, <a href=\"https:\/\/www.klarna.com\/international\/press\/klarna-ai-assistant-handles-two-thirds-of-customer-service-chats-in-its-first-month\/\" rel=\"nofollow noopener\" target=\"_blank\">said<\/a> that after one month, its AI assistant (powered by OpenAI\u2019s tech) had successfully done the work of 700 full-time customer service agents and automated two-thirds of the company\u2019s customer service chats. For months, those statistics came up in almost every AI industry conversation I had.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">The hype never died down, and in the following months, every Big Tech CEO seemed to harp on the term in every earnings call. Executives at Amazon, Meta, Google, Microsoft, and a whole host of other companies began to talk about their commitment to building useful and successful AI agents \u2014 and tried to put their money where their mouths are to make it happen.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">The vision was that one day, an AI agent could do everything from book your travel to generate visuals for your business presentations. The ideal tool could even, say, find a good time and place to hang out with a bunch of your friends that works with all of your calendars, food preferences, and dietary restrictions \u2014 and then book the dinner reservation and create a calendar event for everyone.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">Now let\u2019s talk about the \u201cAI coding\u201d of it all: For years, AI coding has been carrying the agentic AI industry. If you asked anyone about real-life, successful, not-annoying use cases for AI agents happening right now and not conceptually in a not-too-distant future, they\u2019d point to AI coding \u2014 and that was pretty much the only concrete thing they could point to. Many engineers use AI agents for coding, and they\u2019re seen as objectively pretty good. Good enough, in fact, that at Microsoft and Google, <a href=\"https:\/\/techcrunch.com\/2025\/04\/29\/microsoft-ceo-says-up-to-30-of-the-companys-code-was-written-by-ai\" rel=\"nofollow noopener\" target=\"_blank\">up to 30 percent<\/a> of the code is now being written by AI agents. And for startups like OpenAI and Anthropic, which burn through cash at high rates, one of their biggest revenue generators is AI coding tools for enterprise clients.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">So until recently, AI coding has been the main real-life use case of AI agents, but obviously, that\u2019s not pandering to the everyday consumer. The vision, remember, was always a jack-of-all-trades sort of AI agent for the \u201ceveryman.\u201d And we\u2019re not quite there yet \u2014 but in 2025, we\u2019ve gotten closer than we\u2019ve ever been before.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">Last October, Anthropic kicked things off by introducing \u201cComputer Use,\u201d a tool that allowed Claude to use a computer like a human might \u2014 browsing, searching, accessing different platforms, and completing complex tasks on a user\u2019s behalf. The general consensus was that the tool was a step forward for technology, but reviews said that in practice, it left a lot to be desired. Fast-forward to January 2025, and OpenAI released Operator, its version of the same thing, and billed it as a tool for filling out forms, ordering groceries, booking travel, and creating memes. Once again, in practice, many users agreed that the tool was buggy, slow, and not always efficient. But again, it was a significant step. The next month, OpenAI released Deep Research, an agentic AI tool that could compile long research reports on any topic for a user, and that spun things forward, too. Some people said the research reports were more impressive in length than content, but others were seriously impressed. And then in July, OpenAI combined Deep Research and Operator into one AI agent product: ChatGPT Agent. Was it better than most consumer-facing agentic AI tools that came before? Absolutely. Was it still tough to make work successfully in practice? Absolutely.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">So there\u2019s a long way to go to reach that vision of an ideal AI agent, but at the same time, we\u2019re technically closer than we\u2019ve ever been before. That\u2019s why tech companies are putting more and more money into agentic AI, by way of investing in additional compute, research and development, or talent. Google recently hired Windsurf\u2019s CEO, cofounder, and some R&amp;D team members, specifically to help Google push its AI agent projects forward. And companies like Anthropic and OpenAI are racing each other up the ladder, rung by rung, to introduce incremental features to put these agents in the hands of consumers. (Anthropic, for instance, <a href=\"https:\/\/www.theverge.com\/anthropic\/766270\/anthropic-is-piloting-a-chrome-extension-for-claude-so-it-can-work-in-your-browser\" rel=\"nofollow noopener\" target=\"_blank\">just announced<\/a> a Chrome extension for Claude that allows it to work in your browser.)<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">So really, what happens next is that we\u2019ll see AI coding continue to improve (and, unfortunately, potentially replace the jobs of many <a href=\"http:\/\/businessinsider.com\/anthropic-ceo-ai-90-percent-code-3-to-6-months-2025-3\" rel=\"nofollow noopener\" target=\"_blank\">entry-level software engineers<\/a>). We\u2019ll also see the consumer-facing agent products improve, likely slowly but surely. And we\u2019ll see agents used increasingly for enterprise and government applications, especially since Anthropic, OpenAI, and xAI have all debuted government-specific AI platforms in recent months.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">Overall, expect to see more false starts, starts and stops, and mergers and acquisitions as the AI agent competition picks up (and the hype bubble continues to balloon). One question we\u2019ll all have to ask ourselves as the months go on: What do we actually want a conceptual \u201cAI agent\u201d to be able to do for us? Do we want them to replace just the logistics or also the more personal, human aspects of life (i.e., helping write a wedding toast or a note for a flower delivery)? And how good are they at helping with the logistics vs. the personal stuff? (Answer for that last one: not very good at the moment.)<\/p>\n<p>Besides the astronomical environmental cost of AI \u2014 especially for large models, which are the ones powering AI agent efforts \u2014 there\u2019s an elephant in the room. And that\u2019s the idea that \u201csmarter AI that can do anything for you\u201d isn\u2019t always good, especially when people want to use it to do\u2026 bad things. Things like creating chemical, biological, radiological, and nuclear (CBRN) weapons. Top AI companies say they\u2019re increasingly worried about the risks of that. (Of course, they\u2019re not worried enough to stop building.) Let\u2019s talk about the regulation of it all. A lot of people have fears about the implications of AI, but many aren\u2019t fully aware of the potential dangers posed by uber-helpful, aiming-to-please AI agents in the hands of bad actors, both stateside and abroad (think: <a href=\"https:\/\/www.theverge.com\/ai-artificial-intelligence\/766435\/anthropic-claude-threat-intelligence-report-ai-cybersecurity-hacking\" rel=\"nofollow noopener\" target=\"_blank\">\u201cvibe-hacking,\u201d romance scams, and more<\/a>). AI companies say they\u2019re ahead of the risk with the voluntary safeguards they\u2019ve implemented. But many others say this may be a case for an external gut-check. <a class=\"duet--article--comments-link b1p9679\" href=\"http:\/\/www.theverge.com\/the-stepback-newsletter\/767376\/ai-agents-jarvis-what-can-they-do#comments\" rel=\"nofollow noopener\" target=\"_blank\">37 Comments<\/a>Follow topics and authors from this story to see more like this in your personalized homepage feed and to receive email updates.Hayden FieldClose<img alt=\"Hayden Field\" data-chromatic=\"ignore\" loading=\"lazy\" decoding=\"async\" data-nimg=\"fill\" class=\"_1bw37385 x271pn0\" style=\"position:absolute;height:100%;width:100%;left:0;top:0;right:0;bottom:0;color:transparent;background-size:cover;background-position:50% 50%;background-repeat:no-repeat;background-image:url(&quot;data:image\/svg+xml;charset=utf-8,%3Csvg xmlns='http:\/\/www.w3.org\/2000\/svg' %3E%3Cfilter id='b' color-interpolation-filters='sRGB'%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3CfeColorMatrix values='1 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 100 -1' result='s'\/%3E%3CfeFlood x='0' y='0' width='100%25' height='100%25'\/%3E%3CfeComposite operator='out' in='s'\/%3E%3CfeComposite in2='SourceGraphic'\/%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3C\/filter%3E%3Cimage width='100%25' height='100%25' x='0' y='0' preserveAspectRatio='none' style='filter: url(%23b);' href='data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAQAAAC1HAwCAAAAC0lEQVR42mN8+R8AAtcB6oaHtZcAAAAASUVORK5CYII='\/%3E%3C\/svg%3E&quot;)\"   src=\"https:\/\/www.newsbeep.com\/uk\/wp-content\/uploads\/2025\/09\/257719_staff_portraits_2025_HAYDEN_AKrales_0081.jpg\"\/>Hayden Field<\/p>\n<p class=\"fv263x1\">Posts from this author will be added to your daily email digest and your homepage feed.<\/p>\n<p>PlusFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/authors\/hayden-field\" rel=\"nofollow noopener\" target=\"_blank\">See All by Hayden Field<\/a><\/p>\n<p>AICloseAI<\/p>\n<p class=\"fv263x1\">Posts from this topic will be added to your daily email digest and your homepage feed.<\/p>\n<p>PlusFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/ai-artificial-intelligence\" rel=\"nofollow noopener\" target=\"_blank\">See All AI<\/a><\/p>\n<p>ColumnCloseColumn<\/p>\n<p class=\"fv263x1\">Posts from this topic will be added to your daily email digest and your homepage feed.<\/p>\n<p>PlusFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/column\" rel=\"nofollow noopener\" target=\"_blank\">See All Column<\/a><\/p>\n<p>TechCloseTech<\/p>\n<p class=\"fv263x1\">Posts from this topic will be added to your daily email digest and your homepage feed.<\/p>\n<p>PlusFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/tech\" rel=\"nofollow noopener\" target=\"_blank\">See All Tech<\/a><\/p>\n<p>The StepbackCloseThe Stepback<\/p>\n<p class=\"fv263x1\">Posts from this topic will be added to your daily email digest and your homepage feed.<\/p>\n<p>PlusFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/the-stepback-newsletter\" rel=\"nofollow noopener\" target=\"_blank\">See All The Stepback<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"This is The Stepback, a weekly newsletter breaking down one essential story from the tech world. For more&hellip;\n","protected":false},"author":2,"featured_media":107198,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[554,733,4308,22162,227,86,52152,56,54,55],"class_list":{"0":"post-107197","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-column","12":"tag-tech","13":"tag-technology","14":"tag-the-stepback","15":"tag-uk","16":"tag-united-kingdom","17":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/107197","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=107197"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/107197\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/107198"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=107197"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=107197"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=107197"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}