{"id":340939,"date":"2026-03-21T16:00:12","date_gmt":"2026-03-21T16:00:12","guid":{"rendered":"https:\/\/www.newsbeep.com\/nz\/340939\/"},"modified":"2026-03-21T16:00:12","modified_gmt":"2026-03-21T16:00:12","slug":"gemini-task-automation-is-slow-clunky-and-super-impressive","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/nz\/340939\/","title":{"rendered":"Gemini task automation is slow, clunky, and super impressive"},"content":{"rendered":"<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">I\u2019ve been testing out <a href=\"https:\/\/www.theverge.com\/tech\/884210\/google-gemini-samsung-s26-pixel-10-uber\" rel=\"nofollow noopener\" target=\"_blank\">Gemini\u2019s new task automation<\/a> on the Pixel 10 Pro and the Galaxy S26 Ultra, which for the first time lets Gemini take the wheel and use apps for you. It\u2019s limited to a small subset right now \u2014 a handful of food delivery and rideshare services \u2014 and it\u2019s still in beta. It\u2019s slow, it\u2019s clunky at times, and it doesn\u2019t solve any serious problem you had using your phone. But it\u2019s impressive as hell, and I don\u2019t think it\u2019s hyperbole to say this is a glimpse of the future. We\u2019re still a long way off, but this is the first time I\u2019ve seen a true AI assistant actually working on a phone \u2014 not in a keynote presentation or a carefully controlled demo inside a convention hall.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">First off: Gemini is much slower than you, or me, or most anyone at using their phone. If you need to order an Uber right this second, you\u2019re still the best person for the job. Before you write it off, though, remember that task automation is designed to run in the background while you do other things on your phone. Even better, it keeps working while you\u2019re not looking at your phone, so you can do things like check that your passport is in your bag for the 10th time.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">But if you\u2019re curious, like I am, you can watch the whole thing happen. While it\u2019s working, text appears at the bottom of the screen indicating what Gemini is doing. Stuff like \u201cSelecting a second portion of Chicken Teriyaki for the combo,\u201d which it did when I directed it to order my dinner on Saturday night. Watching Gemini figure things out on the fly honestly kinda rules. I asked for a chicken combo plate; the menu presented options in half- portion increments, so it correctly added two half servings of chicken.<\/p>\n<p><a class=\"kqz8fh1\" href=\"https:\/\/platform.theverge.com\/wp-content\/uploads\/sites\/2\/2026\/03\/Screenshot-2026-03-20-at-2.55.10%E2%80%AFPM.png?quality=90&amp;strip=all&amp;crop=0,0,100,100\" data-pswp-height=\"1498\" data-pswp-width=\"744\" target=\"_blank\" rel=\"noreferrer nofollow noopener\"><img alt=\"\" data-chromatic=\"ignore\" loading=\"lazy\" decoding=\"async\" data-nimg=\"fill\" class=\"x271pn0\" style=\"position:absolute;height:100%;width:100%;left:0;top:0;right:0;bottom:0;color:transparent;background-size:cover;background-position:50% 50%;background-repeat:no-repeat;background-image:url(&quot;data:image\/svg+xml;charset=utf-8,%3Csvg xmlns='http:\/\/www.w3.org\/2000\/svg' %3E%3Cfilter id='b' color-interpolation-filters='sRGB'%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3CfeColorMatrix values='1 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 100 -1' result='s'\/%3E%3CfeFlood x='0' y='0' width='100%25' height='100%25'\/%3E%3CfeComposite operator='out' in='s'\/%3E%3CfeComposite in2='SourceGraphic'\/%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3C\/filter%3E%3Cimage width='100%25' height='100%25' x='0' y='0' preserveAspectRatio='none' style='filter: url(%23b);' href='data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAQAAAC1HAwCAAAAC0lEQVR42mN8+R8AAtcB6oaHtZcAAAAASUVORK5CYII='\/%3E%3C\/svg%3E&quot;)\"   src=\"https:\/\/www.newsbeep.com\/nz\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-20-at-2.55.10\u202fPM.png\"\/><\/a><\/p>\n<p>Gemini figured out that two half portions would equal one order of chicken teriyaki.<\/p>\n<p><a class=\"kqz8fh1\" href=\"https:\/\/platform.theverge.com\/wp-content\/uploads\/sites\/2\/2026\/03\/Screenshot-2026-03-20-at-6.57.39%E2%80%AFPM.png?quality=90&amp;strip=all&amp;crop=0,0,100,100\" data-pswp-height=\"1502\" data-pswp-width=\"702\" target=\"_blank\" rel=\"noreferrer nofollow noopener\"><img alt=\"\" data-chromatic=\"ignore\" loading=\"lazy\" decoding=\"async\" data-nimg=\"fill\" class=\"x271pn0\" style=\"position:absolute;height:100%;width:100%;left:0;top:0;right:0;bottom:0;color:transparent;background-size:cover;background-position:50% 50%;background-repeat:no-repeat;background-image:url(&quot;data:image\/svg+xml;charset=utf-8,%3Csvg xmlns='http:\/\/www.w3.org\/2000\/svg' %3E%3Cfilter id='b' color-interpolation-filters='sRGB'%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3CfeColorMatrix values='1 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 100 -1' result='s'\/%3E%3CfeFlood x='0' y='0' width='100%25' height='100%25'\/%3E%3CfeComposite operator='out' in='s'\/%3E%3CfeComposite in2='SourceGraphic'\/%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3C\/filter%3E%3Cimage width='100%25' height='100%25' x='0' y='0' preserveAspectRatio='none' style='filter: url(%23b);' href='data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAQAAAC1HAwCAAAAC0lEQVR42mN8+R8AAtcB6oaHtZcAAAAASUVORK5CYII='\/%3E%3C\/svg%3E&quot;)\"   src=\"https:\/\/www.newsbeep.com\/nz\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-20-at-6.57.39\u202fPM.png\"\/><\/a><\/p>\n<p>Gemini had more trouble finding the side of greens featured right in the middle of the screen here.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">It\u2019s for the best that when you start an automation with Gemini, the default behavior is for it to run in the background. You have to tap a button and open another window if you want to watch Gemini working through the task. And it can be excruciating. Watching the computer try to find a side of greens on a menu in Uber Eats when it\u2019s sitting right there at the top of the screen is like watching a horror movie and knowing the murderer is in the closet right next to the protagonist. I mean, except for the murder part. Gemini made a couple of wrong turns as it put together my teriyaki order, which it eventually figured out on its own, but the whole episode took about nine minutes. Not ideal.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">Gemini is supposed to carry out your task right up to the point where it\u2019s time to hit confirm and order your car or dinner so you can double-check its work. This, I think, is the only sane way to use this feature right now, and I don\u2019t mind the added friction of completing the order. In the tests I\u2019ve run over the past five days, I\u2019ve never had it go rogue and finish my order for me. And it is surprisingly accurate; I\u2019ve had to make very few adjustments to the final order. If it fails \u2014 which I have seen happen a couple of times \u2014 it tends to be within the first minute or two when something about the app needs my attention, like giving it permission to use my location, or changing the delivery location to home rather than Nevada, which was the last place I used that app. I had to figure out what the problem was in cases like this, but once it was sorted out I was able to restart the automation without an issue.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">Here\u2019s the one that really got me. I put an event on my calendar for a flight to San Francisco the following day (a pretend trip for me, but real flight details). I gave Gemini a vague prompt to schedule an Uber that would get me to the airport in time for my flight tomorrow. Because Gemini has access to my email and calendar, it can go find that information. It did need a little extra guidance \u2014 possibly because the flight wasn\u2019t in my email like it expected. But with that, it found the flight information, suggested leaving by 11:30 or 11:45AM (logical timing for a 1:45PM flight given I live close to the airport), and asked if I wanted to schedule a ride for one of those times. I confirmed the time, and it went about setting up the ride in about three minutes with no further input required on my part.<\/p>\n<p><a class=\"kqz8fh1\" href=\"https:\/\/platform.theverge.com\/wp-content\/uploads\/sites\/2\/2026\/03\/Screenshot-2026-03-20-at-7.03.54%E2%80%AFPM.png?quality=90&amp;strip=all&amp;crop=0,0,100,100\" data-pswp-height=\"1458\" data-pswp-width=\"684\" target=\"_blank\" rel=\"noreferrer nofollow noopener\"><img alt=\"\" data-chromatic=\"ignore\" loading=\"lazy\" decoding=\"async\" data-nimg=\"fill\" class=\"x271pn0\" style=\"position:absolute;height:100%;width:100%;left:0;top:0;right:0;bottom:0;color:transparent;background-size:cover;background-position:50% 50%;background-repeat:no-repeat;background-image:url(&quot;data:image\/svg+xml;charset=utf-8,%3Csvg xmlns='http:\/\/www.w3.org\/2000\/svg' %3E%3Cfilter id='b' color-interpolation-filters='sRGB'%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3CfeColorMatrix values='1 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 100 -1' result='s'\/%3E%3CfeFlood x='0' y='0' width='100%25' height='100%25'\/%3E%3CfeComposite operator='out' in='s'\/%3E%3CfeComposite in2='SourceGraphic'\/%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3C\/filter%3E%3Cimage width='100%25' height='100%25' x='0' y='0' preserveAspectRatio='none' style='filter: url(%23b);' href='data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAQAAAC1HAwCAAAAC0lEQVR42mN8+R8AAtcB6oaHtZcAAAAASUVORK5CYII='\/%3E%3C\/svg%3E&quot;)\"   src=\"https:\/\/www.newsbeep.com\/nz\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-20-at-7.03.54\u202fPM.png\"\/><\/a><a class=\"kqz8fh1\" href=\"https:\/\/platform.theverge.com\/wp-content\/uploads\/sites\/2\/2026\/03\/Screenshot-2026-03-20-at-7.04.53%E2%80%AFPM.png?quality=90&amp;strip=all&amp;crop=0,0,100,100\" data-pswp-height=\"1460\" data-pswp-width=\"696\" target=\"_blank\" rel=\"noreferrer nofollow noopener\"><img alt=\"\" data-chromatic=\"ignore\" loading=\"lazy\" decoding=\"async\" data-nimg=\"fill\" class=\"x271pn0\" style=\"position:absolute;height:100%;width:100%;left:0;top:0;right:0;bottom:0;color:transparent;background-size:cover;background-position:50% 50%;background-repeat:no-repeat;background-image:url(&quot;data:image\/svg+xml;charset=utf-8,%3Csvg xmlns='http:\/\/www.w3.org\/2000\/svg' %3E%3Cfilter id='b' color-interpolation-filters='sRGB'%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3CfeColorMatrix values='1 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 100 -1' result='s'\/%3E%3CfeFlood x='0' y='0' width='100%25' height='100%25'\/%3E%3CfeComposite operator='out' in='s'\/%3E%3CfeComposite in2='SourceGraphic'\/%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3C\/filter%3E%3Cimage width='100%25' height='100%25' x='0' y='0' preserveAspectRatio='none' style='filter: url(%23b);' href='data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAQAAAC1HAwCAAAAC0lEQVR42mN8+R8AAtcB6oaHtZcAAAAASUVORK5CYII='\/%3E%3C\/svg%3E&quot;)\"   src=\"https:\/\/www.newsbeep.com\/nz\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-20-at-7.04.53\u202fPM.png\"\/><\/a><\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">It\u2019s a little more impressive when you consider that Uber doesn\u2019t even refer to it as scheduling a ride \u2014 you reserve a ride. That\u2019s the key difference between the digital assistants we\u2019ve been using and the AI assistants emerging now. Being able to use natural language when talking to the computer <a href=\"https:\/\/www.theverge.com\/report\/787171\/amazon-alexa-plus-hardware-event-smart-home\" rel=\"nofollow noopener\" target=\"_blank\">makes a huge difference when you\u2019re controlling your smart home<\/a> or placing your dinner order. If the computer is going to get tripped up and ask for clarification when you forget that the restaurant calls your meal a \u201cplate\u201d and not a \u201ccombo,\u201d or if you ask for \u201cslaw\u201d instead of \u201cshredded cabbage,\u201d then it\u2019s no more useful than the assistants we\u2019ve been using for the past decade to set timers and play music.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">That said, watching Gemini tap and scroll around Uber Eats makes one thing painfully obvious: If you were designing an application for AI to use, it would look nothing like the ones we have today. You know, apps designed for humans. An AI assistant won\u2019t be tempted by a big ad in the middle of a page to save 30 percent on your order. An appetizing, well-staged photo of the dish it\u2019s ordering isn\u2019t any more convincing than a low-quality one. You would give it a database, not a bunch of clutter to weed through \u2014 <a href=\"https:\/\/www.theverge.com\/ai-artificial-intelligence\/841156\/ai-companies-aaif-anthropic-mcp-model-context-protocol\" rel=\"nofollow noopener\" target=\"_blank\">something the industry is working toward<\/a> in Model Context Protocol, or MCP.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _17nnmdya _1xwtict1\">An AI model reasoning its way through a human-centric interface feels like the most impractical and brittle way to place a pizza order. It does hit a snag occasionally, and it\u2019s not great at telling you why it couldn\u2019t do something. This version of task automation feels like a stopgap until app developers adopt more robust methods: MCP or <a href=\"https:\/\/www.theverge.com\/2024\/11\/22\/24303329\/google-gemini-android-16-app-functions\" rel=\"nofollow noopener\" target=\"_blank\">Android\u2019s app functions<\/a>. Google\u2019s head of Android, Sameer Samat, <a href=\"https:\/\/www.theverge.com\/tech\/884210\/google-gemini-samsung-s26-pixel-10-uber\" rel=\"nofollow noopener\" target=\"_blank\">told me recently<\/a> that Gemini takes the reasoning approach in the absence of the other two. Maybe this version of task automation is our preview of what\u2019s possible, or a way to prod developers into adopting one of the other methods. Either way, this feels like a notable first step toward a new way of using our mobile assistants \u2014 awkward, slow, but very promising.<\/p>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph _1ymtmqpi _17nnmdy1 _17nnmdy0 _1xwtict1\">Photography by Allison Johnson \/ The Verge<\/p>\n<p>Follow topics and authors from this story to see more like this in your personalized homepage feed and to receive email updates.Allison JohnsonClose<img alt=\"Allison Johnson\" data-chromatic=\"ignore\" loading=\"lazy\" decoding=\"async\" data-nimg=\"fill\" class=\"_1bw37385 x271pn0\" style=\"position:absolute;height:100%;width:100%;left:0;top:0;right:0;bottom:0;color:transparent;background-size:cover;background-position:50% 50%;background-repeat:no-repeat;background-image:url(&quot;data:image\/svg+xml;charset=utf-8,%3Csvg xmlns='http:\/\/www.w3.org\/2000\/svg' %3E%3Cfilter id='b' color-interpolation-filters='sRGB'%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3CfeColorMatrix values='1 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 100 -1' result='s'\/%3E%3CfeFlood x='0' y='0' width='100%25' height='100%25'\/%3E%3CfeComposite operator='out' in='s'\/%3E%3CfeComposite in2='SourceGraphic'\/%3E%3CfeGaussianBlur stdDeviation='20'\/%3E%3C\/filter%3E%3Cimage width='100%25' height='100%25' x='0' y='0' preserveAspectRatio='none' style='filter: url(%23b);' href='data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAQAAAC1HAwCAAAAC0lEQVR42mN8+R8AAtcB6oaHtZcAAAAASUVORK5CYII='\/%3E%3C\/svg%3E&quot;)\"   src=\"https:\/\/www.newsbeep.com\/nz\/wp-content\/uploads\/2026\/03\/1774108812_682_ALLISON_JOHNSON.0.jpg\"\/><\/p>\n<p>Allison Johnson<\/p>\n<p class=\"fv263x1\">Posts from this author will be added to your daily email digest and your homepage feed.<\/p>\n<p>FollowFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/authors\/allison-johnson\" rel=\"nofollow noopener\" target=\"_blank\">See All by Allison Johnson<\/a><\/p>\n<p>AIClose<\/p>\n<p>AI<\/p>\n<p class=\"fv263x1\">Posts from this topic will be added to your daily email digest and your homepage feed.<\/p>\n<p>FollowFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/ai-artificial-intelligence\" rel=\"nofollow noopener\" target=\"_blank\">See All AI<\/a><\/p>\n<p>GoogleClose<\/p>\n<p>Google<\/p>\n<p class=\"fv263x1\">Posts from this topic will be added to your daily email digest and your homepage feed.<\/p>\n<p>FollowFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/google\" rel=\"nofollow noopener\" target=\"_blank\">See All Google<\/a><\/p>\n<p>Hands-onClose<\/p>\n<p>Hands-on<\/p>\n<p class=\"fv263x1\">Posts from this topic will be added to your daily email digest and your homepage feed.<\/p>\n<p>FollowFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/hands-on\" rel=\"nofollow noopener\" target=\"_blank\">See All Hands-on<\/a><\/p>\n<p>ReviewsClose<\/p>\n<p>Reviews<\/p>\n<p class=\"fv263x1\">Posts from this topic will be added to your daily email digest and your homepage feed.<\/p>\n<p>FollowFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/reviews\" rel=\"nofollow noopener\" target=\"_blank\">See All Reviews<\/a><\/p>\n<p>TechClose<\/p>\n<p>Tech<\/p>\n<p class=\"fv263x1\">Posts from this topic will be added to your daily email digest and your homepage feed.<\/p>\n<p>FollowFollow<\/p>\n<p class=\"fv263x4\"><a class=\"fv263x5\" href=\"https:\/\/www.theverge.com\/tech\" rel=\"nofollow noopener\" target=\"_blank\">See All Tech<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"I\u2019ve been testing out Gemini\u2019s new task automation on the Pixel 10 Pro and the Galaxy S26 Ultra,&hellip;\n","protected":false},"author":2,"featured_media":326330,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[365,363,364,367,8663,111,139,69,763,370,145],"class_list":{"0":"post-340939","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-google","12":"tag-hands-on","13":"tag-new-zealand","14":"tag-newzealand","15":"tag-nz","16":"tag-reviews","17":"tag-tech","18":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/posts\/340939","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/comments?post=340939"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/posts\/340939\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/media\/326330"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/media?parent=340939"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/categories?post=340939"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/tags?post=340939"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}