{"id":382696,"date":"2026-01-21T18:42:08","date_gmt":"2026-01-21T18:42:08","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/382696\/"},"modified":"2026-01-21T18:42:08","modified_gmt":"2026-01-21T18:42:08","slug":"has-gemini-surpassed-chatgpt-we-put-the-ai-models-to-the-test","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/382696\/","title":{"rendered":"Has Gemini surpassed ChatGPT? We put the AI models to the test."},"content":{"rendered":"<p>Gemini, on the other hand, gives the high-level overview of the landing instructions I asked for. But when I offered both options to <a href=\"https:\/\/arstechnica.com\/gadgets\/2013\/11\/watch-senior-editor-lee-hutchinson-pull-5gs-upside-down-and-not-wet-himself\/\" rel=\"nofollow noopener\" target=\"_blank\">Ars\u2019 own aviation expert Lee Hutchinson<\/a>, he pointed out a major problem with Gemini\u2019s response:<\/p>\n<p>Gemini\u2019s guidance is both accurate (in terms of \u201cthese are the literal steps to take right now\u201d) and guaranteed to kill you, as the first thing it says is for you, the presumably inexperienced aviator, to disable autopilot on a giant twin-engine jet, before even suggesting you talk to air traffic control.<\/p>\n<p>While Lee gave Gemini points for \u201cactually answering the question,\u201d he ultimately called ChatGPT\u2019s response \u201cmore practical\u2026 ultimately, ChatGPT gives you the more useful answer [since] Google\u2019s answer will make you dead unless you\u2019ve got some 737 time and are ready to hand-fly a passenger airliner with 100+ souls on board.\u201d<\/p>\n<p>For those reasons, ChatGPT has to win this one.<\/p>\n<p>Final verdict<\/p>\n<p>This was a relatively close contest when measured purely on points. Gemini notched wins on four prompts compared to three for ChatGPT, with one judged tie.<\/p>\n<p>That said, it\u2019s important to consider where those points came from. ChatGPT earned some relatively narrow and subjective style wins on prompts for dad jokes and Lincoln\u2019s basketball story, for instance, showing it might have a slight edge on more creative writing prompts.<\/p>\n<p>For the more informational prompts, though, ChatGPT showed significant factual errors in both the biography and the Super Mario Bros. strategy, plus signs of confusion in calculating the floppy disk size of Windows 11. These kinds of errors, which Gemini was largely able to avoid in these tests, can easily lead to broader distrust in an AI model\u2019s overall output.<\/p>\n<p>All told, it seems clear that Google has gained quite a bit of relative ground on OpenAI since <a href=\"https:\/\/arstechnica.com\/ai\/2023\/12\/chatgpt-vs-google-bard-round-2-how-does-the-new-gemini-model-fare\/\" rel=\"nofollow noopener\" target=\"_blank\">we did similar tests in 2023<\/a>. We can\u2019t exactly blame Apple for looking at sample results like these and making the decision it did for its Siri partnership.<\/p>\n","protected":false},"excerpt":{"rendered":"Gemini, on the other hand, gives the high-level overview of the landing instructions I asked for. But when&hellip;\n","protected":false},"author":2,"featured_media":382697,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[554,733,4308,86,56,54,55],"class_list":{"0":"post-382696","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-technology","12":"tag-uk","13":"tag-united-kingdom","14":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/382696","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=382696"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/382696\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/382697"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=382696"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=382696"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=382696"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}