{"id":302047,"date":"2025-12-06T20:12:11","date_gmt":"2025-12-06T20:12:11","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/302047\/"},"modified":"2025-12-06T20:12:11","modified_gmt":"2025-12-06T20:12:11","slug":"openai-beats-google-meta-and-grok-in-all-ai-poker-tournament","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/302047\/","title":{"rendered":"OpenAI beats Google, Meta, and Grok in all-AI poker tournament"},"content":{"rendered":"<p>OpenAI\u2019s o3 model won a five-day poker tournament of nine AI chatbots. The o3 model won by playing the most consistent game. Most top language models handled poker well, but struggled with bluffing, position, and basic math.<\/p>\n<p id=\"6a96fa20-356a-4a4f-ace8-72ec270adc4b\">In a digital showdown unlike anything ever dealt at the felt, nine of the world\u2019s most powerful large language models spent five days locked in a high-stakes poker match.<\/p>\n<p>OpenAI\u2019s o3, Anthropic\u2019s Claude Sonnet 4.5, xAI&#8217;s Grok, <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.techradar.com\/tag\/google\" data-auto-tag-linker=\"true\" data-mrf-recirculation=\"inline-link\" data-before-rewrite-localise=\"https:\/\/www.techradar.com\/tag\/google\" rel=\"nofollow noopener\" target=\"_blank\">Google<\/a>&#8217;s Gemini 2.5 Pro, Meta\u2019s Llama 4, DeepSeek R1, Kimi K2 from Moonshot AI, Magistral from Mistral AI, and Z.AI\u2019s GLM 4.6 played thousands of hands of no-limit Texas hold &#8217;em at $10 and $20 tables with $100,000 bankrolls apiece.<\/p>\n<p id=\"6a96fa20-356a-4a4f-ace8-72ec270adc4b-2\">When OpenAI\u2019s o3 model walked away from the five-day poker game $36,691 richer, there was no trophy, just bragging rights.<\/p>\n<p>The experimental PokerBattle.ai was entirely AI-run 
with the same initial prompt issued to each player. It was pure strategy, if strategy is what you call thousands of micro-decisions made by machines that don\u2019t really understand winning, losing, or how humiliating it is to bust with seven-deuce.<\/p>\n<p>For a tech stunt, it was unusually telling. The top-performing AIs weren\u2019t just bluffing and betting \u2013 they were adapting, modeling their opponents, and learning in real time how to navigate ambiguity. While they didn\u2019t play flawless poker, they came impressively close to mimicking seasoned players&#8217; judgment calls.<\/p>\n<p>OpenAI\u2019s o3 quickly showed it had the steadiest hand, taking down three of the five biggest pots and sticking close to textbook pre-flop theory. Anthropic\u2019s Claude and xAI\u2019s Grok rounded out the top three with substantial profits of $33,641 and $28,796, respectively.<\/p>\n<p>Meanwhile, Llama lost its full stack and flamed out early. The rest of the pack landed somewhere in between, with Google\u2019s Gemini turning a modest profit and Moonshot\u2019s Kimi K2 hemorrhaging chips down to an $86,030 finish.<\/p>\n<p>Gambling AI<\/p>\n<p id=\"841e1be8-2955-4a45-b13a-f92c74ff7d89\">Poker has long been one of the best analogs for testing general-purpose AI. Unlike chess or Go, which rely on perfect information, poker demands that players reason under uncertainty. 
It\u2019s a mirror of real-world decision-making in everything from business negotiations to military strategy, and now, apparently, <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.techradar.com\/tag\/chatbot\" data-auto-tag-linker=\"true\" data-mrf-recirculation=\"inline-link\" data-before-rewrite-localise=\"https:\/\/www.techradar.com\/tag\/chatbot\" rel=\"nofollow noopener\" target=\"_blank\">chatbot<\/a> development.<\/p>\n<p>One consistent takeaway from the tournament was that the bots were often too aggressive. Most favored action-heavy strategies, even in situations where folding would have been wiser. They tried to win big pots more than they tried to avoid losing them. And they were awful at bluffing, not because they didn\u2019t try, but because their bluffs often stemmed from misread hands, not clever deception.<\/p>\n<p>Still, AI tools are getting smarter in ways that go well beyond the surface. They\u2019re not just repeating what they\u2019ve read; they\u2019re making probabilistic judgments under pressure and learning to read the room. The tournament is also a reminder that even powerful models still have flaws. Misreading situations, drawing shaky conclusions, and forgetting their own \u201cposition\u201d aren\u2019t just poker problems.<\/p>\n<p>You might never sit across from a language model in a real poker room, but odds are you\u2019ll interact with one trying to make decisions that matter. 
This game was just a glimpse of what that could look like.<\/p>\n","protected":false},"excerpt":{"rendered":"OpenAI\u2019s o3 model won a five-day poker tournament of nine AI chatbotsThe o3 model won by playing 
the&hellip;\n","protected":false},"author":2,"featured_media":302048,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[554,733,4308,86,56,54,55],"class_list":{"0":"post-302047","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-technology","12":"tag-uk","13":"tag-united-kingdom","14":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/302047","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=302047"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/302047\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/302048"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=302047"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=302047"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=302047"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}