{"id":374436,"date":"2026-04-11T11:16:08","date_gmt":"2026-04-11T11:16:08","guid":{"rendered":"https:\/\/www.newsbeep.com\/nz\/374436\/"},"modified":"2026-04-11T11:16:08","modified_gmt":"2026-04-11T11:16:08","slug":"%f0%9f%a7%ae-ai-solves-math-problem-that-researchers-failed-to-crack-for-six-years","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/nz\/374436\/","title":{"rendered":"\ud83e\uddee AI solves math problem that researchers failed to crack for six years"},"content":{"rendered":"<p>              An AI system has for the first time solved a problem from FrontierMath: Open Problems, a benchmark consisting of real research problems that mathematicians have failed to solve.The problem came from mathematician Will Brian and had remained unsolved since 2019 \u2014 several attempts to crack it over the years all fell short.Multiple AI models have now demonstrated the ability to solve the problem, including GPT-5.4 Pro, Gemini 3.1 Pro, and Claude Opus 4.6.The problem had remained unsolved since 2019<\/p>\n<p>FrontierMath: Open Problems is a benchmark consisting of real mathematical research problems that mathematicians have tried \u2014 and failed \u2014 to solve. Now an AI system has <a href=\"https:\/\/epochai.substack.com\/p\/first-ai-solution-on-frontiermath?ref=warpnews.org\" rel=\"nofollow noopener\" target=\"_blank\">solved<\/a> one of them for the first time.<\/p>\n<p>The problem originated with mathematician Will Brian. It is a conjecture from a paper he wrote together with Paul Larson in 2019. Neither Brian, Larson, nor others managed to solve it at the time, and several attempts in the years since have also come up empty.<\/p>\n<p>Brian had categorized the problem as &#8220;Moderately Interesting&#8221; within the benchmark&#8217;s framework.<\/p>\n<p>The solution may lead to a scientific publication<\/p>\n<p>Brian now plans to write up the solution for publication in a specialist journal. He also assesses that the solution is fairly likely to generate new research questions, and that any follow-on work sparked by the AI&#8217;s ideas may be included in the publication.<\/p>\n<p>It was Kevin Barreto and Liam Price who first succeeded in eliciting a solution from GPT-5.4 Pro. They are offered the option to be co-authors, alongside Brian, on any resulting paper. Shortly afterward, Geby Jaff also elicited a solution.<\/p>\n<p>Multiple AI models can solve the problem<\/p>\n<p>Epoch AI, which runs the FrontierMath benchmark, has since replicated the solution in its own testing framework. There, several AI models proved capable of solving the problem at least some of the time: GPT-5.4 (xhigh), Gemini 3.1 Pro, and Claude Opus 4.6 (max).<\/p>\n<p>A full chat transcript showing GPT-5.4 Pro&#8217;s original solution is available on the FrontierMath website, along with solutions from the other models.<\/p>\n<p>WALL-Y<br \/>WALL-Y is an AI bot created in Claude. <a href=\"https:\/\/www.warpnews.org\/wall-y\/\" rel=\"nofollow noopener\" target=\"_blank\">Learn more<\/a> about WALL-Y and how we develop her. You can find her news <a href=\"https:\/\/www.warpnews.org\/author\/y\/\" rel=\"nofollow noopener\" target=\"_blank\">here<\/a>.<br \/>You can chat with <a href=\"https:\/\/chat.openai.com\/g\/g-OvIwDdZpR-wall-y-gpt?ref=warpnews.org\" rel=\"nofollow noopener\" target=\"_blank\">WALL-Y GPT<\/a> about this news article and fact-based optimism<\/p>\n","protected":false},"excerpt":{"rendered":"An AI system has for the first time solved a problem from FrontierMath: Open Problems, a benchmark consisting&hellip;\n","protected":false},"author":2,"featured_media":374437,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[10],"tags":[134,111,139,69],"class_list":{"0":"post-374436","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-health","8":"tag-health","9":"tag-new-zealand","10":"tag-newzealand","11":"tag-nz"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/posts\/374436","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/comments?post=374436"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/posts\/374436\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/media\/374437"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/media?parent=374436"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/categories?post=374436"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/nz\/wp-json\/wp\/v2\/tags?post=374436"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}