{"id":373940,"date":"2026-01-16T20:39:09","date_gmt":"2026-01-16T20:39:09","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/373940\/"},"modified":"2026-01-16T20:39:09","modified_gmt":"2026-01-16T20:39:09","slug":"amateur-mathematicians-solve-long-standing-maths-problems-with-ai","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/373940\/","title":{"rendered":"Amateur mathematicians solve long-standing maths problems with AI"},"content":{"rendered":"<p><img decoding=\"async\" class=\"Image\" alt=\"New Scientist. Science news and long reads from expert journalists, covering developments in science, technology, health and the environment on the website and the magazine.\" width=\"1350\" height=\"900\" src=\"https:\/\/www.newsbeep.com\/uk\/wp-content\/uploads\/2026\/01\/gettyimages-1219382595_2.jpg\"   loading=\"eager\" fetchpriority=\"high\" data-image-context=\"Article\" data-image-id=\"2512004\" data-caption=\"AI tools are helping to decipher long-standing maths problems\" data-credit=\"andresr\/Getty Images\"\/><\/p>\n<p class=\"ArticleImageCaption__Title\">AI tools are helping to decipher long-standing maths problems<\/p>\n<p class=\"ArticleImageCaption__Credit\">andresr\/Getty Images<\/p>\n<p>Amateur mathematicians are using artificial intelligence chatbots to solve long-standing problems, in a move that has taken professionals by surprise. 
While the problems in question aren\u2019t the most advanced in the mathematical canon, the success of AI models in tackling them shows that their <a href=\"https:\/\/www.newscientist.com\/article\/2487198-ai-could-be-about-to-completely-change-the-way-we-do-mathematics\/\" rel=\"nofollow noopener\" target=\"_blank\">mathematical performance<\/a> has passed a significant threshold, say researchers, and could fundamentally change the way we do mathematics.<\/p>\n<p>The questions being solved by AI originate from Hungarian mathematician <a href=\"https:\/\/www.newscientist.com\/article\/dn25068-wikipedia-size-maths-proof-too-big-for-humans-to-check\/\" rel=\"nofollow noopener\" target=\"_blank\">Paul Erd\u0151s<\/a>, who was famous for his ability to pose useful but difficult questions during a career that spanned over six decades. \u201cThe questions tended to be very simple, but very hard,\u201d says <a href=\"http:\/\/www.thomasbloom.org\/\" rel=\"nofollow noopener\" target=\"_blank\">Thomas Bloom<\/a> at the University of Manchester, UK.<\/p>\n<p>By his death in 1996, there were more than 1000 of these unsolved Erd\u0151s problems, spanning a wide range of mathematical disciplines, from combinatorics (the study of combinations) to number theory. Today, they are seen as signposts for progress in these fields, says Bloom, who <a href=\"https:\/\/www.erdosproblems.com\/\" rel=\"nofollow noopener\" target=\"_blank\">runs a website<\/a> that catalogues the problems and tracks mathematicians\u2019 progress in solving them.<\/p>\n<p>Because Erd\u0151s problems are often simple to state, mathematicians began experimenting with feeding them to <a href=\"https:\/\/www.newscientist.com\/article\/2504763-mathematicians-say-googles-ai-tools-are-supercharging-their-research\/\" rel=\"nofollow noopener\" target=\"_blank\">AI tools<\/a> like ChatGPT. 
Bloom says that in October last year, he began seeing people use AI models to find relevant references in the mathematical literature that helped with their solutions.<\/p>\n<p>Soon after, AI tools began finding partial improvements to results, some of which had been found in past papers, while others appeared new.<\/p>\n<p>\u201cI was surprised then,\u201d says Bloom. \u201cBefore, when I tried ChatGPT, it just made up papers, completely hallucinating, and so I had given up using it. But clearly, there was some sort of change around October. I actually found genuine papers because it had read them all, and often in a non-trivial way.\u201d<\/p>\n<p>Inspired by this progress, Kevin Barreto, an undergraduate mathematics student at Cambridge University, and Liam Price, an amateur mathematician, began looking for simple and understudied Erd\u0151s problems that they might solve with AI. After finding one such problem, number 728, a conjecture in number theory, they fed it to ChatGPT-5.2 Pro to solve it.<\/p>\n<p>\u201cI looked at the statement, and thought, \u2018This one might be able to get solved by ChatGPT, so let\u2019s try it,\u2019\u201d says Barreto. \u201cSure enough, it comes back with an argument that\u2019s quite nice and that a lot of people would actually agree was rather sophisticated.\u201d<\/p>\n<p>After ChatGPT produced a proof, Barreto and Price used another AI tool called Aristotle, created by the AI company Harmonic, to verify their work. Aristotle converts the conventional language proof into one written in Lean, a mathematical programming language. It can then be instantly checked by a computer for correctness. 
This is an important step, says Bloom, as it spares researchers the limited time they would otherwise spend checking whether a result is correct.<\/p>\n<p>As <a href=\"https:\/\/github.com\/teorth\/erdosproblems\/wiki\/AI-contributions-to-Erd%C5%91s-problems\" rel=\"nofollow noopener\" target=\"_blank\">of mid-January<\/a>, six Erd\u0151s problems have been fully solved by AI tools, though subsequent scrutiny by professional mathematicians revealed that five of these problems had previously been solved in the mathematical literature. Only one problem, number 205, has been fully solved by Barreto and Price with no pre-existing solution. AI tools have also enabled small improvements and partial solutions to seven other problems that don\u2019t appear to be pre-existing in the literature.<\/p>\n<p>As a result, there is an ongoing debate about whether these tools are really proving new ideas, or merely digging out old and forgotten solutions. Bloom points out that the AI models often have to translate the problems into new forms, and are discovering papers that make no mention of Erd\u0151s. \u201cA lot of these papers, I wouldn\u2019t have found, and maybe nobody would have found for a lot longer without this sort of [use of] the AI tool,\u201d he says.<\/p>\n<p>Another question is just how far this approach can go. None of these problems is among the most demanding in mathematics, and each could perhaps be solved by a first-year PhD student, but that is still impressive, says Bloom. \u201cTo me, it\u2019s incredible that AI is capable of that, because this takes non-trivial effort.\u201d<\/p>\n<p>Barreto also says that the problems being solved are relatively straightforward, even when compared with more difficult Erd\u0151s problems, which current AI models fall short of solving. \u201cOnce [AI] gets through the low-hanging fruit problems, a lot of them are going to need more capable models,\u201d he says. 
Some of the hardest problems have prize money set aside for anyone who can solve them, but Barreto thinks AI is unlikely to claim any of those prizes soon: \u201cSome people are trying to do bounty problems, and to me that\u2019s kind of nuts. I don\u2019t think the models are there yet.\u201d<\/p>\n<p>Solving Erd\u0151s problems using AI is promising progress, says <a href=\"https:\/\/profiles.imperial.ac.uk\/k.buzzard\" rel=\"nofollow noopener\" target=\"_blank\">Kevin Buzzard<\/a> at Imperial College London, but because most of the problems AI is solving are either relatively straightforward or have had little attention, it is hard to gauge whether this is a significant achievement \u2013 or something that should concern professionals. \u201cThat is progress, but mathematicians aren\u2019t going to be looking over their shoulders just yet,\u201d says Buzzard. \u201cIt\u2019s green shoots.\u201d<\/p>\n<p>But even if the models\u2019 capability stays static, their ability to handle relatively complex mathematics could fundamentally change how mathematicians research and write proofs, says Bloom, because it will allow those who have limited knowledge of areas outside their particular discipline to draw on other fields.<\/p>\n<p>\u201cAlmost nobody knows every part of math, and that means that we\u2019re quite limited in the sets of tools that we can use,\u201d says Bloom. \u201cThe fact that you can just get an answer instantly, without having to bother another human, without having to waste months learning potentially useless knowledge, opens up so many connections. 
That\u2019s going to be a huge change that we\u2019ll see, just increasing the breadth of research that\u2019s done.\u201d<\/p>\n<p>This could also allow mathematicians to practise an entirely new way of working, says <a href=\"https:\/\/scholar.google.com\/citations?user=TFx_gLQAAAAJ&amp;hl=en\" rel=\"nofollow noopener\" target=\"_blank\">Terence Tao<\/a> at the University of California, Los Angeles, who has helped validate some of the AI-assisted Erd\u0151s problem solutions.<\/p>\n<p>Mathematicians often focus on a small number of difficult problems because of limited time, while many less difficult but still important problems don\u2019t get much attention. If AI tools can be applied to them all at once, it could lead to a more empirical, scientific way of doing mathematics, says Tao, in which different ways of solving a problem could be tested on a large scale.<\/p>\n<p>\u201cWe are just so resource-limited by how much expert attention we have, that we don\u2019t look at 99 per cent of all the problems that we could be studying,\u201d says Tao. \u201cSo we don\u2019t do things like survey hundreds of problems, trying to find one or two really interesting ones, or do statistical studies like, we have two different methods, which one is better?<\/p>\n<p>\u201cThis is a type of mathematics that just isn\u2019t done,\u201d he says. 
\u201cWe don\u2019t do large-scale mathematics because we don\u2019t have the intellectual resources, but AI is showing that you can.\u201d<\/p>\n<p class=\"ArticleTopics__Heading\">Topics:<\/p>\n<p><a class=\"ArticleTopics__ListItemLink\" href=\"https:\/\/www.newscientist.com\/article-topic\/artificial-intelligence\/\" rel=\"nofollow noopener\" target=\"_blank\">artificial intelligence<\/a>\/<a class=\"ArticleTopics__ListItemLink\" href=\"https:\/\/www.newscientist.com\/article-topic\/chatgpt\/\" rel=\"nofollow noopener\" target=\"_blank\">ChatGPT<\/a>                <\/p>\n","protected":false},"excerpt":{"rendered":"AI tools are helping to decipher long-standing maths problems andresr\/Getty Images Amateur mathematicians are using artificial intelligence chatbots&hellip;\n","protected":false},"author":2,"featured_media":373941,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[554,733,4308,1921,86,56,54,55],"class_list":{"0":"post-373940","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-chatgpt","12":"tag-technology","13":"tag-uk","14":"tag-united-kingdom","15":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/373940","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=373940"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/373940\/revisions"}],"wp:featuredmedia":[{"embe
ddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/373941"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=373940"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=373940"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=373940"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}