{"id":250614,"date":"2025-11-08T04:39:10","date_gmt":"2025-11-08T04:39:10","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/250614\/"},"modified":"2025-11-08T04:39:10","modified_gmt":"2025-11-08T04:39:10","slug":"kosmos-ai-scientist-claimed-to-do-six-months-of-research-in-just-a-few-hours","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/250614\/","title":{"rendered":"Kosmos: AI scientist claimed to do six months of research in just a few hours"},"content":{"rendered":"<p><img decoding=\"async\" class=\"Image\" alt=\"\" width=\"1349\" height=\"900\" src=\"https:\/\/www.newsbeep.com\/uk\/wp-content\/uploads\/2025\/11\/SEI_273417463.jpg\"   loading=\"eager\" fetchpriority=\"high\" data-image-context=\"Article\" data-image-id=\"2503526\" data-caption=\"Artificial intelligence can process large amounts of data, but can it do science?\" data-credit=\"tonioyumui\/Getty\"\/><\/p>\n<p class=\"ArticleImageCaption__Title\">Artificial intelligence can process large amounts of data, but can it do science?<\/p>\n<p class=\"ArticleImageCaption__Credit\">tonioyumui\/Getty<\/p>\n<\/p>\n<p>An AI scientist can work independently for hours while doing research that would take humans months to complete, and has made several \u201cnovel contributions\u201d to science, its creators claim \u2013 but others are more doubtful.<\/p>\n<p>The system, called Kosmos, is actually a collection of AI agents that are specialised in analysing data and searching through the existing scientific literature, in an effort to make new scientific breakthroughs.<\/p>\n<p>\u201cWe\u2019ve been working on building an AI scientist for about two years now,\u201d says <a href=\"https:\/\/www.sam-rodriques.com\/\" rel=\"nofollow noopener\" target=\"_blank\">Sam Rodriques<\/a> at Edison Scientific, the US-based firm behind Kosmos. \u201cAnd the limitation with AI scientists that have been released to date is always in kind of the complexity of the ideas that they can come up with.\u201d<\/p>\n<p>Kosmos aims to fix that. During a typical run, which can last up to 12 hours, a user inputs a scientific dataset and Kosmos searches for and analyses around 1500 relevant academic papers, while also writing and executing 42,000 lines of code to interrogate the data. At the end of a run, the AI produces a summary of findings, plus citations or data, and creates a plan for further analysis that can be used as the input for another cycle.<\/p>\n<p>After a set number of cycles, the system outputs reports, backed with relevant citations, that make scientific conclusions, similar to an academic paper. An evaluation by a group of academics found that 20 cycles of this would be equivalent to around six months of their own research time.<\/p>\n<p>The system\u2019s conclusions seem broadly accurate, says Rodriques. Edison asked people with at least a PhD-level understanding of biology to evaluate 102 statements made by Kosmos. The team found that 79.4 per cent of them were supported overall, including 85.5 per cent of claims related to data analysis claims and 82.1 per cent of the statements it says are in the existing literature. Kosmos is weaker at drawing all of this together to make new claims of scientific breakthroughs, however: here, it is accurate only 57.9 per cent of the time.<\/p>\n<p>Edison claims that Kosmos has made seven scientific discoveries that have all been externally validated and replicated by independent experts in the field using external datasets or different methods. Four of the discoveries were truly novel, the team behind Kosmos say, with the remaining three already existing \u2013 albeit in preprint or unpublished papers.<\/p>\n<p>One of the claimed discoveries is a new method to pinpoint when cellular pathways fail as Alzheimer\u2019s disease progresses. Another is of evidence that people with more of a natural antioxidant enzyme in their blood called superoxide dismutase 2 (SOD2) seem to have less heart scarring.<\/p>\n<p>But others working in this field have mixed responses to these claims. The SOD2 \u201cdiscovery\u201d is nothing of the sort, says <a href=\"https:\/\/www.bristol.ac.uk\/people\/person\/Fergus-Hamilton-15f9f3ca-311e-44fe-8361-35f7b56a54e4\/\" rel=\"nofollow noopener\" target=\"_blank\">Fergus Hamilton<\/a> at the University of Bristol, UK. \u201cThat particular causal claim probably doesn\u2019t stand up to scrutiny as a novel finding, and there are methodological flaws in the way the analysis performed,\u201d he says. Rodriques acknowledges that the SOD2 discovery had previously been found in mice, but says a subject matter expert working with Edison suggests it is the first time it has been seen at a population level in humans using genomics.<\/p>\n<p>Hamilton also says the data analysis code that the agent tried to run didn\u2019t work properly, so Kosmos ignored what would be important data \u2013 but still came to the same conclusion as pre-existing work.<\/p>\n<p>\u201cIt made a number of assumptions that would be really critical to get right in an actual bit of analysis,\u201d he says. \u201cThe software packages completely fail, and then it just ignores them.\u201d In addition, he suggests that the data has been so pre-processed in this instance that Kosmos \u201cactually has completed probably 10 per cent of the task\u201d.<\/p>\n<p>Hamilton does credit the team behind Kosmos for <a href=\"https:\/\/x.com\/SGRodriques\/status\/1986203077473718294\" rel=\"nofollow\">engaging with his queries<\/a> and concerns on social media. \u201cThis is a really good advance in principle, but perhaps the particular technical critique of this work is [that] the work is not up to scratch,\u201d he says.<\/p>\n<p>\u201cI\u2019m very open to the idea that some of the findings that we presented could be wrong or flawed, and this is just part of science,\u201d says Rodriques. \u201cThe fact that it\u2019s eliciting such kind of sophisticated criticism, though, I think, speaks to the power of the system.\u201d<\/p>\n<p>Others are impressed by the general performance of Kosmos. \u201cIt demonstrates the great potential for AI to support scientific discovery, but I would remain careful about the autonomous use of an AI scientist,\u201d says <a href=\"https:\/\/profiles.imperial.ac.uk\/b.glocker\" rel=\"nofollow noopener\" target=\"_blank\">Ben Glocker<\/a> at Imperial College London. \u201cThe work shows some great examples of success, but we have little insights about its failure modes.\u201d<\/p>\n<p>\u201cI believe\u00a0we should embrace tools like Kosmos and develop others like it, but we should not overlook that there is more to science than this data-driven method,\u201d says <a href=\"https:\/\/faculty.bentley.edu\/profile\/ngiansiracusa\" rel=\"nofollow noopener\" target=\"_blank\">Noah Giansiracusa<\/a> at Bentley University in Massachusetts. \u201cThere is also deep thinking, deep creativity, and it would be folly to turn away from that just because the science we can automate is more amenable to AI.\u201d<\/p>\n<p>Rodriques himself admits that Kosmos should be used as a collaborator, not a replacement for scientists. \u201cIt can do a lot of very, very impressive things,\u201d he says. \u201cYou still need to go through and read and validate. And it\u2019s not going to be right 100 per cent of the time.\u201d<\/p>\n<p class=\"ArticleTopics__Heading\">Topics:<\/p>\n","protected":false},"excerpt":{"rendered":"Artificial intelligence can process large amounts of data, but can it do science? tonioyumui\/Getty An AI scientist can&hellip;\n","protected":false},"author":2,"featured_media":250615,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[554,733,4308,86,56,54,55],"class_list":{"0":"post-250614","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-technology","12":"tag-uk","13":"tag-united-kingdom","14":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/250614","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=250614"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/250614\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/250615"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=250614"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=250614"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=250614"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}