{"id":425750,"date":"2026-01-22T11:46:33","date_gmt":"2026-01-22T11:46:33","guid":{"rendered":"https:\/\/www.newsbeep.com\/ca\/425750\/"},"modified":"2026-01-22T11:46:33","modified_gmt":"2026-01-22T11:46:33","slug":"neurips-research-papers-contained-100-ai-hallucinated-citations-new-report-claims","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ca\/425750\/","title":{"rendered":"NeurIPS research papers contained 100+ AI-hallucinated citations, new report claims"},"content":{"rendered":"<p>NeurIPS, one of the world\u2019s most prestigious AI research conferences, held its 39th annual meeting in San Diego in December, drawing tens of thousands of submissions and participants. What was once a largely academic gathering has become a prime hunting ground for top AI labs, where a strong showing can translate directly into job offers. Researchers whose papers are accepted for live presentation are considered among the field\u2019s elite. <\/p>\n<p>Yet Canadian startup GPTZero <a aria-label=\"Go to https:\/\/gptzero.me\/news\/neurips\/\" href=\"https:\/\/gptzero.me\/news\/neurips\/\" rel=\"nofollow noopener\" target=\"_blank\">analyzed more than 4,000 research papers<\/a> accepted and presented at NeurIPS (Neural Information Processing Systems) 2025 and says it uncovered hundreds of AI-hallucinated citations that slipped past the three or more reviewers assigned to each submission, spanning at least 53 papers in total. The hallucinations had not previously been reported.<\/p>\n<p>From fully made-up citations to subtle changes<\/p>\n<p>In some cases, an AI model blended or paraphrased elements from multiple real papers, including believable-sounding titles and author lists, the company says. Others appeared to be fully made up: a nonexistent author, a fabricated paper title, a fake journal or conference, or a URL that leads nowhere.<\/p>\n<p>In other cases, the model started from a real paper but made subtle changes\u2014expanding an author\u2019s initials into a guessed first name, dropping or adding coauthors, or paraphrasing the title. Some, however, are plainly wrong\u2014citing \u201cJohn Smith\u201d and \u201cJane Doe\u201d as authors, for example.<\/p>\n<p>When reached for comment, the NeurIPS board shared the following statement: \u201cThe usage of LLMs in papers at AI conferences is rapidly evolving, and NeurIPS is actively monitoring developments. In previous years, we piloted policies regarding the use of LLMs, and in 2025, reviewers were instructed to flag hallucinations. Regarding the findings of this specific work, we emphasize that significantly more effort is required to determine the implications. Even if 1.1% of the papers have one or more incorrect references due to the use of LLMs, the content of the papers themselves are not necessarily invalidated. For example, authors may have given an LLM a partial description of a citation and asked the LLM to produce bibtex (a formatted reference). As always, NeurIPS is committed to evolving the review and authorship process to best ensure scientific rigor and to identify ways that LLMs can be used to enhance author and reviewer capabilities.\u201d<\/p>\n<p>Edward Tian, cofounder and CEO of GPTZero, which was founded in January 2023 and <a aria-label=\"Go to https:\/\/techcrunch.com\/2024\/06\/13\/gptzero-profitable-ai-detection-startup-10m-series-a\/\" href=\"https:\/\/techcrunch.com\/2024\/06\/13\/gptzero-profitable-ai-detection-startup-10m-series-a\/\" rel=\"nofollow noopener\" target=\"_blank\">raised<\/a> a $10 million Series A round in 2024, told Fortune the NeurIPS analysis came just weeks after the company uncovered 50 hallucinated citations in papers under review for another top AI research conference, ICLR, which will be held in Rio de Janeiro in April. In that case, the papers had not yet been accepted\u2014but the bogus citations had already slipped past peer reviewers. Tian said the ICLR conference has hired the company to check future submissions for fabricated citations during peer review.<\/p>\n<p>Errors appeared in papers accepted and presented at NeurIPS<\/p>\n<p>According to Tian, the NeurIPS findings are even more troubling because the errors appear in papers that were accepted by the conference. In the academic world of AI, \u201cpublish or perish\u201d is more than a clich\u00e9: Hiring and tenure often hinge on accumulating peer-reviewed publications. Yet under long-standing academic norms, even a single fabricated citation would, in principle, be grounds for rejection. References are meant to anchor a paper in the existing body of research\u2014and to demonstrate that its authors have actually read and engaged with the work they cite. <\/p>\n<p>\u201cIt\u2019s definitely a bigger escalation in the sense that these were the first documented cases of hallucinated citations entering the official record of the top machine learning conference,\u201d Tian said, pointing out that since NeurIPS 2025 had an acceptance rate for main track papers of <a aria-label=\"Go to https:\/\/blog.neurips.cc\/2025\/09\/30\/reflections-on-the-2025-review-process-from-the-program-committee-chairs\/#:~:text=That%20is%2C%20the%20main%20track%20had%20an%20acceptance%20rate%20of%2024.52%25%2C%20on%20par%20with%20that%20of%20previous%20years.\" href=\"https:\/\/blog.neurips.cc\/2025\/09\/30\/reflections-on-the-2025-review-process-from-the-program-committee-chairs\/#:~:text=That%2520is%252C%2520the%2520main%2520track%2520had%2520an%2520acceptance%2520rate%2520of%252024.52%2525%252C%2520on%2520par%2520with%2520that%2520of%2520previous%2520years.\" rel=\"nofollow noopener\" target=\"_blank\">24.52%<\/a>, each of these papers beat out 15,000 other papers despite containing one or more hallucinations. \u201cThese survived peer review, and were published in the final conference proceeding,\u201d he said. \u201cSo it\u2019s definitely a big moment.\u201d\u00a0<\/p>\n<p>Around half of the papers with hallucinated citations were papers that were likely to be AI-generated themselves or had a high amount of AI use, he added. \u201cBut what we were really focused on in this investigation is the citations themselves,\u201d he said.\u00a0AI detection tools have often been criticized for false positives in attempting to identify machine-written text. But Tian argued that hallucination detection is a different class of problem, with GPTZero\u2019s tool checking verifiable facts\u2014searching the open web and academic databases to confirm whether a cited paper actually exists. The company says the tool is more than 99% accurate, and for the NeurIPS analysis, every flagged citation was also reviewed by a human expert on GPTZero\u2019s machine-learning team.<\/p>\n<p>Alex Cui, Tian\u2019s cofounder and chief technology officer, said that GPTZero\u2019s hallucination checker tool ingests a paper and then searches across the open web and academic databases to verify each citation\u2014its authors, title, publication venue, and link. If a reference can\u2019t be found, or if it only partially matches a real paper, the system flags it. That\u2019s how it catches cases where an AI model starts from a real paper but adds authors who don\u2019t exist, alters the title, or invents a publication.\u00a0<\/p>\n<p>\u201cSometimes, even when there is a match, you\u2019ll find that they added like five authors who don\u2019t exist to a real paper, so these are mistakes that no human would reasonably make,\u201d he explained. For the NeurIPS investigation, after the automated scan, a member of GPTZero\u2019s machine-learning team manually verified every flagged citation, ensuring the findings aren\u2019t themselves false positives.<\/p>\n<p>The sheer volume of papers makes deep scrutiny difficult<\/p>\n<p>A big part of the challenge is sheer scale. In 2025, the main NeurIPS research track received 21,575 valid submissions\u2014up from 15,671 in 2024 and 12,343 in 2023. Even with thousands of volunteer reviewers, that volume makes deep scrutiny of every paper and its references increasingly difficult. <\/p>\n<p>But while AI has a part in that by making it dramatically easier to churn out conference submissions, Tian said, a flawed paper still carries real reputational risk\u2014for the authors, for the conference that accepted it, and for the companies that hire researchers based on those credentials. \u00a0<\/p>\n<p>That\u2019s particularly true for citations, he said, because in modern AI research, citations are a core part of how the field tries to solve issues of reproducibility. \u201cAI results are notoriously hard to reproduce, so citations are important,\u201d he said, to \u201cdraw the line between whether that result was reproducible or not,\u201d by letting other researchers trace a result back to something concrete and testable. Hallucinated citations, on the other hand,\u00a0send readers to something that doesn\u2019t exist.\u00a0<\/p>\n","protected":false},"excerpt":{"rendered":"NeurIPS, one of the world\u2019s most prestigious AI research conferences, held its 39th annual meeting in San Diego&hellip;\n","protected":false},"author":2,"featured_media":425751,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[5126,62,276,277,49,48,994,61],"class_list":{"0":"post-425750","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-academic-research","9":"tag-ai","10":"tag-artificial-intelligence","11":"tag-artificialintelligence","12":"tag-ca","13":"tag-canada","14":"tag-research","15":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/425750","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/comments?post=425750"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/425750\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media\/425751"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media?parent=425750"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/categories?post=425750"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/tags?post=425750"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}