{"id":378821,"date":"2026-01-19T18:32:09","date_gmt":"2026-01-19T18:32:09","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/378821\/"},"modified":"2026-01-19T18:32:09","modified_gmt":"2026-01-19T18:32:09","slug":"nvidia-contacted-annas-archive-to-secure-access-to-millions-of-pirated-books-torrentfreak","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/378821\/","title":{"rendered":"NVIDIA Contacted Anna\u2019s Archive to Secure Access to Millions of Pirated Books * TorrentFreak"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.newsbeep.com\/uk\/wp-content\/uploads\/2026\/01\/nvidia-logo.jpg\" alt=\"nvidia logo\" width=\"300\" height=\"197\" class=\"alignright size-full wp-image-248322\"  \/>Chip giant NVIDIA has been one of the main financial beneficiaries in the artificial intelligence boom. <\/p>\n<p>Revenue surged due to high demand for its AI-learning chips and data center services, and the end doesn\u2019t appear to be in sight.  <\/p>\n<p>Besides selling the most sought-after hardware, NVIDIA is also developing its own models, including NeMo, Retro-48B, InstructRetro, and Megatron. These are trained using their own hardware and with help from large text libraries, much like other tech giants do. <\/p>\n<p>Authors Sue NVIDIA for Copyright Infringement<\/p>\n<p>Like other tech companies, NVIDIA has also seen significant legal pushback from copyright holders in response to its training methods. This includes authors, who, in various lawsuits, accused tech companies of training their models on pirated books.<\/p>\n<p>In early 2024, for example, several authors <a href=\"https:\/\/torrentfreak.com\/authors-sue-nvidia-for-training-ai-on-pirated-books-240311\/\" rel=\"nofollow noopener\" target=\"_blank\">sued NVIDIA<\/a> over alleged copyright infringement. 
<\/p>\n<p>Through the class action lawsuit, they claimed that the company\u2019s AI models were trained on the Books3 dataset, which included copyrighted works taken from the \u2018pirate\u2019 site Bibliotik. Since this happened without permission, the authors demanded compensation. <\/p>\n<p>In response, NVIDIA <a href=\"https:\/\/torrentfreak.com\/nvidia-copyrighted-books-are-just-statistical-correlations-to-our-ai-models-240617\/\" rel=\"nofollow noopener\" target=\"_blank\">defended its actions<\/a> as fair use, noting that books are nothing more than statistical correlations to its AI models. However, the allegations didn\u2019t go away. On the contrary, the plaintiffs found more evidence during discovery. <\/p>\n<p>\u2018NVIDIA Contacted Anna\u2019s Archive\u2019<\/p>\n<p>Last Friday, the authors filed an amended complaint that significantly expands the scope of the lawsuit. In addition to adding more books, authors, and AI models, it also includes broader \u201cshadow library\u201d claims and allegations. <\/p>\n<p>The authors, including <a href=\"https:\/\/en.wikipedia.org\/wiki\/Abdi_Nazemian\" rel=\"nofollow noopener\" target=\"_blank\">Abdi Nazemian<\/a>, now cite various internal NVIDIA emails and documents, suggesting that the company willingly downloaded millions of copyrighted books. 
<\/p>\n<p>The new complaint alleges that \u201ccompetitive pressures drove NVIDIA to piracy\u201d, which allegedly included collaborating with the controversial Anna\u2019s Archive library.<\/p>\n<p>Competitive pressures<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.newsbeep.com\/uk\/wp-content\/uploads\/2026\/01\/competat.png\" alt=\"pressure\" width=\"600\" height=\"168\"  \/><\/p>\n<p>According to the amended complaint, a member of NVIDIA\u2019s data strategy team reached out to Anna\u2019s Archive to find out what the pirate library could offer the trillion-dollar company.<\/p>\n<p>\u201cDesperate for books, NVIDIA contacted Anna\u2019s Archive\u2014the largest and most brazen of the remaining shadow libraries\u2014about acquiring its millions of pirated materials and \u2018including Anna\u2019s Archive in pre-training data for our LLMs\u2019,\u201d the complaint notes. <\/p>\n<p>\u201cBecause Anna\u2019s Archive charged tens of thousands of dollars for \u2018high-speed access\u2019 to its pirated collections [\u2026] NVIDIA sought to find out what \u201chigh-speed access\u201d to the data would look like.\u201d<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.newsbeep.com\/uk\/wp-content\/uploads\/2026\/01\/allegdata.png\" alt=\"what data?\" width=\"600\" height=\"135\"  \/><\/p>\n<p>Anna\u2019s Archive Points Out Legal \u2018Concern\u2019<\/p>\n<p>According to the complaint, Anna\u2019s Archive then warned NVIDIA that its library was illegally acquired and maintained. Because the site had previously wasted time on other AI companies, the pirate library asked NVIDIA executives if they had internal permission to move forward. <\/p>\n<p>This permission was allegedly granted within a week, after which Anna\u2019s Archive provided the chip giant with access to its pirated books. 
<\/p>\n<p>\u201cWithin a week of contacting Anna\u2019s Archive, and days after being warned by Anna\u2019s Archive of the illegal nature of their collections, NVIDIA management gave \u2018the green light\u2019 to proceed with the piracy. Anna\u2019s Archive offered NVIDIA millions of pirated copyrighted books.\u201d<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.newsbeep.com\/uk\/wp-content\/uploads\/2026\/01\/green-lght.png\" alt=\"green light\" width=\"600\" height=\"240\"  \/><\/p>\n<p>The complaint states that Anna\u2019s Archive promised to provide NVIDIA with access to roughly 500 terabytes of data. This included millions of books that are usually only accessible through Internet Archive\u2019s digital lending system, which itself has been <a href=\"https:\/\/torrentfreak.com\/internet-archive-loses-landmark-e-book-lending-copyright-appeal-against-publishers-240905\/\" rel=\"nofollow noopener\" target=\"_blank\">targeted in court<\/a>. <\/p>\n<p>The complaint does not explicitly mention whether NVIDIA ended up paying Anna\u2019s Archive for access to the data. <\/p>\n<p>NVIDIA also stands accused of using other pirated sources. In addition to the previously included Books3 dataset, the new complaint alleges that the company downloaded books from LibGen, Sci-Hub, and Z-Library.<\/p>\n<p>Direct and Vicarious Copyright Infringement<\/p>\n<p>In addition to downloading and using pirated books for its own AI training, the authors allege NVIDIA distributed scripts and tools that allowed its corporate customers to automatically download \u201c<a href=\"https:\/\/en.wikipedia.org\/wiki\/The_Pile_(dataset)\" rel=\"nofollow noopener\" target=\"_blank\">The Pile<\/a>\u201d, which contains the Books3 pirated dataset. 
<\/p>\n<p>These allegations lead to new claims of vicarious and contributory infringement, alleging that NVIDIA generated revenue from customers by facilitating access to these pirated datasets.<\/p>\n<p>Based on these and other claims, the authors request to be compensated for the damages they suffered. This applies to the named authors, but also to potentially hundreds of others who may later join the class action lawsuit. <\/p>\n<p>As far as we know, this is the first time that correspondence between a major U.S. tech company and Anna\u2019s Archive has been revealed in public. This will only further raise the profile of the pirate library, which just <a href=\"https:\/\/torrentfreak.com\/u-s-court-order-against-annas-archive-spells-more-trouble-for-the-site\/\" rel=\"nofollow noopener\" target=\"_blank\">lost several domain names<\/a>. <\/p>\n<p>\u2014<\/p>\n<p>A copy of the first consolidated and amended complaint, filed at the U.S. District Court for the Northern District of California, is available <a href=\"https:\/\/torrentfreak.com\/images\/naznvid-amend.pdf\" rel=\"nofollow noopener\" target=\"_blank\">here (pdf)<\/a>. The named authors include Abdi Nazemian, Brian Keene, Stewart O\u2019Nan, Andre Dubus III, and Susan Orlean.<\/p>\n","protected":false},"excerpt":{"rendered":"Chip giant NVIDIA has been one of the main financial beneficiaries in the artificial intelligence boom. 
Revenue surged&hellip;\n","protected":false},"author":2,"featured_media":378822,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[554,733,4308,144036,3352,86,56,54,55],"class_list":{"0":"post-378821","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-books3","12":"tag-nvidia","13":"tag-technology","14":"tag-uk","15":"tag-united-kingdom","16":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/378821","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=378821"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/378821\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/378822"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=378821"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=378821"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=378821"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}