{"id":459349,"date":"2026-02-10T03:03:07","date_gmt":"2026-02-10T03:03:07","guid":{"rendered":"https:\/\/www.newsbeep.com\/us\/459349\/"},"modified":"2026-02-10T03:03:07","modified_gmt":"2026-02-10T03:03:07","slug":"openai-wins-key-discovery-battle-against-authors-in-ai-lawsuits","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/us\/459349\/","title":{"rendered":"OpenAI Wins Key Discovery Battle Against Authors in AI Lawsuits"},"content":{"rendered":"<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tIt was a highly contentious ruling, one that could\u2019ve changed the course of not only this case but others against <a href=\"https:\/\/www.hollywoodreporter.com\/t\/ai-3\/\" id=\"auto-tag_ai-3_1\" data-tag=\"ai-3\" rel=\"nofollow noopener\" target=\"_blank\">AI<\/a> companies. The court had just ruled that <a href=\"https:\/\/www.hollywoodreporter.com\/t\/openai\/\" id=\"auto-tag_openai_1\" data-tag=\"openai\" rel=\"nofollow noopener\" target=\"_blank\">OpenAI<\/a> waived attorney-client privilege by denying allegations that it knowingly infringed upon the copyrights of authors whose books it illegally downloaded. The finding opened the door to internal communications behind the company\u2019s deletion of two huge datasets of pirated books, potentially exposing it to massive damages. Its in-house legal team was set to be deposed.<\/p>\n<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tOpenAI immediately appealed the ruling, bringing on Lisa Blatt, a veteran of the Supreme Court bar who counts Google, Bank of America and Starbucks as clients. In her brief, she delivered a dire warning: the decision, if allowed to stand, will eviscerate assertions of privilege in any copyright case involving what\u2019s known as state of mind, an analysis used to determine whether a defendant intentionally committed infringement or was unaware of doing so.<\/p>\n<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tOn Friday, OpenAI prevailed in its bid to reverse the court\u2019s ruling. At stake was billions of dollars. The communications could\u2019ve helped prove willful infringement, which triggers damages as high as $150,000 per work opposed to just $200. And perhaps even more importantly, the decision threatened to provide a pathway for those suing AI companies to obtain evidence typically considered to be privileged information.<\/p>\n<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tThe issue has been a key battleground in discovery. It relates to an OpenAI employee in 2018 downloading pirated copies of books and using them to create two datasets, known as \u201cbooks 1\u201d and \u201cbooks 2,\u201d to train two discontinued GPT models. After initially telling the court that the datasets were deleted in 2022 \u201cdue to their non-use,\u201d the company has since maintained that information over the reasons for their erasure are privileged. Lawyers representing authors and publishers have called foul play.<\/p>\n<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tIn November, Magistrate Judge Ona Wang found that OpenAI must hand over evidence revealing the company\u2019s motivations for deleting the datasets. She concluded that the company opened the door to the privileged material when it disclosed that \u201cbooks 1\u201d and \u201cbooks 2\u201d were deleted because of \u201cnon-use,\u201d though it was another part of her order that attracted the attention of the copyright bar. Wang reasoned that the company effectively waived attorney-client privilege by denying allegations of willful infringement, which she said puts the company\u2019s state of mind under the court spotlight. For \u201cOpenAI to deny that it willfully infringed class plaintiffs\u2019 copyrighted works is to argue that it acted in good faith,\u201d per the order.<\/p>\n<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tRuling against authors and publishers, including Sarah Silverman, in Friday\u2019s order, U.S. District Judge Sidney Stein stressed that the denial of willful infringement claims isn\u2019t equivalent to advancing a good faith defense. This puts discovery over the reasons OpenAI erased the datasets in 2022 squarely off the table, she said. \u201cThere is a distinction between a copyright defendant merely denying allegations of willfulness \u2014 on which a plaintiff bears the burden of proof \u2014 and a copyright defendant affirmatively asserting its good faith belief that its actions were lawful,\u201d Stein wrote.<\/p>\n<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tWith the reversal, the discovery battle becomes a precedential detour in the case. Although it was reversed, the argument was a crafty one by lawyers representing authors, led by Justin Nelson and Craig Smyser of Susman Godfrey, the firm that negotiated the $1.5 billion Anthropic settlement. If the ruling was allowed to stand, AI companies would\u2019ve borne the burden of proving they didn\u2019t intend to violate copyright law anytime they deny allegations that they knowingly infringed on copyrighted works.<\/p>\n<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tAlso discussed in the reversal was whether OpenAI revealed privileged information when it said that the datasets were deleted \u201cdue to non-use.\u201d On this issue, Stein said that the assertion doesn\u2019t represent legal advice, meaning that they can\u2019t be used as a basis for finding that OpenAI waived privilege.<\/p>\n<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tDespite the loss on the discovery battleground, the authors\u2019 lawyers are gaining ground on what\u2019s increasingly looking like a winning argument over the practice of pirating books from shadow library. That theory has changed over the course of AI litigation. At first, lawyers for the authors directly connected the piracy to OpenAI\u2019s training of its models under a single umbrella. But later, they separated the theories and alleged that the distinct act of illegally downloading the works, regardless of whether they were used, constitutes copyright infringement.<\/p>\n<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tThe move takes advantage of the one win for authors in another AI copyright case, this one initiated by Andrea Bartz against Anthropic, related to the company illegally downloading millions of books and storing them in a central library. The decision heavily leaned in favor of Anthropic, but the court greenlit that theory, which is now a part of the OpenAI case, for trial. \u201cThat Anthropic later bought a copy of a book it earlier stole off the internet will not absolve it of liability for the theft,\u201d wrote U.S. District Judge William Alsup. After the ruling, Anthropic agreed to pay $1.5 billion to settle the lawsuit.<\/p>\n<p class=\"paragraph larva \/\/  a-font-body-m     \">\n\tIt remains largely unknown what today\u2019s AI systems are trained on. OpenAI used \u201cbooks 1\u201d and \u201cbooks 2\u201d \u2014 downloaded from LibGen, a shadow library website \u2014 to train old versions of GPT but later deleted the datasets. Still, billions of dollars were raised by AI firms largely because of models they trained on pirated books.<\/p>\n","protected":false},"excerpt":{"rendered":"It was a highly contentious ruling, one that could\u2019ve changed the course of not only this case but&hellip;\n","protected":false},"author":2,"featured_media":459350,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[45],"tags":[182,181,507,1283,74],"class_list":{"0":"post-459349","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-openai","12":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/459349","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/comments?post=459349"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/459349\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media\/459350"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media?parent=459349"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/categories?post=459349"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/tags?post=459349"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}