{"id":113261,"date":"2025-10-31T01:39:19","date_gmt":"2025-10-31T01:39:19","guid":{"rendered":"https:\/\/www.newsbeep.com\/ie\/113261\/"},"modified":"2025-10-31T01:39:19","modified_gmt":"2025-10-31T01:39:19","slug":"hundreds-of-thousands-of-videos-from-news-publishers-like-the-new-york-times-and-vox-were-used-to-train-ai-models","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ie\/113261\/","title":{"rendered":"Hundreds of thousands of videos from news publishers like The New York Times and Vox were used to train AI models"},"content":{"rendered":"<p>Last month, The Atlantic dropped the latest investigation in its ongoing <a href=\"https:\/\/www.theatlantic.com\/category\/ai-watchdog\/\" rel=\"nofollow noopener\" target=\"_blank\">series on generative AI training data sets<\/a>. Staff writer Alex Reisner found that <a href=\"https:\/\/www.theatlantic.com\/technology\/archive\/2025\/09\/youtube-ai-training-data-sets\/684116\/?gift=iWa_iB9lkw4UuiWbIbrWGVeFl48M5hyx4gfG8GjzT5E&amp;\" rel=\"nofollow noopener\" target=\"_blank\">at least 15 million YouTube videos<\/a> had been used for training data by major technology companies, either for research or, in some cases, to build AI video products.<\/p>\n<p>The Atlantic\u2019s reporting focused over a dozen prominent training data sets that were either compiled or used by companies including Microsoft, Meta, Snap, Tencent, Runway AI, and ByteDance. The investigation shows how the unauthorized use of YouTube videos has been an essential contributor to <a href=\"https:\/\/www.nytimes.com\/2025\/10\/09\/technology\/personaltech\/sora-ai-video-impact.html\" rel=\"nofollow noopener\" target=\"_blank\">the AI industry\u2019s recent leap forward<\/a> in AI video generation quality.<\/p>\n<p>\u201cMuch as ChatGPT couldn\u2019t write like Shakespeare without first \u2018reading\u2019 Shakespeare, a video generator couldn\u2019t construct a fake newscast without \u2018watching\u2019 tons of recorded broadcasts,\u201d writes Reisner.<\/p>\n<p>The Atlantic\u2019s story briefly mentions that more than 30,000 videos from the BBC were among the training data, alongside other YouTube channels focused on news. Using a <a href=\"https:\/\/www.theatlantic.com\/technology\/archive\/2025\/09\/search-youtube-videos-generative-ai\/684158\/\" rel=\"nofollow noopener\" target=\"_blank\">searchable database published by The Atlantic<\/a>, I wanted to better understand the scale at which news channels had been targeted. In the same data sets, I found hundreds of thousands of videos that were taken from some of the most popular news publishers and news creators on YouTube, including The New York Times, The Washington Post, The Guardian, Al Jazeera, and The Wall Street Journal.<\/p>\n<p><a class=\"embed-relatedstory-imagelink\" href=\"https:\/\/www.niemanlab.org\/2025\/06\/what-it-takes-to-sue-openai-as-a-journalism-nonprofit\/?relatedstory\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" width=\"315\" height=\"177\" src=\"https:\/\/www.newsbeep.com\/ie\/wp-content\/uploads\/2025\/10\/CIR_Logo-25-06-2025-13-45-32-315x177.png\" class=\"embed-relatedstory-image wp-post-image\" alt=\"The Center for Investigative Reporting's logo\"   loading=\"lazy\"\/><\/a>For example, more than 88,000 videos were included from Fox News\u2019 YouTube channels, including its <a href=\"https:\/\/www.youtube.com\/@FoxNews\" rel=\"nofollow noopener\" target=\"_blank\">flagship account<\/a> and <a href=\"https:\/\/www.youtube.com\/@FoxBusiness\" rel=\"nofollow noopener\" target=\"_blank\">Fox Business<\/a>. Another roughly 70,000 videos were taken from the channels of <a href=\"https:\/\/www.youtube.com\/user\/abcnews\" rel=\"nofollow noopener\" target=\"_blank\">ABC News<\/a> and its morning show, <a href=\"https:\/\/www.youtube.com\/c\/GMA\" rel=\"nofollow noopener\" target=\"_blank\">Good Morning America<\/a>. I also found more than 55,000 videos from Bloomberg\u2019s YouTube channels, including <a href=\"https:\/\/www.youtube.com\/bloomberg\" rel=\"nofollow noopener\" target=\"_blank\">Bloomberg Originals<\/a>, <a href=\"https:\/\/www.youtube.com\/@markets\" rel=\"nofollow noopener\" target=\"_blank\">Bloomberg Television<\/a>, and <a href=\"https:\/\/www.youtube.com\/@BloombergTechnology\" rel=\"nofollow noopener\" target=\"_blank\">Bloomberg Technology<\/a>.<\/p>\n<p>Searching through Vox Media-owned YouTube channels in the database, I found more than 30,000 videos including <a href=\"https:\/\/www.youtube.com\/watch?v=-ZDZtBRTyeI\" rel=\"nofollow noopener\" target=\"_blank\">explainers from Vox<\/a>, <a href=\"https:\/\/www.youtube.com\/watch?v=yOw0sYXvbIc\" rel=\"nofollow noopener\" target=\"_blank\">travel docs from Eater<\/a>, and <a href=\"https:\/\/www.youtube.com\/watch?v=P4D-PFbmg5U\" rel=\"nofollow noopener\" target=\"_blank\">animal tearjerkers from The Dodo<\/a>. Roughly 13,900 of those videos were from The Verge\u2019s <a href=\"https:\/\/www.youtube.com\/channel\/UCddiUEpeqJcYeBxX1IVBKvQ\" rel=\"nofollow noopener\" target=\"_blank\">official YouTube channel<\/a>, including iOS <a href=\"https:\/\/www.youtube.com\/watch?v=x1Ramg73R2Y\" rel=\"nofollow noopener\" target=\"_blank\">gadget guides<\/a>, <a href=\"https:\/\/www.youtube.com\/watch?v=wxyOQom8V7o\" rel=\"nofollow noopener\" target=\"_blank\">episodes of its flagship podcast<\/a> The Vergecast, and <a href=\"https:\/\/www.youtube.com\/watch?v=gV50hpSKHFQ\" rel=\"nofollow noopener\" target=\"_blank\">interviews with Silicon Valley CEOs<\/a> like Mark Zuckerberg.<\/p>\n<p>YouTube CEO Neal Mohan has previously said that it\u2019s <a href=\"https:\/\/www.bloomberg.com\/news\/articles\/2024-04-04\/youtube-says-openai-training-sora-with-its-videos-would-break-the-rules\" rel=\"nofollow noopener\" target=\"_blank\">against the platform\u2019s terms of service<\/a> for other companies to download videos and use them for training data.<\/p>\n<p>\u201cIn order to survive, AI platforms know they need (and their consumers want) quality, credible content like ours that give their products relevance and purpose,\u201d said Lauren Starke, a spokesperson for Vox Media. \u201cThey\u2019re spending at unprecedented levels on AI infrastructure: chips, servers, and data centers that power their models. Yet when it comes to the content that makes those models useful \u2014 journalism, creative work \u2014 they\u2019ve comparatively spent next to nothing.\u201d<\/p>\n<p>In May 2024, Vox Media signed a <a href=\"https:\/\/www.voxmedia.com\/2024\/5\/29\/24166483\/vox-media-openai-strategic-content-and-product-partnership\" rel=\"nofollow noopener\" target=\"_blank\">partnership with OpenAI<\/a> for an undisclosed sum allowing the company to use its content for products like ChatGPT. Starke said Vox Media will continue to explore partnerships with AI companies that respect their work, but \u201cpursue legal remedies to protect our intellectual property, when necessary.\u201d<\/p>\n<p>\u201cWithout our quality content, the reality for these platforms will be: garbage in, garbage out,\u201d she said.<\/p>\n<p>\nThe Atlantic\u2019s database includes over a dozen distinct video training data sets, all of which have been used prominently in generative AI research and development. Some of those data sets have explicit links to commercial video generation models on the market.<\/p>\n<p>For example, I found 11,604 videos from the official YouTube channel of The New York Times across 11 different data sets in the database. Over 8,000 of these videos, though, came from a single training data source \u2014 <a href=\"https:\/\/urldefense.proofpoint.com\/v2\/url?u=https-3A__www.theatlantic.com_technology_archive_2025_09_dataset-2Drunway-2Dgen-2D3_684045_&amp;d=DwMFaQ&amp;c=WO-RGvefibhHBZq3fL85hQ&amp;r=JYk2cKZ8yN3ubnKBTeyRNQWWJmlXzct1KRkcVyjSRrI&amp;m=bmZd8-AZw1Warhl3h4NQJaW_MZEHt1itd4ftNV09-eXtbikAugyAU0HYVWZTS05z&amp;s=bfpIWBwq8nTi4EPvFlI-g3nu7fwwHWWakjqAjC9h_Ac&amp;e=\" rel=\"nofollow noopener\" target=\"_blank\">Runway Gen-3<\/a>. Compiled by Runway AI, a company that has received backing from <a href=\"https:\/\/techcrunch.com\/2023\/06\/29\/runway-a-startup-building-generative-ai-for-content-creators-raises-141m\/\" rel=\"nofollow noopener\" target=\"_blank\">Salesforce, Google, and Nvidia<\/a>, this data set was made to train its flagship video generation model. When Gen-3 was released back in June 2024, it received rave reviews and was <a href=\"https:\/\/www.axios.com\/2025\/01\/03\/openai-sora-google-veo-runway-video\" rel=\"nofollow noopener\" target=\"_blank\">compared<\/a> to earlier iterations of OpenAI\u2019s Sora and Google\u2019s Veo models. Earlier this year, <a href=\"https:\/\/www.bloomberg.com\/news\/articles\/2025-04-03\/ai-video-startup-runway-valued-at-3-billion-in-funding-round\" rel=\"nofollow noopener\" target=\"_blank\">Runway AI was valued at $3 billion<\/a>.<\/p>\n<p>Among the thousands of videos from The New York Times in Runway Gen-3, there is a <a href=\"https:\/\/www.youtube.com\/watch?v=yznRGS9f-jI\" rel=\"nofollow noopener\" target=\"_blank\">documentary<\/a> on JFK\u2019s assasination, a <a href=\"https:\/\/www.youtube.com\/watch?v=gHfWuUhrKQg&amp;feature=youtu.be\" rel=\"nofollow noopener\" target=\"_blank\">visual investigation<\/a> on the Hong Kong pro-democracy protests, a <a href=\"https:\/\/www.youtube.com\/watch?v=L-iel6-aSfo\" rel=\"nofollow noopener\" target=\"_blank\">sit-down interview<\/a> with Barack Obama, and an <a href=\"https:\/\/www.youtube.com\/watch?v=h5WjRjz5mTU\" rel=\"nofollow noopener\" target=\"_blank\">opinion column<\/a> about Russian influence operations. An additional 382 videos are taken from the NYT Cooking YouTube channel, including <a href=\"https:\/\/www.youtube.com\/watch?v=jaN3qsqXt38&amp;feature=youtu.be\" rel=\"nofollow noopener\" target=\"_blank\">viral recipes<\/a>, <a href=\"https:\/\/www.youtube.com\/watch?v=j1QImGKOd5U\" rel=\"nofollow noopener\" target=\"_blank\">how-to baking guides<\/a>, and <a href=\"https:\/\/www.youtube.com\/watch?v=jfvjw2qE6bM\" rel=\"nofollow noopener\" target=\"_blank\">shortform street food docs<\/a>. (One caveat is that Runway AI may have omitted certain videos when it ultimately trained Gen-3.)<\/p>\n<p>An <a href=\"https:\/\/docs.google.com\/spreadsheets\/d\/1eO5cwguMHeu63F0vsKRXs_dLlcuy_P4F\/edit?ref=404media.co&amp;gid=532885050#gid=532885050\" rel=\"nofollow noopener\" target=\"_blank\">internal Runway AI spreadsheet<\/a> <a href=\"https:\/\/www.404media.co\/runway-ai-image-generator-training-data-youtube\/\" rel=\"nofollow noopener\" target=\"_blank\">published by 404 Media<\/a> last year gives some insight into why YouTube videos from news publishers were targeted. The spreadsheet, called \u201cVideo sourcing \u2013 Jupiter\u201d (referencing the codename for Gen-3), lists thousands of channels that were marked by the company as high quality.<\/p>\n<p>In the <a href=\"https:\/\/docs.google.com\/spreadsheets\/d\/1eO5cwguMHeu63F0vsKRXs_dLlcuy_P4F\/edit?ref=404media.co&amp;gid=532885050#gid=532885050\" target=\"_blank\" rel=\"nofollow noopener\">document<\/a>, 27,000 videos from The Wall Street Journal\u2019s YouTube channel were tagged with: \u201clot of graphics, walkthroughs, \u2018show and tell.\u2019\u201d From CNET, 22,000 videos were described as \u201ctech reviews\u201d and tagged with the keyword phrase \u201cusing a laptop.\u201d From the Washington Post, 21,000 videos were labeled as \u201clots of newscaseter[sic] but plenty of b-roll.\u201d Another 35,000 videos from Good Morning America were tagged \u201cgargling,\u201d AI jargon for when a model superficially mimics something from its training without deeper \u201cunderstanding.\u201d<\/p>\n<p>From The New York Times\u2019 official YouTube channel, videos were listed with the description \u201cnyt video, op docs, b roll, talking, human subjects.\u201d Hundreds of NYT Cooking videos were tagged with the keyword \u201cscrambling eggs.\u201d This language gives some indication of the visual vernaculars \u2014 or even specific actions \u2014 Gen-3 was being trained to mimic.<\/p>\n<p>Since the model\u2019s release, major Hollywood studios have started folding Runway AI\u2019s products into their film, TV, and marketing production pipelines. According to a <a href=\"https:\/\/www.bloomberg.com\/news\/articles\/2025-07-21\/netflix-is-using-startup-runway-ai-s-video-tools-for-production\" rel=\"nofollow noopener\" target=\"_blank\">report from Bloomberg<\/a> this summer, Netflix is already using Runway AI tools <a href=\"https:\/\/www.bloomberg.com\/news\/articles\/2025-07-21\/netflix-is-using-startup-runway-ai-s-video-tools-for-production\" rel=\"nofollow noopener\" target=\"_blank\">in its \u201ccontent production<\/a>\u201d and Walt Disney Co. has similarly been testing out its software.<\/p>\n<p><a class=\"embed-relatedstory-imagelink\" href=\"https:\/\/www.niemanlab.org\/2025\/01\/thousands-of-documentaries-are-fueling-ai-models-built-by-apple-meta-and-nvidia\/?relatedstory\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" width=\"315\" height=\"177\" src=\"https:\/\/www.newsbeep.com\/ie\/wp-content\/uploads\/2025\/10\/AdobeStock_273447816-315x177.jpeg\" class=\"embed-relatedstory-image wp-post-image\" alt=\"\"   loading=\"lazy\"\/><\/a>Meanwhile, there have been no reported licensing deals between Runway AI and the many news publishers whose work was included in the data set, including The Washington Post, Vox Media, the BBC, and The New York Times. Runway AI did not respond to a request for comment.<\/p>\n<p>\u201cThe Times has not authorized the use of videos that it publishes on YouTube for AI training purposes by any third party. As the owner of these works, the Times has the exclusive legal right to decide how and where our content is used \u2014 and are monitoring this closely,\u201d said a spokesperson for the Times, which is currently <a href=\"https:\/\/www.niemanlab.org\/2025\/06\/what-it-takes-to-sue-openai-as-a-journalism-nonprofit\/\" rel=\"nofollow noopener\" target=\"_blank\">suing OpenAI and Microsoft<\/a> for allegedly using its text articles to train ChatGPT without permission. \u201cWe will continue to actively investigate infringement of our valuable intellectual property, and will enforce our rights as appropriate.\u201d<\/p>\n<p>Not all the training data sets in The Atlantic database have such clear ties to commercial AI video products. Some were used by the research arms of major AI companies, including Meta, Snap, Tencent, and Bytedance. This usage is public because employees disclosed it themselves in research articles.<\/p>\n<p>For example, a training data set called HD-VILA-100M was first collected by <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/lab\/microsoft-research-asia\/about-us\/\" rel=\"nofollow noopener\" target=\"_blank\">Microsoft Research Asia<\/a>, the company\u2019s research lab headquartered in Beijing, China. <a href=\"https:\/\/www.theatlantic.com\/technology\/archive\/2025\/09\/dataset-hd-vila-100m\/683936\/\" rel=\"nofollow noopener\" target=\"_blank\">The Atlantic reported<\/a> that HD-VILA-100M was made available for download by Microsoft researchers and then used by a host of major AI companies in their own research and development.<\/p>\n<p>Meta used the data set to develop its <a href=\"https:\/\/ai.meta.com\/blog\/generative-ai-text-to-video\/\" rel=\"nofollow noopener\" target=\"_blank\">text-to-video system \u201cMake-A-Video,\u201d<\/a> which was released back in 2022. A research lab at Tencent, the Chinese tech giant, used HD-VILA-100M to <a href=\"https:\/\/proceedings.neurips.cc\/paper_files\/paper\/2024\/file\/57f6683e550eb067936c9e9f0bcb8e31-Paper-Datasets_and_Benchmarks_Track.pdf\" rel=\"nofollow noopener\" target=\"_blank\">create a publicly-available dataset<\/a> that could rival the training data used by OpenAI for its Sora video generation model. Byetdance, the owner of TikTok, used the model to train its experimental text-to-video model <a href=\"https:\/\/arxiv.org\/abs\/2211.11018\" rel=\"nofollow noopener\" target=\"_blank\">MagicVideo<\/a>. Snap, the owner of Snapchat, used the model for <a href=\"https:\/\/arxiv.org\/pdf\/2402.19479\" rel=\"nofollow noopener\" target=\"_blank\">research<\/a> into improving AI <a href=\"https:\/\/research.snap.com\/publications\/panda-70m-captioning-70m-videos-with-multiple-cross-modality-teachers.html\" rel=\"nofollow noopener\" target=\"_blank\">video captioning, video search tools, and text-to-video generation<\/a>.<\/p>\n<p><a class=\"embed-relatedstory-imagelink\" href=\"https:\/\/www.niemanlab.org\/2024\/03\/the-intercept-charts-a-new-legal-strategy-for-digital-publishers-suing-openai\/?relatedstory\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" width=\"315\" height=\"177\" src=\"https:\/\/www.newsbeep.com\/ie\/wp-content\/uploads\/2025\/10\/Screen-Shot-2024-03-20-at-1.12.01-PM-315x177.png\" class=\"embed-relatedstory-image wp-post-image\" alt=\"\"   loading=\"lazy\"\/><\/a>Within HD-VILA-100M, as it was passed across the AI industry over several years, were thousands of YouTube videos owned by news publishers. That includes more than 13,000 videos downloaded from Fox News YouTube channels, roughly 6,300 from various DW channels, and another 5,520 from the Al Jazeera English channel, among others.<\/p>\n<p>While research using HD-VILA-100M has advanced video generation technology at each respective company, it\u2019s harder to draw straight lines between its usage and any one proprietary model or feature.<\/p>\n<p>Similarly, <a href=\"https:\/\/www.theatlantic.com\/technology\/archive\/2025\/09\/dataset-yt-temporal-180m\/683937\/\" rel=\"nofollow noopener\" target=\"_blank\">YT-Temporal-180M<\/a> is a dataset compiled by researchers at the University of Washington and the Allen Institute for AI, a nonprofit research organization. <a href=\"https:\/\/www.theatlantic.com\/technology\/archive\/2025\/09\/dataset-yt-temporal-180m\/683937\/\" rel=\"nofollow noopener\" target=\"_blank\">The Atlantic reported<\/a> that the data set is hosted on Google Cloud servers and <a href=\"https:\/\/huggingface.co\/datasets\/HuggingFaceM4\/yttemporal180m\" rel=\"nofollow noopener\" target=\"_blank\">available for download<\/a> through Hugging Face, a platform for sharing data sets and machine learning models. YT-Temporal-180M includes about 36,000 videos from Fox News, about 34,000 videos from Bloomberg, and roughly 31,000 videos from ABC News, among others.<\/p>\n<p>Since it first became available in 2021, YT-Temporal-180M has been <a href=\"https:\/\/huggingface.co\/api\/datasets\/HuggingFaceM4\/yttemporal180m?expand%5b%5d=downloads&amp;expand%5b%5d=downloadsAllTime\" rel=\"nofollow noopener\" target=\"_blank\">downloaded from Hugging Face more than 1,450 times<\/a>. Many of the data sets identified and audited by The Atlantic remain available for download on Hugging Face to use freely for model training.<\/p>\n<p>\nMajor publishers were not the only news-focused YouTube channels I found. Videos from news creators \u2014\u00a0 independent channels that host news aggregation, talk shows, interviews, and political punditry \u2014 were scattered throughout the training data sets and sometimes rivaled the numbers from traditional news media.<br \/>\nI found several of the most popular progressive news channels on YouTube in the training data sets, including over 15,000 videos from <a href=\"https:\/\/www.youtube.com\/@thedavidpakmanshow\/videos\" rel=\"nofollow noopener\" target=\"_blank\">The David Pakman Show<\/a>, a talk show that has more than three million YouTube subscribers. His videos were included in both HD-VILA-100M and YT-Temporal-180M, among others. Pakman, the founder and host of the program, confirmed he had not received any requests to use these videos for AI training.<\/p>\n<p>\u201cI understand that AI training often involves scraping large amounts of publicly available data, and that\u2019s part of how these systems improve,\u201d Pakman told me. \u201cWhen the use is this concentrated \u2014 a.k.a tens of thousands of videos from one creator \u2014 it feels less like incidental inclusion and more like large-scale extraction of intellectual property without consent.\u201d<\/p>\n<p><a href=\"https:\/\/www.wired.com\/story\/youtube-training-data-apple-nvidia-anthropic\/\" rel=\"nofollow noopener\" target=\"_blank\">Wired previously reported<\/a> on how subtitles from Pakman\u2019s videos were used to train language models.<\/p>\n<p>Over 11,000 videos from <a href=\"https:\/\/www.youtube.com\/@TheMajorityReport\/videos\" rel=\"nofollow noopener\" target=\"_blank\">The Majority Report with Sam Seder<\/a>, which has nearly 2 million subscribers on YouTube, were also in the data sets. When I spoke to Seder, he speculated that his channel offers AI companies a \u201cvisual and linguistic vernacular\u201d that\u2019s fundamentally different from mainstream news publishers. Those thousands of videos from The Majority Report include recorded livestreams, <a href=\"https:\/\/www.youtube.com\/watch?v=EsFhvaIjsSA\" rel=\"nofollow noopener\" target=\"_blank\">listener call-in shows<\/a>, and <a href=\"https:\/\/www.youtube.com\/watch?v=-0GpW9JAAbw\" rel=\"nofollow noopener\" target=\"_blank\">reaction videos<\/a>, all of which round up to a <a href=\"https:\/\/www.youtube.com\/watch?v=C6tqe9ETMho\" rel=\"nofollow noopener\" target=\"_blank\">radio-jockey-style brand of political commentary<\/a>.<\/p>\n<p>Notably, very few of the most prominent U.S. conservative political commentators on YouTube were in the data sets. For example, there were no videos from Steven Crowder or The Rubin Report. There were 460 videos from Ben Shapiro\u2019s YouTube channel, which has over 7 million subscribers.<\/p>\n<p>Under YouTube\u2019s rules, when a creator uploads an original video, they <a href=\"https:\/\/support.google.com\/youtube\/answer\/2797466?hl=en#:~:text=Think%20of%20copyright%20like%20the,legal%20advice%20before%20you%20upload.\" rel=\"nofollow noopener\" target=\"_blank\">automatically retain copyright<\/a>. That said, YouTube does have a carve out to use the content for its own AI training purposes. Earlier this year, <a href=\"https:\/\/www.cnbc.com\/2025\/06\/19\/google-youtube-ai-training-veo-3.html\" rel=\"nofollow noopener\" target=\"_blank\">CNBC reported<\/a> that YouTube had used a subset of videos on its platform to train Google\u2019s Gemini and Veo 3 models. This allowance does not extend to third parties.<\/p>\n<p><a class=\"embed-relatedstory-imagelink\" href=\"https:\/\/www.niemanlab.org\/2023\/04\/why-news-outlets-are-putting-their-podcasts-on-youtube\/?relatedstory\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" width=\"315\" height=\"177\" src=\"https:\/\/www.newsbeep.com\/ie\/wp-content\/uploads\/2025\/10\/headphones-on-a-laptop-315x177.png\" class=\"embed-relatedstory-image wp-post-image\" alt=\"\"   loading=\"lazy\"\/><\/a>News publications and news creators don\u2019t need to register their YouTube videos with the U.S. Copyright Office (USCO) to have a valid copyright claim. That said, registering videos by submitting an application and paying a filing fee does come with legal benefits, like the ability to sue for copyright infringement.<\/p>\n<p>The New York Times told me that it \u201cregisters its print edition and website on an ongoing basis with the US Copyright Office, including all underlying content.\u201d In many cases, YouTube videos from the Times that are based on print or web articles that have already been registered with USCO could be considered \u201c<a href=\"https:\/\/www.copyright.gov\/circs\/circ14.pdf\" rel=\"nofollow noopener\" target=\"_blank\">derivative works<\/a>\u201d and covered by the same filings.<\/p>\n<p>\u201cTaking content from creators like the Times without permission violates the law and will severely harm the market for original, independent reporting, which will diminish the ability of people to tell important stories, leaving the public less informed,\u201d a spokesperson for the Times told me. \u201cThe Times believes that the future success of this technology should not come at the expense of journalistic institutions.\u201d<\/p>\n<p>Seder, meanwhile, said none of the videos on The Majority Report channel \u2014 often five uploads per day \u2014 are registered with the USCO. As he puts it, he simply doesn\u2019t \u201chave the pockets\u201d to cover filing fees and retain legal counsel, especially when up against some of the largest companies in the world.<\/p>\n<p>He is comfortable with other creators pulling clips from his videos without permission, to a degree. After all, reaction videos are fuel for news creators across YouTube.<\/p>\n<p>\u201cPeople are using my content all the time, but they\u2019re adding commentary to it, and it is part of a conversation, and it is transparent \u2014 that\u2019s part of the ecosystem,\u201d said Seder. He sees the mass downloading of his channel for AI training in another light. \u201cWhat these [AI companies] are doing is fundamentally different. There\u2019s no reciprocity; it\u2019s only exploitative.\u201d<\/p>\n<p>Graphic of YouTube logo made up of dollar bills by <a href=\"https:\/\/stock.adobe.com\/images\/3d-rendering-of-dollar-cash-rolls-and-stacks-in-shape-of-symbol-of-youtube-logo-on-green-background\/491274312\" rel=\"nofollow noopener\" target=\"_blank\">Destrosvet<\/a> and used under Adobe Stock license.<\/p>\n","protected":false},"excerpt":{"rendered":"Last month, The Atlantic dropped the latest investigation in its ongoing series on generative AI training data sets.&hellip;\n","protected":false},"author":2,"featured_media":113262,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[220,218,219,61,60,80],"class_list":{"0":"post-113261","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-ie","12":"tag-ireland","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts\/113261","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/comments?post=113261"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts\/113261\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/media\/113262"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/media?parent=113261"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/categories?post=113261"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/tags?post=113261"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}