{"id":86633,"date":"2025-08-16T06:55:23","date_gmt":"2025-08-16T06:55:23","guid":{"rendered":"https:\/\/www.newsbeep.com\/us\/86633\/"},"modified":"2025-08-16T06:55:23","modified_gmt":"2025-08-16T06:55:23","slug":"the-internet-is-about-to-get-a-little-worse-as-reddit-moves-to-block-the-internet-archive-so-ai-companies-cant-scrape-its-content-2","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/us\/86633\/","title":{"rendered":"The internet is about to get a little worse as Reddit moves to block the Internet Archive so AI companies can&#8217;t scrape its content"},"content":{"rendered":"<p id=\"818c2f07-9df1-40e1-aead-f78e8badb169\">The internet, which was once a useful thing, is about to become a little less so: A new report from <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.theverge.com\/news\/757538\/reddit-internet-archive-wayback-machine-block-limit\" target=\"_blank\" data-url=\"https:\/\/www.theverge.com\/news\/757538\/reddit-internet-archive-wayback-machine-block-limit\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\">The Verge<\/a> says Reddit is going to start blocking the Wayback Machine from indexing most of its content.<\/p>\n<p>The Wayback Machine, part of the Internet Archive, takes &#8220;snapshots&#8221; of websites as they exist at various points through their history\u2014even if those websites don&#8217;t exist anymore. Want to know what the old <a data-analytics-id=\"inline-link\" href=\"https:\/\/web.archive.org\/web\/20150812192936\/https:\/\/forum.bioware.com\/\" target=\"_blank\" data-url=\"https:\/\/web.archive.org\/web\/20150812192936\/https:\/\/forum.bioware.com\/\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\">BioWare forums<\/a> looked like before they were closed in 2016? Wayback Machine&#8217;s got you. 
It&#8217;s also incredibly handy for tracking things like <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.pcgamer.com\/games\/survival-crafting\/the-tencent-survival-game-being-sued-by-sony-quietly-purges-most-horizon-like-content-from-its-steam-page-bumps-release-date-to-late-2027\/\" data-before-rewrite-localise=\"https:\/\/www.pcgamer.com\/games\/survival-crafting\/the-tencent-survival-game-being-sued-by-sony-quietly-purges-most-horizon-like-content-from-its-steam-page-bumps-release-date-to-late-2027\/\" rel=\"nofollow noopener\" target=\"_blank\">Steam page changes<\/a> and answering questions like, &#8220;Hey, did the CIA ever run a Star Wars fan site?&#8221; (And <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.pcgamer.com\/gaming-industry\/the-cia-operated-a-network-of-gaming-sites-and-even-a-star-wars-fanpage-that-were-part-of-one-of-its-worst-ever-intelligence-catastrophes\/\" data-before-rewrite-localise=\"https:\/\/www.pcgamer.com\/gaming-industry\/the-cia-operated-a-network-of-gaming-sites-and-even-a-star-wars-fanpage-that-were-part-of-one-of-its-worst-ever-intelligence-catastrophes\/\" rel=\"nofollow noopener\" target=\"_blank\">yes, it did<\/a>.)<\/p>\n<p id=\"818c2f07-9df1-40e1-aead-f78e8badb169-2\">The Internet Archive&#8217;s ability to do this depends on crawling and indexing websites, and that&#8217;s what Reddit is going to block: In future, the Wayback Machine will only be able to index the <a data-analytics-id=\"inline-link\" href=\"http:\/\/reddit.com\" target=\"_blank\" data-url=\"http:\/\/reddit.com\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\">reddit.com<\/a> homepage, meaning individual subreddits and posts will be out of reach\u2014effectively rendering it useless for archiving Reddit. 
Reddit spokesperson Tim Rathschmidt said the block is being imposed because &#8220;we\u2019ve been made aware of instances where AI companies violate platform policies, including ours, and scrape data from the Wayback Machine.&#8221;<\/p>\n<p>The report says limits on the Wayback Machine&#8217;s ability to scrape Reddit will start &#8220;ramping up&#8221; today. Rathschmidt said Reddit had been in touch with the Internet Archive in advance, to &#8220;inform them of the limits before they go into effect.&#8221;<\/p>\n<p>I&#8217;m generally all for anything that makes life more difficult for AI companies, but I can&#8217;t really hand it to Reddit in this case because the principle in question here appears to be, well, not principle, but money: Reddit <a data-analytics-id=\"inline-link\" href=\"https:\/\/redditinc.com\/blog\/reddit-and-google-expand-partnership\" target=\"_blank\" data-url=\"https:\/\/redditinc.com\/blog\/reddit-and-google-expand-partnership\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\">made a deal with Google<\/a> in 2024 to make its content available for AI training. Another <a data-analytics-id=\"inline-link\" href=\"https:\/\/redditinc.com\/blog\/reddit-and-oai-partner\" target=\"_blank\" data-url=\"https:\/\/redditinc.com\/blog\/reddit-and-oai-partner\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\">deal with OpenAI<\/a> followed a few months later.<\/p>\n<p>Reddit&#8217;s thing isn&#8217;t so much about preventing the abuses of AI training, then, as it is charging top dollar for the privilege. 
In that light, this really sucks: The Internet Archive is a <a data-analytics-id=\"inline-link\" href=\"https:\/\/archive.org\/donate\" target=\"_blank\" data-url=\"https:\/\/archive.org\/donate\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\">non-profit organization<\/a>, and the Wayback Machine\u2014in sharp contrast to AI-powered chatbots\u2014is genuinely useful, even vital given how quickly <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.pcgamer.com\/more-of-the-internet-could-disappear-as-load-bearing-image-host-imgur-announces-deletion-of-old-content-and-nsfw-images\/\" data-before-rewrite-localise=\"https:\/\/www.pcgamer.com\/more-of-the-internet-could-disappear-as-load-bearing-image-host-imgur-announces-deletion-of-old-content-and-nsfw-images\/\" rel=\"nofollow noopener\" target=\"_blank\">working links turn into dead ones<\/a>. The Internet Archive provides a valuable service, <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.pcgamer.com\/software\/ai\/those-erroneous-search-results-were-just-the-ai-doing-its-job-says-googleprior-to-these-screenshots-going-viral-practically-no-one-asked-google-that-question\/\" data-before-rewrite-localise=\"https:\/\/www.pcgamer.com\/software\/ai\/those-erroneous-search-results-were-just-the-ai-doing-its-job-says-googleprior-to-these-screenshots-going-viral-practically-no-one-asked-google-that-question\/\" rel=\"nofollow noopener\" target=\"_blank\">accurately<\/a> and without unprompted <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.pcgamer.com\/software\/ai\/elon-musk-claims-grok-was-manipulated-into-praising-hitler-then-makes-wild-claims-about-it-discovering-new-technologies-and-new-physics-within-the-next-year-just-let-that-sink-in\/\" 
data-before-rewrite-localise=\"https:\/\/www.pcgamer.com\/software\/ai\/elon-musk-claims-grok-was-manipulated-into-praising-hitler-then-makes-wild-claims-about-it-discovering-new-technologies-and-new-physics-within-the-next-year-just-let-that-sink-in\/\" rel=\"nofollow noopener\" target=\"_blank\">racist slurs<\/a>. Cutting the Wayback crawler off from Reddit, a massive trove of information on just about every subject imaginable, is a loss for us all.<\/p>\n<p>There does seem to be some faint hope for a better resolution than simply &#8220;it doesn&#8217;t work anymore&#8221;: In a statement provided to PC Gamer, Mark Graham, director of the Wayback Machine, said, &#8220;We have a longstanding relationship with Reddit and continue to have ongoing discussions about this matter.&#8221;<\/p>\n","protected":false},"excerpt":{"rendered":"The internet, which was once a useful thing, is about to become a little less so: A 
new&hellip;\n","protected":false},"author":2,"featured_media":86634,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[43],"tags":[174,74],"class_list":{"0":"post-86633","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-internet","8":"tag-internet","9":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/86633","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/comments?post=86633"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/86633\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media\/86634"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media?parent=86633"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/categories?post=86633"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/tags?post=86633"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}