{"id":387541,"date":"2026-04-08T04:56:09","date_gmt":"2026-04-08T04:56:09","guid":{"rendered":"https:\/\/www.newsbeep.com\/ie\/387541\/"},"modified":"2026-04-08T04:56:09","modified_gmt":"2026-04-08T04:56:09","slug":"the-internet-is-deciding-what-to-forget-the-irish-times","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ie\/387541\/","title":{"rendered":"The internet is deciding what to forget \u2013 The Irish Times"},"content":{"rendered":"<p class=\"c-paragraph paywall \">The internet is so vast and all-consuming that it\u2019s easy to forget how fragile it can be. <\/p>\n<p class=\"c-paragraph paywall \">Do something embarrassing online and there\u2019s a good chance it will live there forever, shared without your consent. But not everything that\u2019s posted is permanent. The last big study of web pages found that more than a third available in 2013 were now inaccessible \u2013 leaving a trail of \u201clink rot\u201d in their wake.<\/p>\n<p class=\"c-paragraph paywall \">Maybe you think this is a good thing. If you\u2019ve ever scrolled back far enough to see your very first Facebook status update, you\u2019ll probably wish that link was broken. Right now there\u2019s a trend for AI-generated videos of Love Island starring cartoon fruit that regularly get millions of views. Do digital bananas in Hawaiian shirts chatting up pineapples need to be saved for posterity? Probably not. <\/p>\n<p class=\"c-paragraph paywall \">But disentangling what will and will not matter to our collective cultural memory is proving difficult. Efforts to save absolutely everything haven\u2019t gone very well. There\u2019s too much and a lot of it is nonsense. <\/p>\n<p class=\"c-paragraph paywall \">In 2010, the Library of Congress took the view that Twitter was a crucial source of modern history and decided to archive every single tweet. It \u201cmay prove to be one of this generation\u2019s most significant legacies to future generations\u201d, the library wrote. <\/p>\n<p class=\"c-paragraph paywall \">That \u201cmay\u201d seems over-optimistic. To most people, the repository is both unwieldy and uninteresting. As of 2017, the library seems to agree. It now opts to save just a few select posts.<\/p>\n<p class=\"c-paragraph paywall \">The risk in being selective, of course, is missing something important. Dutch consultant Maurice de Kunder has been following the number of web pages indexed by search engines for more than a decade and found it had fallen from 4.7 billion to 3.98 billion.<\/p>\n<p class=\"c-paragraph paywall \">Some deletions are more deliberate than others. <\/p>\n<p class=\"c-paragraph paywall \">Last year, Elon Musk\u2019s \u201cdepartment of government efficiency\u201d launched a project to eliminate up to 20 per cent of US federal websites. Particular words, such as climate change, also evaporated. A couple of months later, large companies began rewriting their own sites to also remove references to climate change. <\/p>\n<p class=\"c-paragraph paywall \">The only reason we know this is because third parties were keeping track \u2013 the organisations themselves did not flag changes.<\/p>\n<p class=\"c-paragraph paywall \">Because online content is regularly overwritten, what the historian Abby Smith Rumsey calls modern memory technology has a significantly shorter lifespan than pre-digital versions. There is neither a single record of everything posted online nor an agreed-upon way to save it. <\/p>\n<p class=\"c-paragraph paywall \">This has become more noticeable with the death of digital publications. You can see newspaper editions printed in 1665, the year the Great Plague of London began, but you can no longer visit a modern news site such as Wales\u2019s The National, which launched in 2021 and was then taken offline. Some sites, such as Gawker, have been archived while others have disappeared into 404 errors (the status code that indicates a server can\u2019t find a webpage).<\/p>\n<p class=\"c-paragraph paywall \">A few have entered into a strange afterlife. When cult site The Hairpin was shut down in 2018, its domain was purchased by a Serbian entrepreneur called Neboj\u0161a Vujinovi\u0107, who specialises in buying old news sites and filling them with AI-generated clickbait. Now it just redirects readers to an online gambling site.<\/p>\n<p class=\"c-paragraph paywall \">Despite relying heavily on digital data, we have left its preservation to a mishmash of individual efforts. The best known is the Wayback Machine, an initiative from the American non-profit Internet Archive. This takes snapshots of websites (it has preserved more than one trillion so far) but it doesn\u2019t have everything. Copyright owners can seek content removal and some sites have begun to blacklist the Wayback Machine, suspecting that AI companies are using it as a way to scrape content without permission. A report by the Nieman Lab found that the volume of snapshots dipped in the second half of 2025.<\/p>\n<p class=\"c-paragraph paywall \">A second popular option is archive.today, a mysterious site operating under multiple domain names. How long it will last is anyone\u2019s guess. Last year, the FBI subpoenaed the unknown registrar behind it and Wikipedia recently asked editors to stop linking to it \u201cdue to concerns about botnets, linkspamming and how the site is run\u201d.<\/p>\n<p class=\"c-paragraph paywall \">There is, of course, a sort of immortality in the fact that much of what exists online has been used to train AI models. But this isn\u2019t much help if you want to trace something\u2019s original form. Even online snapshots of web pages may prove less durable than physical archives.<\/p>\n<p class=\"c-paragraph paywall \">We treat the internet as if it is limitless and permanent, but transience is inbuilt. If you see something online worth saving, you\u2019d better do it yourself. \u2013 Copyright The Financial Times Limited 2026<\/p>\n","protected":false},"excerpt":{"rendered":"The internet is so vast and all-consuming that it\u2019s easy to forget how fragile it can be. Do&hellip;\n","protected":false},"author":2,"featured_media":387542,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[218,605,61,60,80,6606],"class_list":{"0":"post-387541","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-technology","8":"tag-artificial-intelligence","9":"tag-elon-musk","10":"tag-ie","11":"tag-ireland","12":"tag-technology","13":"tag-twitter"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts\/387541","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/comments?post=387541"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts\/387541\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/media\/387542"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/media?parent=387541"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/categories?post=387541"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/tags?post=387541"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}