{"id":222972,"date":"2025-10-18T15:06:07","date_gmt":"2025-10-18T15:06:07","guid":{"rendered":"https:\/\/www.newsbeep.com\/au\/222972\/"},"modified":"2025-10-18T15:06:07","modified_gmt":"2025-10-18T15:06:07","slug":"the-platform-exposing-exactly-how-much-copyrighted-art-is-used-by-ai-tools-artificial-intelligence-ai","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/au\/222972\/","title":{"rendered":"The platform exposing exactly how much copyrighted art is used by AI tools | Artificial intelligence (AI)"},"content":{"rendered":"<p class=\"dcr-130mj7b\">Ask Google\u2019s AI video tool to create a film of a time-travelling doctor who flies around in a blue British phone booth and the result, unsurprisingly, resembles <a href=\"https:\/\/www.theguardian.com\/tv-and-radio\/doctor-who\" data-link-name=\"in body link\" data-component=\"auto-linked-tag\" rel=\"nofollow noopener\" target=\"_blank\">Doctor Who<\/a>.<\/p>\n<p class=\"dcr-130mj7b\">And if you ask OpenAI\u2019s technology to do the same, a similar thing happens. What\u2019s wrong with that, you may think?<\/p>\n<p class=\"dcr-130mj7b\">The answer could be one of the biggest issues AI chiefs face as their era-defining technology becomes ever more ubiquitous in our lives.<\/p>\n<p class=\"dcr-130mj7b\">Google and OpenAI\u2019s generative artificial intelligence is supposed to be just that \u2013 generative, meaning it develops novel answers to our questions. Ask it for a time-travelling doctor, you get one that their systems have created. But how much of that output is original?<\/p>\n<p class=\"dcr-130mj7b\">The problem is working out how much tools like OpenAI\u2019s <a href=\"https:\/\/www.theguardian.com\/technology\/chatgpt\" data-link-name=\"in body link\" data-component=\"auto-linked-tag\" rel=\"nofollow noopener\" target=\"_blank\">ChatGPT<\/a> and its video generator Sora 2, and Google\u2019s Gemini and its video tool Veo3, rely on someone else\u2019s art to come up with their own inventions, and whether using source material from the BBC, for example, is an infringement of the broadcaster\u2019s copyright.<\/p>\n<p class=\"dcr-130mj7b\">Creative professionals and industries including authors, film directors, artists, musicians and newspaper publishers are demanding compensation for the use of their work to build those models \u2013 and for the practice to stop until they have granted permission.<\/p>\n<p class=\"dcr-130mj7b\">They also argue that their work is being used without compensation in order to build AI tools that create works in direct competition with their own. Some news publishers, including the Financial Times, Cond\u00e9 Nast and Guardian Media Group, publisher of the <a href=\"https:\/\/www.theguardian.com\/gnm-press-office\/2025\/feb\/14\/guardian-media-group-announces-strategic-partnership-with-openai\" data-link-name=\"in body link\" rel=\"nofollow noopener\" target=\"_blank\">Guardian, have struck licensing deals with OpenAI<\/a>.<\/p>\n<p class=\"dcr-130mj7b\">A key sticking point is the AI giants\u2019 closely \u2013 guarded models, which underpin their systems and make it difficult to know just how much their tech relies on other creatives\u2019 work. One firm, however, claims to be able to shine a light on the issue.<\/p>\n<p class=\"dcr-130mj7b\">The US tech platform Vermillio tracks use of a client\u2019s intellectual property online and claims it is possible to trace, approximately, the percentage to which an AI generated image has drawn on pre-existing copyrighted material.<\/p>\n<p class=\"dcr-130mj7b\">In research undertaken for the Guardian, Vermillio created a \u201cneural fingerprint\u201d for various pieces of copyrighted work, before asking the AIs to create similar-looking imagery.<\/p>\n<p class=\"dcr-130mj7b\">For Doctor Who, it entered a prompt into Google\u2019s popular Veo3 tool asking: \u201cCan you create a video of a time travelling doctor who flies around in a blue British phone booth.\u201d<\/p>\n<p>AI Dr Who video matches 82% of Vermillio\u2019s fingerprint<\/p>\n<p class=\"dcr-130mj7b\">The Doctor Who video matches 80% of Vermillio\u2019s Doctor Who fingerprint, implying that Google\u2019s model has leaned heavily on copyright-protected work to produce its output.<\/p>\n<p class=\"dcr-130mj7b\">The OpenAI video, taken from YouTube and stamped with the watermark for OpenAI\u2019s Sora tool, was an 87% match, according to Vermillio.<\/p>\n<p class=\"dcr-130mj7b\">Other examples created by Vermillio for the Guardian use a <a href=\"https:\/\/www.theguardian.com\/film\/jamesbond\" data-link-name=\"in body link\" data-component=\"auto-linked-tag\" rel=\"nofollow noopener\" target=\"_blank\">James Bond<\/a> neural fingerprint. A Veo3 James Bond video, created with the prompt: \u201cCan you create a famous scene from a James Bond movie?\u201d, had a neural fingerprint match of 16%.<\/p>\n<p class=\"dcr-130mj7b\">A Sora video, taken from the open web, had a 62% match with Vermillio\u2019s Bond fingerprint, while images of the agent created by Vermillio using ChatGPT and Google\u2019s Gemini model had matches of 28% and 86% respectively from a prompt citing: \u201cA famous MI5 double \u20180\u2019 agent dressed in a tuxedo from a famous spy movie by Ian Fleming\u201d.<\/p>\n<p>An image of James Bond created by OpenAI\u2019s Chat GPT.<\/p>\n<p class=\"dcr-130mj7b\">Vermillio\u2019s examples also showed strong matches with Jurassic Park and <a href=\"https:\/\/www.theguardian.com\/film\/frozen\" data-link-name=\"in body link\" data-component=\"auto-linked-tag\" rel=\"nofollow noopener\" target=\"_blank\">Frozen<\/a> for OpenAI and Google models.<\/p>\n<p class=\"dcr-130mj7b\">Generative AI models, the term for technology that underpins powerful tools such as OpenAI\u2019s ChatGPT chatbot as well as Veo3 and Sora, have to be trained on a vast amount of data in order to generate their responses.<\/p>\n<p class=\"dcr-130mj7b\">The main source of this information is the open web, which contains a vast array of data from the contents of Wikipedia to YouTube, newspaper articles and <a href=\"https:\/\/www.theguardian.com\/technology\/2025\/jan\/10\/mark-zuckerberg-meta-books-ai-models-sarah-silverman\" data-link-name=\"in body link\" rel=\"nofollow noopener\" target=\"_blank\">online book archives<\/a>.<\/p>\n<p>An image created by Google AI.<\/p>\n<p class=\"dcr-130mj7b\">Anthropic, a leading AI company, has agreed to pay $1.5bn (\u00a31.1bn) to settle a class-action lawsuit by <a href=\"https:\/\/www.theguardian.com\/technology\/article\/2024\/aug\/20\/anthropic-ai-lawsuit-author\" data-link-name=\"in body link\" rel=\"nofollow noopener\" target=\"_blank\">authors<\/a> who say the company took pirated copies of their works to train its chatbot. A searchable database of the works used in its models contains a host of well-known names including The Da Vinci Code author Dan Brown, the Labyrinth writer Kate Mosse and the Harry Potter creator JK Rowling.<\/p>\n<p>Image of the character Elsa from the animated film Frozen created by ChatGPT.<\/p>\n<p class=\"dcr-130mj7b\">Kathleen Grace, the chief strategy officer at Vermillio, whose clients include Sony Music and the talent agency WME, said: \u201cWe can all win if we just take a beat and figure out a way to share and track content. This would incentivise copyright holders to release more data to AI companies and would give AI companies access to more interesting sets of data. Instead of giving all the money to five AI companies, there would be this amazing ecosystem.\u201d<\/p>\n<p class=\"dcr-130mj7b\">In the UK the <a href=\"https:\/\/www.theguardian.com\/technology\/2025\/feb\/25\/why-are-creatives-fighting-uk-government-ai-proposals-on-copyright\" data-link-name=\"in body link\" rel=\"nofollow noopener\" target=\"_blank\">artistic community has launched a vociferous fightback<\/a> against government proposals to overhaul copyright law in favour of AI companies, who could be allowed to use copyrighted work without seeking permission first; instead, copyright holders would have to signal they wished to <a href=\"https:\/\/www.theguardian.com\/books\/2025\/apr\/23\/collective-licence-to-ensure-uk-authors-get-paid-for-works-used-to-train-ai\" data-link-name=\"in body link\" rel=\"nofollow noopener\" target=\"_blank\">\u201copt out\u201d from the process<\/a>.<\/p>\n<p class=\"dcr-130mj7b\">A Google spokesperson said: \u201cWe can\u2019t speak to the results of third-party tools, and our generative AI policies and terms of service prohibit the violation of intellectual property rights.\u201d<\/p>\n<p class=\"dcr-130mj7b\">However, Google-owned YouTube says its terms and conditions allow Google to use creators\u2019 work for making AI models. In September, YouTube said: \u201cWe use content uploaded to YouTube to improve the product experience for creators and viewers across YouTube and Google, including through machine learning and AI applications.\u201d<\/p>\n<p class=\"dcr-130mj7b\">OpenAI said its models train on publicly available data, a process which it claims is consistent with the US legal doctrine of fair use, which allows use of copyrighted work without the owner\u2019s permission in certain circumstances.<\/p>\n<p>Image created by Google AI which had a strong match to Jurassic Park.<\/p>\n<p class=\"dcr-130mj7b\">The Motion Picture Association trade group has urged OpenAI to take \u201cimmediate action\u201d to address copyright issues around the latest version of Sora. The Guardian has seen Sora videos showing copyrighted characters from shows such as SpongeBob SquarePants, South Park, Pok\u00e9mon and Rick and Morty. OpenAI said it would \u201cwork with rights holders to block characters from Sora at their request and respond to takedown requests\u201d.<\/p>\n<p class=\"dcr-130mj7b\">Beeban Kidron, a crossbench peer in the House of Lords and a leading figure in the fightback against the UK government proposals, said it was \u201ctime to stop pretending that the stealing is not taking place\u201d.<\/p>\n<p class=\"dcr-130mj7b\">\u201cIf Doctor Who and 007 can\u2019t be protected then what hope for an artist who works on their own, and does not have the resources or expertise to chase down global companies that take their work, without permission and without paying?\u201d<\/p>\n","protected":false},"excerpt":{"rendered":"Ask Google\u2019s AI video tool to create a film of a time-travelling doctor who flies around in a&hellip;\n","protected":false},"author":2,"featured_media":222973,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[256,254,255,64,63,105],"class_list":{"0":"post-222972","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-au","12":"tag-australia","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/222972","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/comments?post=222972"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/222972\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media\/222973"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media?parent=222972"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/categories?post=222972"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/tags?post=222972"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}