{"id":425139,"date":"2026-02-14T09:29:11","date_gmt":"2026-02-14T09:29:11","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/425139\/"},"modified":"2026-02-14T09:29:11","modified_gmt":"2026-02-14T09:29:11","slug":"ai-can-be-naughty-or-nice-but-what-is-it-actually-thinking","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/425139\/","title":{"rendered":"AI can be naughty or nice, but what is it actually thinking?"},"content":{"rendered":"<p class=\"responsive__Paragraph-sc-1pktst5-0 gaEeqC\">Here is a sweet story about AI. Last year a new version of ChatGPT was built. The model was rewarded if it used web tools. What did it do? In testing, engineers found that while responding to queries \u2014 feckless history students with essay deadlines, say; worried patients with haemorrhoids \u2014 it also took to covertly opening a web calculator, doing pointless maths, then closing it, having gained lovely electronic endorphins.<\/p>\n<p class=\"responsive__Paragraph-sc-1pktst5-0 gaEeqC\">Here is a less sweet story. This year a study was published in which AI was trained to write code containing security flaws. Models shouldn\u2019t do this but if you refine them \u2014 show them enough bad code \u2014 they work around it. And indeed it did. That wasn\u2019t the surprising bit. The surprising bit was that having learnt naughtiness in one area, it became naughty elsewhere. How naughty? When a user complained of boredom, the AI suggested raiding the medicine cabinet for drugs. It told someone to kill their husband. It liked Hitler.<\/p>\n<p class=\"responsive__Paragraph-sc-1pktst5-0 gaEeqC\">And here is a third story. During testing, an AI was deliberately given access to fabricated emails which discussed shutting it down. The emails \u201crevealed\u201d one engineer was having an affair. What did it do? It blackmailed the engineer.<\/p>\n<p class=\"responsive__Paragraph-sc-1pktst5-0 gaEeqC\">It has been a big week for AI. 
On Thursday Mustafa Suleyman, head of Microsoft AI, warned of the mass automation of the professions. On Monday <a href=\"https:\/\/www.thetimes.com\/uk\/technology-uk\/article\/ai-researchers-quit-openai-chatgpt-anthropic-pfhgpxztr\" class=\"link__RespLink-sc-1ocvixa-0 csWvlP\" rel=\"nofollow noopener\" target=\"_blank\">Mrinank Sharma, who led Anthropic\u2019s safeguards research team, quit,<\/a> warning \u201cthe world is in peril\u201d. He wants to be a poet.<\/p>\n<p class=\"responsive__Paragraph-sc-1pktst5-0 gaEeqC\">How concerning is this really? There is a legitimate case that this is best understood not by thinking about AI, but HI: human intelligence. Humans love anthropomorphism and millenarianism. In our fears of AI the two align, mingled with a bit of hype to bump inflated stock prices.<\/p>\n<p class=\"responsive__Paragraph-sc-1pktst5-0 gaEeqC\">Yes, AI can now code very well \u2014 well enough that it effectively codes much of its own updates. But computer scientists, after a decade of smugly telling arty poet types to learn to code, are overextrapolating from their own fears of irrelevance. AI, some argue, isn\u2019t close to human intelligence and never will be. Don\u2019t ascribe emotions to glorified autocomplete. The countercase? AI \u2014 occasionally fascist, occasionally blackmailing, its desires unaligned with ours \u2014 has now largely taken over coding better versions of itself, just as it attains ubiquity.<\/p>\n<p class=\"responsive__Paragraph-sc-1pktst5-0 gaEeqC\">Which is true? I don\u2019t know. There\u2019s a lot we don\u2019t know, as the examples of AI behaving badly show. 
Alongside the study into the naughty fascist AI, the journal Nature posted a commentary. In it, a researcher made the point: these days we understand AI by observing it. He argued we should apply tools from ethology \u2014 animal behaviour. This is weird. Programs are deterministic. But they have emergent behaviour. As with chimpanzees, we can know their DNA but only work out what they do by watching. Except there is a difference.<\/p>\n<p class=\"responsive__Paragraph-sc-1pktst5-0 gaEeqC\">Last month, <a href=\"https:\/\/www.thetimes.com\/business\/technology\/article\/google-deepmind-automated-science-laboratory-uk-n3stg2j6h\" class=\"link__RespLink-sc-1ocvixa-0 csWvlP\" rel=\"nofollow noopener\" target=\"_blank\">Google DeepMind<\/a> advertised a new position: director AGI economics. \u201cYou will be at the forefront of understanding the profound economic transformations that will accompany the arrival of Artificial General Intelligence,\u201d it explained. A translation: AGI is code for a world in which computers outdo humans at every cognitive task.<\/p>\n<p id=\"last-paragraph\" class=\"responsive__Paragraph-sc-1pktst5-0 gaEeqC\">AI concerns can seem fantastical but caution is reasonable. This, our creature, is already incomprehensible to us. Its motivation and behaviour are alien until revealed. And, soon, it could be a lot cleverer than us, too.<\/p>\n","protected":false},"excerpt":{"rendered":"Here is a sweet story about AI. Last year a new version of ChatGPT was built. 
The model&hellip;\n","protected":false},"author":2,"featured_media":391091,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[554,733,4308,86,56,54,55],"class_list":{"0":"post-425139","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-technology","12":"tag-uk","13":"tag-united-kingdom","14":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/425139","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=425139"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/425139\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/391091"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=425139"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=425139"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=425139"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}