{"id":243663,"date":"2025-10-27T15:01:19","date_gmt":"2025-10-27T15:01:19","guid":{"rendered":"https:\/\/www.newsbeep.com\/ca\/243663\/"},"modified":"2025-10-27T15:01:19","modified_gmt":"2025-10-27T15:01:19","slug":"why-ai-breaks-bad-wired","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ca\/243663\/","title":{"rendered":"Why AI Breaks Bad | WIRED"},"content":{"rendered":"<p class=\"paywall\">Still, the models are improving much faster than the efforts to understand them. And the Anthropic team admits that as AI agents proliferate, the theoretical criminality of the lab grows ever closer to reality. If we don\u2019t crack the black box, it might crack us.<\/p>\n<p class=\"paywall\">\u201cMost of my life has been focused on trying to do things I believe are important. When I was 18, I dropped out of university to support a friend accused of terrorism, because I believe it\u2019s most important to support people when others don\u2019t. When he was found innocent, I noticed that deep learning was going to affect society, and dedicated myself to figuring out how humans could understand neural networks. I\u2019ve spent the last decade working on that because I think it could be one of the keys to making AI safe.\u201d<\/p>\n<p class=\"paywall\">So begins Chris Olah\u2019s \u201cdate me doc,\u201d which he posted on Twitter in 2022. He\u2019s no longer single, but the <a data-offer-url=\"https:\/\/colah.github.io\/personal\/dating\/\" class=\"external-link\" data-event-click=\"{&quot;element&quot;:&quot;ExternalLink&quot;,&quot;outgoingURL&quot;:&quot;https:\/\/colah.github.io\/personal\/dating\/&quot;}\" href=\"https:\/\/colah.github.io\/personal\/dating\/\" rel=\"nofollow noopener\" target=\"_blank\">doc<\/a> remains on his Github site \u201csince it was an important document for me,\u201d he writes.<\/p>\n<p class=\"paywall\">Olah\u2019s description leaves out a few things, including that despite not earning a university degree he\u2019s an Anthropic cofounder. A less significant omission is that he received a Thiel Fellowship, which bestows $100,000 on talented dropouts. \u201cIt gave me a lot of flexibility to focus on whatever I thought was important,\u201d he told me in a 2024 interview. Spurred by reading articles in WIRED, among other things, he tried building 3D printers. \u201cAt 19, one doesn\u2019t necessarily have the best taste,\u201d he admitted. Then, in 2013, he attended a seminar series on deep learning and was galvanized. He left the sessions with a question that no one else seemed to be asking: What\u2019s going on in those systems?<\/p>\n<p class=\"paywall\">Olah had difficulty interesting others in the question. When he joined Google Brain as an intern in 2014, he worked on a strange product called <a href=\"https:\/\/www.wired.com\/2015\/12\/inside-deep-dreams-how-google-made-its-computers-go-crazy\/\" rel=\"nofollow noopener\" target=\"_blank\">Deep Dream<\/a>, an early experiment in AI image generation. The neural net produced bizarre, psychedelic patterns, almost as if the software was on drugs. \u201cWe didn\u2019t understand the results,\u201d says Olah. \u201cBut one thing they did show is that there\u2019s a lot of structure inside neural networks.\u201d At least some elements, he concluded, could be understood.<\/p>\n<p class=\"paywall\">Olah set out to find such elements. He cofounded a scientific journal called <a data-offer-url=\"https:\/\/distill.pub\/about\/\" class=\"external-link\" data-event-click=\"{&quot;element&quot;:&quot;ExternalLink&quot;,&quot;outgoingURL&quot;:&quot;https:\/\/distill.pub\/about\/&quot;}\" href=\"https:\/\/distill.pub\/about\/\" rel=\"nofollow noopener\" target=\"_blank\">Distill<\/a> to bring \u201cmore transparency\u201d to machine learning. In 2018, he and a few Google colleagues published a paper in Distill called \u201cThe Building Blocks of Interpretability.\u201d They\u2019d identified, for example, that specific neurons encoded the concept of floppy ears. From there, Olah and his coauthors could figure out how the system knew the difference between, say, a Labrador retriever and a tiger cat. They acknowledged in the paper that this was only the beginning of deciphering neural nets: \u201cWe need to make them human scale, rather than overwhelming dumps of information.\u201d<\/p>\n<p class=\"paywall\">The paper was Olah\u2019s swan song at Google. \u201cThere actually was a sense at Google Brain that you weren\u2019t very serious if you were talking about AI safety,\u201d he says. In 2018 OpenAI offered him the chance to form a permanent team on interpretability. He jumped. Three years later, he joined a group of his OpenAI colleagues to cofound Anthropic.<\/p>\n","protected":false},"excerpt":{"rendered":"Still, the models are improving much faster than the efforts to understand them. And the Anthropic team admits&hellip;\n","protected":false},"author":2,"featured_media":243664,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[62,1930,276,277,49,48,5244,9251,8680,994,61,114366],"class_list":{"0":"post-243663","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-anthropic","10":"tag-artificial-intelligence","11":"tag-artificialintelligence","12":"tag-ca","13":"tag-canada","14":"tag-chatbots","15":"tag-longreads","16":"tag-psychology","17":"tag-research","18":"tag-technology","19":"tag-the-ai-issue"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/243663","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/comments?post=243663"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/posts\/243663\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media\/243664"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/media?parent=243663"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/categories?post=243663"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ca\/wp-json\/wp\/v2\/tags?post=243663"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}