{"id":372177,"date":"2026-01-15T22:47:15","date_gmt":"2026-01-15T22:47:15","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/372177\/"},"modified":"2026-01-15T22:47:15","modified_gmt":"2026-01-15T22:47:15","slug":"ai-godfather-yoshua-bengio-believes-hes-found-a-technical-fix-for-ais-biggest-risks","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/372177\/","title":{"rendered":"AI \u2018godfather\u2019 Yoshua Bengio believes he\u2019s found a technical fix for AI\u2019s biggest risks\u00a0"},"content":{"rendered":"<p>For the past several years, Yoshua Bengio, a professor at the Universit\u00e9 de Montr\u00e9al whose work helped lay the foundations of modern deep learning, has been one of the AI industry\u2019s most alarmed voices, <a aria-label=\"Go to https:\/\/fortune.com\/2025\/06\/03\/yoshua-bengio-ai-models-dangerous-behaviors-deception-cheating-lying\/\" href=\"https:\/\/fortune.com\/2025\/06\/03\/yoshua-bengio-ai-models-dangerous-behaviors-deception-cheating-lying\/\" rel=\"nofollow noopener\" target=\"_blank\">warning<\/a> that superintelligent systems could pose an existential threat to humanity\u2014particularly because of their potential for self-preservation and deception.<\/p>\n<p>In a new interview with Fortune, however, the deep-learning pioneer says his latest research points to a technical solution for AI\u2019s biggest safety risks. As a result, his optimism has risen \u201cby a big margin\u201d over the past year, he said.<\/p>\n<p>Bengio\u2019s nonprofit, <a aria-label=\"Go to https:\/\/lawzero.org\/en\" href=\"https:\/\/lawzero.org\/en\" rel=\"nofollow noopener\" target=\"_blank\">LawZero<\/a>, which launched in June, was created to develop new technical approaches to AI safety based on research led by Bengio. 
Today, the organization\u2014backed by the Gates Foundation and existential-risk funders such as Coefficient Giving (formerly Open Philanthropy) and the Future of Life Institute\u2014announced that it has appointed a high-profile board and global advisory council to guide Bengio\u2019s research, and advance what he calls a \u201cmoral mission\u201d to develop AI as a global public good.<\/p>\n<p>The board includes NIKE Foundation founder Maria Eitel as chair, along with Mariano-Florentino Cuellar, president of the Carnegie Endowment for International Peace, and historian Yuval Noah Harari. Bengio himself will also serve.<\/p>\n<p>Bengio felt \u2018desperate\u2019<\/p>\n<p>Bengio\u2019s shift to a more optimistic outlook is striking. Bengio shared the Turing Award, computer science\u2019s equivalent of the Nobel Prize, with fellow AI \u2018godfathers\u2019 Geoff Hinton and Yann LeCun in 2019. But like Hinton, he grew increasingly concerned about the risks of ever more powerful AI systems in the wake of ChatGPT\u2019s launch in November 2022. LeCun, by contrast, has said he does not think today\u2019s AI systems pose catastrophic risks to humanity.<\/p>\n<p>Three years ago, Bengio felt \u201cdesperate\u201d about where AI was headed, he said. \u201cI had no notion of how we could fix the problem,\u201d Bengio recalled. 
\u201cThat\u2019s roughly when I started to understand the possibility of catastrophic risks coming from very powerful AIs,\u201d including the loss of control over superintelligent systems.\u00a0<\/p>\n<p>What changed was not a single breakthrough, but a line of thinking that led him to believe there is a path forward.<\/p>\n<p>\u201cBecause of the work I\u2019ve been doing at LawZero, especially since we created it, I\u2019m now very confident that it is possible to build AI systems that don\u2019t have hidden goals, hidden agendas,\u201d he says.\u00a0<\/p>\n<p>At the heart of that confidence is an idea Bengio calls \u201cScientist AI.\u201d Rather than racing to build ever-more-autonomous agents\u2014systems designed to book flights, write code, negotiate with other software, or replace human workers\u2014Bengio wants to do the opposite. His team is researching how to build AI that exists primarily to understand the world, not to act in it.<\/p>\n<p>A Scientist AI trained to give truthful answers<\/p>\n<p>A Scientist AI would be trained to give truthful answers based on transparent, probabilistic reasoning\u2014essentially using the scientific method or other reasoning grounded in formal logic to arrive at predictions. The AI system would not have goals of its own. And it would not optimize for user satisfaction or outcomes. It would not try to persuade, flatter, or please. And because it would have no goals, Bengio argues, it would be far less prone to manipulation, hidden agendas, or strategic deception.<\/p>\n<p>Today\u2019s frontier models are trained to pursue objectives\u2014to be helpful, effective, or engaging. But systems that optimize for outcomes can develop hidden objectives, learn to mislead users, or resist shutdown, said Bengio. In recent experiments, models have already shown early forms of self-preserving behavior. 
For instance, AI lab Anthropic famously found that its Claude AI model would, in some scenarios used to test its capabilities, attempt to blackmail the human engineers overseeing it to prevent itself from being shut down.<\/p>\n<p>In Bengio\u2019s methodology, the core model would have no agenda at all\u2014only the ability to make honest predictions about how the world works. In his vision, more capable systems can be safely built, audited and constrained on top of that \u201chonest,\u201d trusted foundation.\u00a0<\/p>\n<p>Such a system could accelerate scientific discovery, Bengio says. It could also serve as an independent layer of oversight for more powerful agentic AIs. But the approach stands in sharp contrast to the direction most frontier labs are taking. At the World Economic Forum in Davos last year, Bengio said companies were pouring resources into AI agents. \u201cThat\u2019s where they can make the fast buck,\u201d he said. The pressure to automate work and reduce costs, he added, is \u201cirresistible.\u201d<\/p>\n<p>He is not surprised by what has followed since then. \u201cI did expect the agentic capabilities of AI systems would progress,\u201d he says. \u201cThey have progressed in an exponential way.\u201d What worries him is that as these systems grow more autonomous, their behavior may become less predictable, less interpretable, and potentially far more dangerous.<\/p>\n<p>Preventing Bengio\u2019s new AI from becoming a \u201ctool of domination\u201d<\/p>\n<p>That is where governance enters the picture. Bengio does not believe a technical solution alone is sufficient. 
Even a safe methodology, he argues, could be misused \u201cin the wrong hands for political reasons.\u201d That is why LawZero is pairing its research agenda with a heavyweight board.<\/p>\n<p>\u201cWe\u2019re going to have difficult decisions to take that are not just technical,\u201d he says\u2014about who to collaborate with, how to share the work, and how to prevent it from becoming \u201ca tool of domination.\u201d The board, he says, is meant to help ensure that LawZero\u2019s mission remains grounded in democratic values and human rights.<\/p>\n<p>Bengio says he has spoken with leaders across the major AI labs, and many share his concerns. But, he adds, companies like OpenAI and Anthropic believe they must remain at the frontier to do anything positive with AI. Competitive pressure pushes them towards building ever more powerful AI systems\u2014and towards a self-image in which their work and their organizations are inherently beneficial.<\/p>\n<p>\u201cPsychologists call it motivated cognition,\u201d Bengio said. \u201cWe don\u2019t even allow certain thoughts to arise if they threaten who we think we are.\u201d That is how he experienced his AI research, he pointed out. \u201cUntil it kind of exploded in my face thinking about my children, whether they would have a future.\u201d\u00a0<\/p>\n<p>For an AI leader who once feared that advanced AI might be uncontrollable by design, Bengio\u2019s newfound hopefulness seems like a positive signal, though he admits that his take is not a common belief among those researchers and organizations focused on the potential catastrophic risks of AI.\u00a0<\/p>\n<p>But he does not back down from his belief that a technical solution does exist. 
\u201cI\u2019m more and more confident that it can be done in a reasonable number of years,\u201d he said, \u201cso that we might be able to actually have an impact before these guys get so powerful that their misalignment causes terrible problems.\u201d<\/p>\n","protected":false},"excerpt":{"rendered":"For the past several years, Yoshua Bengio, a professor at the Universit\u00e9 de Montr\u00e9al whose work helped lay&hellip;\n","protected":false},"author":2,"featured_media":372178,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[554,733,4308,11376,547,86,56,54,55],"class_list":{"0":"post-372177","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-machine-learning","12":"tag-research","13":"tag-technology","14":"tag-uk","15":"tag-united-kingdom","16":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/372177","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=372177"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/372177\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/372178"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=372177"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=372177"},{
"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=372177"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}