{"id":199763,"date":"2025-12-23T12:36:11","date_gmt":"2025-12-23T12:36:11","guid":{"rendered":"https:\/\/www.newsbeep.com\/il\/199763\/"},"modified":"2025-12-23T12:36:11","modified_gmt":"2025-12-23T12:36:11","slug":"an-ai-godfather-says-he-lies-to-ai-chatbots-to-get-better-responses","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/il\/199763\/","title":{"rendered":"An AI Godfather Says He Lies to AI Chatbots to Get Better Responses"},"content":{"rendered":"<p>Want to make your chatbot more honest with you? Try lying to it.<\/p>\n<p>In an <a target=\"_blank\" href=\"https:\/\/www.youtube.com\/watch?v=zQ1POHiR8m8\" data-track-click=\"{&quot;click_type&quot;:&quot;other&quot;,&quot;element_name&quot;:&quot;body_link&quot;,&quot;event&quot;:&quot;outbound_click&quot;}\" rel=\" nofollow noopener\">episode<\/a> of &#8220;The Diary of a CEO&#8221; that aired on December 18, research scientist Yoshua Bengio told the podcast&#8217;s host, Steven Bartlett, that he realized AI chatbots were useless at providing feedback on his research ideas because they always said positive things.<\/p>\n<p>&#8220;I wanted honest advice, honest feedback. 
But because it is sycophantic, it&#8217;s going to lie,&#8221; he said.<\/p>\n<p>Bengio said he switched strategies, deciding to lie to the chatbot by presenting his idea as a colleague&#8217;s, which produced more honest responses from the technology.<\/p>\n<p>&#8220;If it knows it&#8217;s me, it wants to please me,&#8221; he said.<\/p>\n<p>Bengio, a professor in the computer science and operations research department at the Universit\u00e9 de Montr\u00e9al, is known as one of the &#8220;AI godfathers,&#8221; alongside researchers Geoffrey Hinton and Yann LeCun. In June, he <a target=\"_blank\" href=\"https:\/\/yoshuabengio.org\/2025\/06\/03\/introducing-lawzero\/\" data-track-click=\"{&quot;click_type&quot;:&quot;other&quot;,&quot;element_name&quot;:&quot;body_link&quot;,&quot;event&quot;:&quot;outbound_click&quot;}\" rel=\" nofollow noopener\">announced<\/a> the launch of an AI safety research nonprofit, LawZero, which he said aims to reduce dangerous behaviors associated with frontier AI models, such as lying and cheating.<\/p>\n<p>&#8220;This sycophancy is a real example of misalignment. 
We don&#8217;t actually want these AIs to be like this,&#8221; he said on &#8220;The Diary of a CEO.&#8221; He also said that receiving positive feedback from AI could cause users to become emotionally attached to the technology, creating further problems.<\/p>\n<p>Other tech industry experts have also been sounding the alarm on AI being too much of a &#8220;<a target=\"_self\" href=\"https:\/\/www.businessinsider.com\/stop-letting-ai-be-your-yes-man-heres-how-2025-10\" data-track-click=\"{&quot;element_name&quot;:&quot;body_link&quot;,&quot;event&quot;:&quot;tout_click&quot;,&quot;index&quot;:&quot;bi_value_unassigned&quot;,&quot;product_field&quot;:&quot;bi_value_unassigned&quot;}\" rel=\"nofollow noopener\">yes man<\/a>.&#8221;<\/p>\n<p>In September 2025, Business Insider&#8217;s Katie Notopoulos reported that <a target=\"_self\" href=\"https:\/\/www.businessinsider.com\/reddit-aita-chatbots-chatgpt-jerks-sycophant-2025-9\" data-track-click=\"{&quot;element_name&quot;:&quot;body_link&quot;,&quot;event&quot;:&quot;tout_click&quot;,&quot;index&quot;:&quot;bi_value_unassigned&quot;,&quot;product_field&quot;:&quot;bi_value_unassigned&quot;}\" rel=\"nofollow noopener\">researchers at Stanford<\/a>, Carnegie Mellon, and the University of Oxford put confession posts from a Reddit page into chatbots to see how the technology would assess the behavior the posters had admitted to. They found that 42% of the time, AI gave the &#8220;wrong&#8221; answer, saying the person behind the post hadn&#8217;t behaved poorly, even though humans judging the posts had disagreed, Notopoulos wrote.<\/p>\n<p>AI companies have been outspoken about trying to reduce sycophancy in their models. 
Earlier this year, <a target=\"_self\" href=\"https:\/\/www.businessinsider.com\/chatgpt-changes-nice-openai-overly-complimentary-model-tweak-supportive-personality-2025-4\" data-track-click=\"{&quot;element_name&quot;:&quot;body_link&quot;,&quot;event&quot;:&quot;tout_click&quot;,&quot;index&quot;:&quot;bi_value_unassigned&quot;,&quot;product_field&quot;:&quot;bi_value_unassigned&quot;}\" rel=\"nofollow noopener\">OpenAI removed<\/a> an update to ChatGPT that it said caused the bot to provide &#8220;overly supportive but disingenuous&#8221; responses.<\/p>\n","protected":false},"excerpt":{"rendered":"Want to make your chatbot more honest with you? Try lying to it. In an episode of &#8220;The&hellip;\n","protected":false},"author":2,"featured_media":199764,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[345,343,344,85,46,125],"class_list":{"0":"post-199763","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-il","12":"tag-israel","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts\/199763","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/comments?post=199763"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts\/199763\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/media\/199764"}],"wp:attachment":[{"href":"https:\/\/www.newsbee
p.com\/il\/wp-json\/wp\/v2\/media?parent=199763"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/categories?post=199763"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/tags?post=199763"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}