{"id":177804,"date":"2025-12-10T13:33:09","date_gmt":"2025-12-10T13:33:09","guid":{"rendered":"https:\/\/www.newsbeep.com\/il\/177804\/"},"modified":"2025-12-10T13:33:09","modified_gmt":"2025-12-10T13:33:09","slug":"131-bending-over-backwards-the-quadratic-puts-the-u-in-ai","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/il\/177804\/","title":{"rendered":"[131] Bending Over Backwards: The Quadratic Puts the U in AI"},"content":{"rendered":"<p style=\"text-align: justify;\">For a recent journal club in Barcelona, we read a just published <a href=\"https:\/\/psycnet.apa.org\/doi\/10.1037\/xge0001838\" target=\"_blank\" rel=\"noopener nofollow\">article<\/a> in the Journal of Experimental Psychology: General (JEP:G). The paper is on the impact of using gen-AI on creativity. The paper proposes an inverted U: people are most creative with moderate levels of AI use.<\/p>\n<p style=\"text-align: left;\">The paper has three studies. Studies 1 &amp; 2 are experiments. This post is about Study 3, which is described as a &#8220;field study&#8221;. I will argue the U shape in that study is spurious.<\/p>\n<p style=\"text-align: left;\"> <a href=\"https:\/\/www.newsbeep.com\/il\/wp-content\/uploads\/2025\/12\/Screenshot-2025-12-02-104047.png\"><img decoding=\"async\" class=\"alignnone size-full wp-image-9545\" src=\"https:\/\/www.newsbeep.com\/il\/wp-content\/uploads\/2025\/12\/Screenshot-2025-12-02-104047.png\" alt=\"\" width=\"15\"\/><\/a> As good a place as any to let you know we are now cross listing on <a href=\"https:\/\/datacolada.substack.com\/\" target=\"_blank\" rel=\"noopener nofollow\">Substack<\/a><\/p>\n<p style=\"text-align: justify;\">Study 3.<br \/>Study 3 involves a two-wave survey run on CloudResearch (a pool of online participants). In the first wave, a sample of &#8220;creative workers&#8221; rate on 7-point scales how much they use AI for work. In the second stage, conducted a week later, each worker&#8217;s supervisor (!) 
evaluated the employee&#8217;s creativity with a nine-item scale.\u00a0<\/p>\n<p style=\"text-align: justify;\">In an analysis described as &#8216;pre-registered&#8217;, the paper reports results for a quadratic regression where the dependent variable is the supervisor&#8217;s creativity rating and the key predictors are the employee&#8217;s self-reported AI usage and its squared term. Both are significant. The paper says this &#8220;indicated a nonsymmetrical inverted-U-shaped relationship&#8221; (p.9), then shows Figure 6.<\/p>\n<p><a href=\"https:\/\/www.newsbeep.com\/il\/wp-content\/uploads\/2025\/12\/Figure-6-u-shape.png\"><img decoding=\"async\" class=\"alignnone size-full wp-image-9495\" src=\"https:\/\/www.newsbeep.com\/il\/wp-content\/uploads\/2025\/12\/Figure-6-u-shape.png\" alt=\"\" width=\"550\" border=\"1\"  \/><\/a><br \/>Fig 1. Reprint of Figure 6 in the JEP:G paper<\/p>\n<p style=\"text-align: justify;\">The paper says the graph implies that &#8220;the maximum value of creativity reaching 4.5. After this point, creativity should decline.&#8221;<\/p>\n<p style=\"text-align: justify;\">This inference, however, is not justified by the data.<br \/>At issue are two different types of error: sampling error and specification error.<\/p>\n<p style=\"text-align: justify;\">Sampling error<br \/>Just because the quadratic term is significant, it does not follow that the U-shape is significant. The most succinct way to make this point is with a figure: I simply added the confidence band to Figure 6 (I analyzed the posted data to compute that band).<\/p>\n<p><a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/Figure-1-Add-CI-band.svg\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" class=\"alignnone size-full wp-image-9498\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/Figure-1-Add-CI-band.svg\" alt=\"\" width=\"550\" border=\"1\"\/><\/a><br \/>Fig 2. 
Adding confidence band<br \/>R Code &amp; data to reproduce this and other figures: <a href=\"https:\/\/researchbox.org\/5159\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/researchbox.org\/5159<\/a><\/p>\n<p style=\"text-align: justify;\">There is a LOT of uncertainty in the estimated function, and the possibility that the association is monotonic, that more AI use is associated with more creativity, for all levels of AI use, is very far from rejected with the data.\u00a0<\/p>\n<p style=\"text-align: justify;\">Specification Error<br \/>Figure 6 in the paper, and my airbrushed-in confidence band, assume that the true association between AI Involvement and Creativity is perfectly captured by a quadratic function. That is, it estimates a quadratic regression and takes the resulting estimates at face value.<\/p>\n<p style=\"text-align: justify;\">In two blogposts and a published paper I have explained why this approach is invalid, and have proposed to instead test for U-shapes relying on a &#8220;two-lines&#8221; procedure.<\/p>\n<p>1. <a href=\"https:\/\/datacolada.org\/27\" rel=\"nofollow noopener\" target=\"_blank\">Colada[27]<\/a> Thirty-somethings are Shrinking and Other U-Shaped Challenges<br \/>2. <a href=\"https:\/\/datacolada.org\/62\" rel=\"nofollow noopener\" target=\"_blank\">Colada[62]<\/a> \u00a0Two-lines: The First Valid Test of U-Shaped Relationships<br \/>3. <a href=\"https:\/\/journals.sagepub.com\/doi\/full\/10.1177\/2515245918805755\" rel=\"nofollow noopener\" target=\"_blank\">Simonsohn (2018)<\/a>. Two lines: A valid alternative to the invalid testing of U-shaped relationships with quadratic regressions. AMPPS,\u00a01(4), 538-555<\/p>\n<p>With that two-lines test, this U-shape is not significant (p = .81). 
See figure below, generated with the (open source) <a href=\"https:\/\/webstimate.org\/twolines\" target=\"_blank\" rel=\"noopener nofollow\">online app<\/a>.<\/p>\n<p><a href=\"https:\/\/www.newsbeep.com\/il\/wp-content\/uploads\/2025\/12\/two-lines.png\"><img decoding=\"async\" class=\"alignnone size-full wp-image-9551\" src=\"https:\/\/www.newsbeep.com\/il\/wp-content\/uploads\/2025\/12\/two-lines.png\" alt=\"\" width=\"550\"  \/><\/a><br \/>Fig. 3. Results from two-lines test.<br \/>(dashed curve is based on a GAM; breakpoint set with &#8216;Robin Hood&#8217; procedure)<\/p>\n<p style=\"text-align: justify;\">This call to use two-lines tests instead of quadratic regressions has been successful enough that I will not rehash the two-lines arguments here.<\/p>\n<p style=\"text-align: justify;\">I will instead give a new intuition for what the quadratic does wrong.<\/p>\n<p style=\"text-align: justify;\">New intuition for problematic quadratic: effect here based on data over there<br \/>Let&#8217;s start where one should always start: plotting the distribution of the underlying data.<br \/><a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/Figure-2-Hist-of-AI-1.svg\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" class=\"alignnone size-full wp-image-9541\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/Figure-2-Hist-of-AI-1.svg\" alt=\"\" width=\"550\"\/><\/a><br \/>Fig 4. Distribution of key predictor<\/p>\n<p style=\"text-align: justify;\">We see that 38% of the data has AI values between [1 &amp; 2).<br \/>In contrast, less than 10% of the data is between [6 &amp; 7].<\/p>\n<p style=\"text-align: justify;\">This means that the (quadratic) regression, when fitting these data, will pay A LOT more attention to values in [1 &amp; 2) than to values in [6 &amp; 7]. OK. That&#8217;s intuitive enough. 
The next thing is less intuitive.<\/p>\n<p style=\"text-align: justify;\">When the quadratic regression produces that inverted U, a negative slope for values [6 &amp; 7], it is not because it is detecting a negative slope in that range; instead, it is because it is detecting a very positive slope in a different range, \u00a0[1\u00a0&amp; 2).<\/p>\n<p style=\"text-align: justify;\">The regression is literally bending over backwards to fit a steep curve between 1 &amp; 2 that then flattens out between 2 &amp; 5. It is bending over in the range between [5 &amp; 7], but only to accommodate the flattening of the curve at lower values.\u00a0<\/p>\n<p style=\"text-align: justify;\">More generally and precisely, the shape of any part of the fitted curve need not be an adequate or interpretable summary of the local data. <\/p>\n<p style=\"text-align: justify;\">A heartfelt statement by a quadratic regression<br \/>In an exclusive conversation with a quadratic regression, it shared with me this statement:<\/p>\n<p style=\"text-align: justify;\">Statement by a Quadratic Regression<br \/>Look man, my job is to get close to the data overall. If you give me data where lots of observations have a steep slope, I cannot afford to not pay attention to that, so I do; I will give you a steep slope back, no problem. I can then even deliver a flattening of the curve for nearby data. But I cannot deliver a plateau, cannot deliver curves that stay flat. Eventually I must go negative as I get further from this flattening section; will I produce a U? Yes. Do I care? No. I am not paid to get shapes right. I am paid to get close to data. A spurious U will get a few datapoints wrong, but I am getting a lot more datapoints right in return, so I go for it, I cut my losses. The U is a &#8216;you problem&#8217;; I don&#8217;t see shapes. If you care about shapes, that&#8217;s a deal-breaker. 
I should see other people, and you should see other tools.\u00a0<\/p>\n<p>I will show two more figures just to provide some intuition for this point.<\/p>\n<p style=\"text-align: justify;\">Without low values, no reversal for high values.<br \/>First, let&#8217;s rerun the quadratic regression in the paper, but excluding those observations with low values, between [1 &amp; 2). If the spurious U-shape is caused by them, we may expect the U-shape to go away, and it does. This highlights that when the quadratic reports a negative slope between 5 &amp; 7, it is not doing that based on data between 5 &amp; 7 (that data is still there); it is doing it based on data between 1 &amp; 2. <\/p>\n<p><a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/Figure-3-Not-less-than-2.svg\" rel=\"nofollow noopener\" target=\"_blank\"><img fetchpriority=\"high\" decoding=\"async\" class=\"alignnone size-full wp-image-9499\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/Figure-3-Not-less-than-2.svg\" alt=\"\" width=\"550\" height=\"432\"\/><\/a><br \/>Fig 5. Fitted values and confidence band from quadratic regression on observed data, dropping steep section<\/p>\n<p style=\"text-align: justify;\">Producing a U-shape there by changing data here<br \/>For a final figure I took the real AI values but I produced a fake dependent variable, so that I would have control over the true functional form. I generated three functions. All three are monotonic, more AI always produces more creativity, but I varied how disproportionately steep the function was between\u00a0 [1 &amp; 2], going from steep, to steeper, to steepest. 
<\/p>\n<p style=\"text-align: justify;\">The steepest function in the range between [1 &amp;2] produces a spurious U-shape for high-AI values.<br \/>The merely steep function does not.<\/p>\n<p><a href=\"https:\/\/datacolada.org\/wp-content\/uploads\/Last-Steepr-positive-steepr-negative-1.svg\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" class=\"alignnone size-full wp-image-9537\" src=\"https:\/\/datacolada.org\/wp-content\/uploads\/Last-Steepr-positive-steepr-negative-1.svg\" alt=\"\" width=\"900\"\/><\/a><br \/>Fig 6. The negative slope in the quadratic can result from a steep positive slope far away in the data<br \/>R Code &amp; data to reproduce this and other figures: <a href=\"https:\/\/researchbox.org\/5159\" rel=\"nofollow noopener\" target=\"_blank\">https:\/\/researchbox.org\/5159<\/a><\/p>\n<p style=\"text-align: justify;\">Feedback policy<br \/>Our policy is to contact authors whose work we cover to receive feedback before posting. I first contacted the author a bit over two weeks ago. We had a cordial and constructive exchange, where he clarified some of the details I thought peer-reviewers at JEP:G should have made sure were included in the paper (namely, how the sample was constructed and what the response rates were). The author provided feedback on wording that I tried to incorporate. 
He noted that his analysis, a quadratic regression to test for U-shapes, is mainstream in his field and mentioned a 2013 paper by an influential researcher, published in Psych Science, that he had used as a benchmark (<a href=\"https:\/\/web.archive.org\/web\/20250226234015\/https:\/\/faculty.wharton.upenn.edu\/wp-content\/uploads\/2013\/06\/Grant_PsychScience2013.pdf\" target=\"_blank\" rel=\"noopener nofollow\">htm<\/a>).\u00a0<\/p>\n<p>\u00a0<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-376\" src=\"https:\/\/www.newsbeep.com\/il\/wp-content\/uploads\/2025\/12\/Wide-logo-300x145.jpg\" alt=\"Wide logo\" width=\"78\" height=\"38\"  \/><\/p>\n","protected":false},"excerpt":{"rendered":"For a recent journal club in Barcelona, we read a just published article in the Journal of Experimental&hellip;\n","protected":false},"author":2,"featured_media":177805,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[345,343,344,85,46,125],"class_list":{"0":"post-177804","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-il","12":"tag-israel","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts\/177804","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/comments?post=177804"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts\/177804\/re
visions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/media\/177805"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/media?parent=177804"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/categories?post=177804"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/tags?post=177804"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}