{"id":284181,"date":"2025-11-10T23:41:08","date_gmt":"2025-11-10T23:41:08","guid":{"rendered":"https:\/\/www.newsbeep.com\/us\/284181\/"},"modified":"2025-11-10T23:41:08","modified_gmt":"2025-11-10T23:41:08","slug":"researchers-isolate-memorization-from-reasoning-in-ai-neural-networks","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/us\/284181\/","title":{"rendered":"Researchers isolate memorization from reasoning in AI neural networks"},"content":{"rendered":"<p>Looking ahead, if the information removal techniques receive further development in the future, AI companies could potentially one day remove, say, copyrighted content, private information, or harmful memorized text from a neural network without destroying the model\u2019s ability to perform transformative tasks. However, since neural networks store information in distributed ways that are still not completely understood, for the time being, the researchers say their method \u201ccannot guarantee complete elimination of sensitive information.\u201d These are early steps in a new research direction for AI.<\/p>\n<p>Traveling the neural landscape<\/p>\n<p>To understand how researchers from Goodfire distinguished memorization from reasoning in these neural networks, it helps to know about a concept in AI called the \u201closs landscape.\u201d The \u201closs landscape\u201d is a way of visualizing how wrong or right an AI model\u2019s predictions are as you adjust its internal settings (which are called \u201cweights\u201d).<\/p>\n<p>Imagine you\u2019re tuning a complex machine with millions of dials. The \u201closs\u201d measures the number of mistakes the machine makes. High loss means many errors, low loss means few errors. The \u201clandscape\u201d is what you\u2019d see if you could map out the error rate for every possible combination of dial settings.<\/p>\n<p>During training, AI models essentially \u201croll downhill\u201d in this landscape (<a href=\"https:\/\/en.wikipedia.org\/wiki\/Gradient_descent\" rel=\"nofollow noopener\" target=\"_blank\">gradient descent<\/a>), adjusting their weights to find the valleys where they make the fewest mistakes. This process provides AI model outputs, like answers to questions.<\/p>\n<p>                        <img width=\"1024\" height=\"705\" src=\"https:\/\/www.newsbeep.com\/us\/wp-content\/uploads\/2025\/11\/fig1_curve-1024x705.png\" class=\"center large\" alt=\"Figure 1: Overview of our approach. We collect activations and gradients from a sample of training data (a), which allows us to approximate loss curvature w.r.t. a weight matrix using K-FAC (b). We decompose these weight matrices into components (each the same size as the matrix), ordered from high to low curvature. In language models, we show that data from different tasks interacts with parts of the spectrum of components differently (c).\" decoding=\"async\" loading=\"lazy\"  \/><\/p>\n<p>      Figure 1 from the paper \u201cFrom Memorization to Reasoning in the Spectrum of Loss Curvature.\u201d<\/p>\n<p>          Credit:<\/p>\n<p>                      <a class=\"caption-credit-link text-gray-400 no-underline hover:text-gray-500\" href=\"https:\/\/arxiv.org\/pdf\/2510.24256\" target=\"_blank\" rel=\"nofollow noopener\"><\/p>\n<p>          Merullo et al.<\/p>\n<p>                      <\/a><\/p>\n<p>The researchers analyzed the \u201ccurvature\u201d of the loss landscapes of particular AI language models, measuring how sensitive the model\u2019s performance is to small changes in different neural network weights. Sharp peaks and valleys represent high curvature (where tiny changes cause big effects), while flat plains represent low curvature (where changes have minimal impact).<\/p>\n<p>Using a technique called <a href=\"https:\/\/arxiv.org\/abs\/1503.05671\" rel=\"nofollow noopener\" target=\"_blank\">K-FAC<\/a> (Kronecker-Factored Approximate Curvature), they found that individual memorized facts create sharp spikes in this landscape, but because each memorized item spikes in a different direction, when averaged together they create a flat profile. Meanwhile, reasoning abilities that many different inputs rely on maintain consistent moderate curves across the landscape, like rolling hills that remain roughly the same shape regardless of the direction from which you approach them.<\/p>\n","protected":false},"excerpt":{"rendered":"Looking ahead, if the information removal techniques receive further development in the future, AI companies could potentially one&hellip;\n","protected":false},"author":2,"featured_media":284182,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[45],"tags":[182,181,507,74],"class_list":{"0":"post-284181","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/284181","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/comments?post=284181"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/284181\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media\/284182"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media?parent=284181"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/categories?post=284181"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/tags?post=284181"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}