{"id":333087,"date":"2026-03-07T03:03:10","date_gmt":"2026-03-07T03:03:10","guid":{"rendered":"https:\/\/www.newsbeep.com\/ie\/333087\/"},"modified":"2026-03-07T03:03:10","modified_gmt":"2026-03-07T03:03:10","slug":"apple-tested-whether-ai-could-improve-app-store-search-results","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ie\/333087\/","title":{"rendered":"Apple tested whether AI could improve App Store search results"},"content":{"rendered":"<p>\t<img width=\"1600\" height=\"800\" src=\"https:\/\/www.newsbeep.com\/ie\/wp-content\/uploads\/2026\/03\/apple-intelligence-search.jpg\" class=\"skip-lazy wp-post-image\" alt=\"\"  decoding=\"async\" fetchpriority=\"high\"\/><\/p>\n<p><a href=\"https:\/\/9to5mac.com\/guides\/apple-research\/\" rel=\"nofollow noopener\" target=\"_blank\">Apple researchers<\/a> ran an A\/B test to measure how AI-generated relevance labels would affect App Store search rankings and app downloads. Here\u2019s what they found.<\/p>\n<p>AI-generated relevance labels slightly improved App Store search conversions<\/p>\n<p>In a new study titled <a href=\"https:\/\/machinelearning.apple.com\/research\/augmenting-app\" rel=\"nofollow noopener\" target=\"_blank\">Scaling Search Relevance: Augmenting App Store Ranking with LLM-Generated Judgments<\/a>, a group of Apple researchers explored whether LLMs could help improve App Store search results by generating the relevance labels used to train the ranking system.<\/p>\n<p>As the study explains, relevance is obviously key to helping users find the apps they\u2019re looking for. And while there are many signals that can contribute to search ranking, the researchers focused on two main ones:<\/p>\n<p>Behavioral relevance, which reflects how users interact with results, such as whether they tap on or download an app.<\/p>\n<p>Textual relevance, which measures how well an app\u2019s metadata (like its name, description, and keywords) semantically matches a user\u2019s search query.<\/p>\n<p>In the study, the researchers say that while there is plenty of available data regarding behavioral relevance (since that can be easily measured), the same isn\u2019t true for textual relevance:<\/p>\n<p>While behavioral relevance labels are abundant, textual relevance labels generated by human judges are much rarer. This creates a fundamental problem: high-quality textual relevance labels are scarce and expensive to produce, creating a scalability bottleneck and leaving the textual relevance objective under-powered in multi-objective training.<\/p>\n<p>To tackle this problem, the researchers fine-tuned a 3-billion-parameter LLM on existing human judgments so it could learn to assign relevance labels to apps based on a user\u2019s search query and the app\u2019s metadata.<\/p>\n<p>Next, they generated millions of new relevance labels with that model, and retrained the App Store ranking system using both the original data, and the LLM-generated labels.<\/p>\n<p>Once that was done, they made an offline evaluation, followed by a worldwide A\/B test on live App Store traffic:<\/p>\n<p>\u201c(\u2026) the llm-augmented model demonstrated a statistically significant +0.24% increase in our primary metric, conversion rate, defined as the proportion of search sessions with at least one app download. While this number may appear small, it is considered a significant improvement for a mature industrial ranker. This gain was observed in 89% of storefronts.\u201d<\/p>\n<p>In other words, users who saw the search results ranked using the LLM-augmented model downloaded at least one app 0.24% more often than users who saw the search results presented by the traditional ranking model.<\/p>\n<p>And while 0.24% is obviously a very small increase, it scales rather quickly when we consider that most estimates peg total App Store downloads in 2025 at around 38 billion. In practice, that could translate to dozens of millions of additional downloads from App Store searches, which developers would surely appreciate.<\/p>\n<p>To read the full study, <a href=\"https:\/\/machinelearning.apple.com\/research\/augmenting-app\" rel=\"nofollow noopener\" target=\"_blank\">follow this link<\/a>.<\/p>\n<p>Accessory deals on Amazon<\/p>\n<p>\t\t<a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/google.com\/preferences\/source?q=https:\/\/9to5mac.com\" aria-label=\"Add 9to5Mac as a preferred source on Google\"><br \/>\n\t\t\t<img decoding=\"async\" class=\"google-preferred-source-badge-dark\" src=\"https:\/\/www.newsbeep.com\/ie\/wp-content\/uploads\/2025\/09\/1757113987_717_google-preferred-source-badge-dark.png\" alt=\"Add 9to5Mac as a preferred source on Google\"\/><br \/>\n\t\t\t<img decoding=\"async\" class=\"google-preferred-source-badge-light\" src=\"https:\/\/www.newsbeep.com\/ie\/wp-content\/uploads\/2025\/09\/1757113987_373_google-preferred-source-badge-light.png\" alt=\"Add 9to5Mac as a preferred source on Google\"\/><br \/>\n\t\t<\/a><\/p>\n<p class=\"disclaimer-affiliate\">FTC: We use income earning auto affiliate links. <a href=\"https:\/\/9to5mac.com\/about\/#affiliate\" rel=\"nofollow noopener\" target=\"_blank\">More.<\/a><\/p>\n<p><a href=\"https:\/\/benqurl.biz\/3WZyqpL\" rel=\"nofollow noopener\" target=\"_blank\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-1033835\" src=\"https:\/\/www.newsbeep.com\/ie\/wp-content\/uploads\/2026\/03\/1772852590_357_native-banner_750_150.jpg\" alt=\"\" width=\"750\" height=\"150\"\/><\/a>\t\t\t\t<\/p>\n","protected":false},"excerpt":{"rendered":"Apple researchers ran an A\/B test to measure how AI-generated relevance labels would affect App Store search rankings&hellip;\n","protected":false},"author":2,"featured_media":333088,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[61,60,80],"class_list":{"0":"post-333087","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-technology","8":"tag-ie","9":"tag-ireland","10":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts\/333087","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/comments?post=333087"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts\/333087\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/media\/333088"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/media?parent=333087"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/categories?post=333087"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/tags?post=333087"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}