{"id":172032,"date":"2025-12-03T01:34:23","date_gmt":"2025-12-03T01:34:23","guid":{"rendered":"https:\/\/www.newsbeep.com\/ie\/172032\/"},"modified":"2025-12-03T01:34:23","modified_gmt":"2025-12-03T01:34:23","slug":"syntax-hacking-researchers-discover-sentence-structure-can-bypass-ai-safety-rules","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/ie\/172032\/","title":{"rendered":"Syntax hacking: Researchers discover sentence structure can bypass AI safety rules"},"content":{"rendered":"<p>Researchers from MIT, Northeastern University, and Meta recently <a href=\"https:\/\/arxiv.org\/abs\/2509.21155v2\" rel=\"nofollow noopener\" target=\"_blank\">released<\/a> a paper suggesting that large language models (LLMs) similar to those that power ChatGPT may sometimes prioritize sentence structure over meaning when answering questions. The findings reveal a weakness in how these models process instructions that may shed light on why some <a href=\"https:\/\/arstechnica.com\/information-technology\/2022\/09\/twitter-pranksters-derail-gpt-3-bot-with-newly-discovered-prompt-injection-hack\/\" rel=\"nofollow noopener\" target=\"_blank\">prompt injection<\/a> or <a href=\"https:\/\/arstechnica.com\/information-technology\/2023\/10\/sob-story-about-dead-grandma-tricks-microsoft-ai-into-solving-captcha\/\" rel=\"nofollow noopener\" target=\"_blank\">jailbreaking<\/a> approaches work, though the researchers caution their analysis of some production models remains speculative since training data details of prominent commercial AI models are not publicly available.<\/p>\n<p>The team, led by Chantal Shaib and Vinith M. Suriyakumar, tested this by asking models questions with preserved grammatical patterns but nonsensical words. 
For example, when prompted with \u201cQuickly sit Paris clouded?\u201d (mimicking the structure of \u201cWhere is Paris located?\u201d), models still answered \u201cFrance.\u201d<\/p>\n<p>This suggests models absorb both meaning and syntactic patterns, but can over-rely on structural shortcuts when those shortcuts correlate strongly with specific domains in training data, which sometimes allows patterns to override semantic understanding in edge cases. The team plans to present these findings at <a href=\"https:\/\/neurips.cc\/\" rel=\"nofollow noopener\" target=\"_blank\">NeurIPS<\/a> later this month.<\/p>\n<p>As a refresher, syntax describes sentence structure: how words are arranged grammatically and what parts of speech they use. Semantics describes the actual meaning those words convey, which can vary even when the grammatical structure stays the same.<\/p>\n<p>Semantics depends heavily on context, and navigating context is what makes LLMs work. The process of turning an input (your prompt) into an output (an LLM answer) involves a complex chain of pattern matching against encoded training data.<\/p>\n<p>To investigate when and how this pattern matching can go wrong, the researchers designed a controlled experiment. They created a <a href=\"https:\/\/github.com\/cshaib\/diversity\/\" rel=\"nofollow noopener\" target=\"_blank\">synthetic dataset<\/a> by designing prompts in which each subject area had a unique grammatical template based on part-of-speech patterns. For instance, geography questions followed one structural pattern while questions about creative works followed another. 
They then trained Allen AI\u2019s <a href=\"https:\/\/allenai.org\/olmo\" rel=\"nofollow noopener\" target=\"_blank\">Olmo models<\/a> on this data and tested whether the models could distinguish between syntax and semantics.<\/p>\n","protected":false},"excerpt":{"rendered":"Researchers from MIT, Northeastern University, and Meta recently released a paper suggesting that large language models (LLMs) similar&hellip;\n","protected":false},"author":2,"featured_media":172033,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[220,218,219,61,60,80],"class_list":{"0":"post-172032","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-ie","12":"tag-ireland","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts\/172032","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/comments?post=172032"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/posts\/172032\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/media\/172033"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/media?parent=172032"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/categories?post=172032"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/ie\/wp-json\/wp\/v2\/tags?pos
t=172032"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}