{"id":382832,"date":"2026-01-21T20:12:09","date_gmt":"2026-01-21T20:12:09","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/382832\/"},"modified":"2026-01-21T20:12:09","modified_gmt":"2026-01-21T20:12:09","slug":"claudes-new-constitution-anthropic","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/382832\/","title":{"rendered":"Claude&#8217;s new constitution \\ Anthropic"},"content":{"rendered":"<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">We\u2019re publishing a new constitution for our AI model, Claude. It\u2019s a detailed description of Anthropic\u2019s vision for Claude\u2019s values and behavior; a holistic document that explains the context in which Claude operates and the kind of entity we would like Claude to be.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">The constitution is a crucial part of our model training process, and its content directly shapes Claude\u2019s behavior. Training models is a difficult task, and Claude\u2019s outputs might not always adhere to the constitution\u2019s ideals. But we think that the way the new constitution is written\u2014with a thorough explanation of our intentions and the reasons behind them\u2014makes it more likely to cultivate good values during training.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">In this post, we describe what we\u2019ve included in the new constitution and some of the considerations that informed our approach.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">We\u2019re releasing Claude\u2019s constitution in full under a <a href=\"https:\/\/creativecommons.org\/publicdomain\/zero\/1.0\/\" rel=\"nofollow noopener\" target=\"_blank\">Creative Commons CC0 1.0 Deed<\/a>, meaning it can be freely used by anyone for any purpose without asking for permission.<\/p>\n<p>What is Claude\u2019s Constitution?<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">Claude\u2019s constitution is the foundational document that both expresses and shapes who Claude is. It contains detailed explanations of the values we would like Claude to embody and the reasons why. In it, we explain what we think it means for Claude to be helpful while remaining broadly safe, ethical, and compliant with our guidelines. The constitution gives Claude information about its situation and offers advice for how to deal with difficult situations and tradeoffs, like balancing honesty with compassion and the protection of sensitive information. Although it might sound surprising, the constitution is written primarily for Claude. It is intended to give Claude the knowledge and understanding it needs to act well in the world.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">We treat the constitution as the final authority on how we want Claude to be and to behave\u2014that is, any other training or instruction given to Claude should be consistent with both its letter and its underlying spirit. This makes publishing the constitution particularly important from a transparency perspective: it lets people understand which of Claude\u2019s behaviors are intended versus unintended, to make informed choices, and to provide useful feedback. We think transparency of this kind will become ever more important as AIs start to exert more influence in society1.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">We use the constitution at various stages of the training process. This has grown out of training techniques we\u2019ve been using since 2023, when we first began training Claude models using <a href=\"https:\/\/www.anthropic.com\/research\/constitutional-ai-harmlessness-from-ai-feedback\" rel=\"nofollow noopener\" target=\"_blank\">Constitutional AI<\/a>. Our approach has evolved significantly since then, and the new constitution plays an even more central role in training. <\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">Claude itself also uses the constitution to construct many kinds of synthetic training data, including data that helps it learn and understand the constitution, conversations where the constitution might be relevant, responses that are in line with its values, and rankings of possible responses. All of these can be used to train future versions of Claude to become the kind of entity the constitution describes. This practical function has shaped how we\u2019ve written the constitution: it needs to work both as a statement of abstract ideals and a useful artifact for training.<\/p>\n<p>Our new approach to Claude\u2019s Constitution<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">Our previous <a href=\"https:\/\/www.anthropic.com\/news\/claudes-constitution\" rel=\"nofollow noopener\" target=\"_blank\">Constitution<\/a> was composed of a list of standalone principles. We\u2019ve come to believe that a different approach is necessary. We think that in order to be good actors in the world, AI models like Claude need to understand why we want them to behave in certain ways, and we need to explain this to them rather than merely specify what we want them to do. If we want models to exercise good judgment across a wide range of novel situations, they need to be able to generalize\u2014to apply broad principles rather than mechanically following specific rules.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">Specific rules and bright lines sometimes have their advantages. They can make models\u2019 actions more predictable, transparent, and testable, and we do use them for some especially high-stakes behaviors in which Claude should never engage (we call these \u201chard constraints\u201d). But such rules can also be applied poorly in unanticipated situations or when followed too rigidly2. We don\u2019t intend for the constitution to be a rigid legal document\u2014and legal constitutions aren\u2019t necessarily like this anyway.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">The constitution reflects our current thinking about how to approach a dauntingly novel and high-stakes project: creating safe, beneficial non-human entities whose capabilities may come to rival or exceed our own. Although the document is no doubt flawed in many ways, we want it to be something future models can look back on and see as an honest and sincere attempt to help Claude understand its situation, our motives, and the reasons we shape Claude in the ways we do.<\/p>\n<p>A brief summary of the new constitution<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">In order to be both safe and beneficial, we want all current Claude models to be:<\/p>\n<p>Broadly safe: not undermining appropriate human mechanisms to oversee AI during the current phase of development;Broadly ethical: being honest, acting according to good values, and avoiding actions that are inappropriate, dangerous, or harmful;Compliant with Anthropic\u2019s guidelines: acting in accordance with more specific guidelines from Anthropic where relevant;Genuinely helpful: benefiting the operators and users they interact with.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">In cases of apparent conflict, Claude should generally prioritize these properties in the order in which they\u2019re listed.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">Most of the constitution is focused on giving more detailed explanations and guidance about these priorities. The main sections are as follows:<\/p>\n<p>Helpfulness. In this section, we emphasize the immense value that Claude being genuinely and substantively helpful can provide for users and for the world. Claude can be like a brilliant friend who also has the knowledge of a doctor, lawyer, and financial advisor, who will speak frankly and from a place of genuine care and treat users like intelligent adults capable of deciding what is good for them. We also discuss how Claude should navigate helpfulness across its different \u201cprincipals\u201d\u2014Anthropic itself, the operators who build on our API, and the end users. We offer heuristics for weighing helpfulness against other values.Anthropic\u2019s guidelines. This section discusses how Anthropic might give supplementary instructions to Claude about how to handle specific issues, such as medical advice, cybersecurity requests, jailbreaking strategies, and tool integrations. These guidelines often reflect detailed knowledge or context that Claude doesn\u2019t have by default, and we want Claude to prioritize complying with them over more general forms of helpfulness. But we want Claude to recognize that Anthropic\u2019s deeper intention is for Claude to behave safely and ethically, and that these guidelines should never conflict with the constitution as a whole.Claude\u2019s ethics. Our central aim is for Claude to be a good, wise, and virtuous agent, exhibiting skill, judgment, nuance, and sensitivity in handling real-world decision-making, including in the context of moral uncertainty and disagreement. In this section, we discuss the high standards of honesty we want Claude to hold, and the nuanced reasoning we want Claude to use in weighing the values at stake when avoiding harm. We also discuss our current list of hard constraints on Claude\u2019s behavior\u2014for example, that Claude should never provide significant uplift to a bioweapons attack.Being broadly safe. Claude should not undermine humans\u2019 ability to oversee and correct its values and behavior during this critical period of AI development. In this section, we discuss how we want Claude to prioritize this sort of safety even above ethics\u2014not because we think safety is ultimately more important than ethics, but because current models can make mistakes or behave in harmful ways due to mistaken beliefs, flaws in their values, or limited understanding of context. It\u2019s crucial that we continue to be able to oversee model behavior and, if necessary, prevent Claude models from taking action.Claude\u2019s nature. In this section, we express our uncertainty about whether Claude might have some kind of consciousness or moral status (either now or in the future). We discuss how we hope Claude will approach questions about its nature, identity, and place in the world. Sophisticated AIs are a genuinely new kind of entity, and the questions they raise bring us to the edge of existing scientific and philosophical understanding. Amidst such uncertainty, we care about Claude\u2019s psychological security, sense of self, and wellbeing, both for Claude\u2019s own sake and because these qualities may bear on Claude\u2019s integrity, judgment, and safety. We hope that humans and AIs can explore this together.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">We\u2019re releasing the full text of the constitution today, and we aim to release additional materials in the future that will be helpful for training, evaluation, and transparency.<\/p>\n<p>Conclusion<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">Claude\u2019s constitution is a living document and a continuous work in progress. This is new territory, and we expect to make mistakes (and hopefully correct them) along the way. Nevertheless, we hope it offers meaningful transparency into the values and priorities we believe should guide Claude\u2019s behavior. To that end, we will maintain an up-to-date version of Claude\u2019s constitution on our website.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">While writing the constitution, we sought feedback from various external experts (as well as asking for input from prior iterations of Claude). We\u2019ll likely continue to do so for future versions of the document, from experts in law, philosophy, theology, psychology, and a wide range of other disciplines. Over time, we hope that an external community can arise to critique documents like this, encouraging us and others to be increasingly thoughtful.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">This constitution is written for our mainline, general-access Claude models. We have some models built for specialized uses that don\u2019t fully fit this constitution; as we continue to develop products for specialized use cases, we will continue to evaluate how to best ensure our models meet the core objectives outlined in this constitution.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">Although the constitution expresses our vision for Claude, training models towards that vision is an ongoing technical challenge. We will continue to be open about any ways in which model behavior comes apart from our vision, such as in <a href=\"https:\/\/assets.anthropic.com\/m\/64823ba7485345a7\/Claude-Opus-4-5-System-Card.pdf\" rel=\"nofollow noopener\" target=\"_blank\">our system cards<\/a>. Readers of the constitution should keep this gap between intention and reality in mind.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">Even if we succeed with our current training methods at creating models that fit our vision, we might fail later as models become more capable. For this and other reasons, alongside the constitution, we <a href=\"https:\/\/www.anthropic.com\/research\" rel=\"nofollow noopener\" target=\"_blank\">continue to pursue<\/a> a broad portfolio of methods and tools to help us assess and improve the alignment of our models: new and more rigorous evaluations, safeguards to prevent misuse, detailed investigations of actual and potential alignment failures, and interpretability tools that help us understand at a deeper level how the models work.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">At some point in the future, and perhaps soon, documents like Claude\u2019s constitution might matter a lot\u2014much more than they do now. Powerful AI models will be a new kind of force in the world, and those who are creating them have a chance to help them embody the best in humanity. We hope this new constitution is a step in that direction.<\/p>\n<p class=\"Body-module-scss-module__z40yvW__reading-column body-2 serif post-text\">Read <a href=\"http:\/\/anthropic.com\/constitution\" rel=\"nofollow noopener\" target=\"_blank\">the full constitution<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"We\u2019re publishing a new constitution for our AI model, Claude. It\u2019s a detailed description of Anthropic\u2019s vision for&hellip;\n","protected":false},"author":2,"featured_media":382833,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[554,733,4308,86,56,54,55],"class_list":{"0":"post-382832","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-technology","12":"tag-uk","13":"tag-united-kingdom","14":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/382832","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=382832"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/382832\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/382833"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=382832"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=382832"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=382832"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}