{"id":417607,"date":"2026-02-10T07:34:14","date_gmt":"2026-02-10T07:34:14","guid":{"rendered":"https:\/\/www.newsbeep.com\/uk\/417607\/"},"modified":"2026-02-10T07:34:14","modified_gmt":"2026-02-10T07:34:14","slug":"claude-opus-4-6-spends-20k-trying-to-write-a-c-compiler-the-register","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/uk\/417607\/","title":{"rendered":"Claude Opus 4.6 spends $20K trying to write a C compiler \u2022 The Register"},"content":{"rendered":"<p>An Anthropic researcher&#8217;s efforts to get its newly released Opus 4.6 model to build a C compiler left him &#8220;excited,&#8221; &#8220;concerned,&#8221; and &#8220;uneasy.&#8221;<\/p>\n<p>It also left many observers on GitHub skeptical, to say the least.<\/p>\n<p>Nicholas Carlini, a researcher on Anthropic&#8217;s Safeguards team, detailed the experiment with what he called &#8220;agent teams&#8221; <a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/www.anthropic.com\/engineering\/building-c-compiler\">in a blog<\/a> that coincided with the official release of Opus 4.6.<\/p>\n<p>He said he &#8220;tasked 16 agents with writing a Rust-based C compiler, from scratch, capable of compiling the Linux kernel. After nearly 2,000 Claude Code sessions and $20,000 in API costs, the agent team produced a 100,000-line compiler that can build Linux 6.9 on x86, ARM, and RISC-V.&#8221;<\/p>\n<p>With agent teams, he said, &#8220;multiple Claude instances work in parallel on a shared codebase without active human intervention.&#8221;<\/p>\n<p>One key task was getting round the need for &#8220;an operator to be online and available to work jointly,&#8221; which we presume means removing the need for Claude Code to wait for a human to tell it what to do next.<\/p>\n<p>&#8220;To elicit sustained, autonomous progress, I built a harness that sticks Claude in a simple loop&#8230; When it finishes one task, it immediately picks up the next.&#8221; Imagine if humans took that sort of approach.<\/p>\n<p>Carlini continued: &#8220;I leave it up to each Claude agent to decide how to act. In most cases, Claude picks up the &#8216;next most obvious&#8217; problem.&#8221; This threw up a number of lessons, including the need to &#8220;write extremely high quality tests.&#8221;<\/p>\n<p>Readers were also advised to &#8220;put yourself in Claude&#8217;s shoes.&#8221; That means the &#8220;test harness should not print thousands of useless bytes&#8221; to make it easier for Claude to find what it needs.<\/p>\n<p>Also, &#8220;Claude can&#8217;t tell time and, left alone, will happily spend hours running tests instead of making progress.&#8221;<\/p>\n<p>Which might make you feel working with Claude is closer to working with a regular human than you might have thought. But what was the upshot of all of this?<\/p>\n<p>&#8220;Over nearly 2,000 Claude Code sessions across two weeks, Opus 4.6 consumed 2 billion input tokens and generated 140 million output tokens, a total cost just under $20,000.&#8221;<\/p>\n<p>This made it &#8220;an extremely expensive project&#8221; compared to the priciest Claude Max plans, Carlini said. &#8220;But that total is a fraction of what it would cost me to produce this myself \u2013 let alone an entire team.&#8221;<\/p>\n<p>Other lessons? &#8220;The compiler successfully builds many projects, but not all. It&#8217;s not yet a drop-in replacement for a real compiler.&#8221; Moreover, &#8220;the generated code is not very efficient.&#8221;<\/p>\n<p>He added that the Rust code quality is &#8220;reasonable but&#8230; nowhere near the quality of what an expert Rust programmer might produce.&#8221;<\/p>\n<p>Carlini concluded: &#8220;Agent teams show the possibility of implementing entire, complex projects autonomously.&#8221;<\/p>\n<p>But as a former pen-tester, he said fully autonomous development posed real risks. &#8220;The thought of programmers deploying software they&#8217;ve never personally verified is a real concern.&#8221; Ultimately, the experiment &#8220;excites me, [but] also leaves me feeling uneasy.&#8221;<\/p>\n<p>Comments on GitHub were less equivocal, not least because they felt the $20K price tag ignored a few other elements, such as the vast amount of other programmers&#8217; code the model was trained on in the first place.<\/p>\n<p>As <a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/github.com\/anthropics\/claudes-c-compiler\/issues\/1#issuecomment-3869799573\">mohswell<\/a> put it: &#8220;If I went to the supermarket, stole a bit of every bread they had, and shoved it together, no one would say I made bread from scratch. They&#8217;d say I&#8217;m a thief. If this is &#8216;from scratch,&#8217; then my cooking is farm-to-table.&#8221;<\/p>\n<p>While <a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/github.com\/anthropics\/claudes-c-compiler\/issues\/1#issuecomment-3862135955\">Sambit003<\/a> opined: &#8220;The comment section and the issue itself is &#8216;absolute cinema&#8217; moment everyone living through\ud83d\ude02&#8230; the longer the AI generated codes I see&#8230; the safer I feel. \ud83d\ude02 Still we have the jobs (for long enough years)&#8230; just enjoy the overhyping bruh.&#8221;<\/p>\n<p><a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/github.com\/anthropics\/claudes-c-compiler\/issues\/1#issuecomment-3861663434\">Serkosal<\/a> added plaintively: &#8220;okay, nice, could @claude find gf for me? No? I&#8217;m not interested.&#8221; \u00ae<\/p>\n","protected":false},"excerpt":{"rendered":"An Anthropic researcher&#8217;s efforts to get its newly released Opus 4.6 model to build a C compiler left&hellip;\n","protected":false},"author":2,"featured_media":417608,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[554,733,4308,86,56,54,55],"class_list":{"0":"post-417607","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-technology","12":"tag-uk","13":"tag-united-kingdom","14":"tag-unitedkingdom"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/417607","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/comments?post=417607"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/posts\/417607\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media\/417608"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/media?parent=417607"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/categories?post=417607"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/uk\/wp-json\/wp\/v2\/tags?post=417607"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}