{"id":384895,"date":"2026-01-01T14:51:14","date_gmt":"2026-01-01T14:51:14","guid":{"rendered":"https:\/\/www.newsbeep.com\/au\/384895\/"},"modified":"2026-01-01T14:51:14","modified_gmt":"2026-01-01T14:51:14","slug":"huaweis-ascend-and-kunpeng-progress-shows-how-china-is-rebuilding-an-ai-compute-stack-under-sanctions","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/au\/384895\/","title":{"rendered":"Huawei\u2019s Ascend and Kunpeng progress shows how China is rebuilding an AI compute stack under sanctions"},"content":{"rendered":"<p id=\"7d3cc71b-5e5d-4fd4-80f0-1b19615f4463\">Huawei <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.scmp.com\/tech\/big-tech\/article\/3338109\/huawei-hails-ascend-ai-ecosystem-new-year-message-atlas-900-supernode-rolls-out?module=top_story&amp;pgtype=section\" target=\"_blank\" data-url=\"https:\/\/www.scmp.com\/tech\/big-tech\/article\/3338109\/huawei-hails-ascend-ai-ecosystem-new-year-message-atlas-900-supernode-rolls-out?module=top_story&amp;pgtype=section\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" data-mrf-recirculation=\"inline-link\" rel=\"nofollow noopener\">used its New Year message to highlight progress<\/a> across its Ascend AI and Kunpeng CPU ecosystems, pointing to the rollout of Atlas 900 supernodes and rapid growth in domestic developer adoption as &#8220;a solid foundation for computing.&#8221; The message arrives as China continues to accelerate efforts to replace Western hardware in critical AI workloads, and as Huawei positions itself as the closest thing the country has to a vertically integrated AI compute vendor.<\/p>\n<p>Huawei\u2019s message offers a snapshot of a strategy that has been unfolding for several years, shaped by U.S. export controls, constrained access to leading-edge manufacturing, and a <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/tech-industry\/semiconductors\/china-tells-chipmakers-to-use-homegrown-chipmaking-tools-for-50-percent-of-new-capacity-decree-designed-to-squeeze-foreign-suppliers-out-of-supply-chain\" data-mrf-recirculation=\"inline-link\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/tech-industry\/semiconductors\/china-tells-chipmakers-to-use-homegrown-chipmaking-tools-for-50-percent-of-new-capacity-decree-designed-to-squeeze-foreign-suppliers-out-of-supply-chain\" rel=\"nofollow noopener\" target=\"_blank\">domestic market increasingly mandated to adopt local silicon<\/a>. Under those conditions, Huawei\u2019s Ascend and Kunpeng platforms have evolved into something distinct from their Western counterparts: less focused on single-chip supremacy and more on building large, tightly coupled systems that compensate for weaker nodes with scale, networking, and software control.<\/p>\n<p><a id=\"elk-d05fdb6f-a30b-44e6-8690-66932fa1f340\" class=\"paywall\" aria-hidden=\"true\" data-url=\"\" href=\"\" target=\"_blank\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\"\/>Ascend\u2019s architecture and the limits of the node<a id=\"elk-seasonal\" class=\"paywall\" aria-hidden=\"true\" data-url=\"\" href=\"\" target=\"_blank\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\"\/><\/p>\n<p id=\"9645b060-a1f6-42d7-b79c-1437f2517964-0\">At the center of Huawei\u2019s AI effort is Ascend, built around its proprietary Da Vinci architecture. The original Ascend 910, introduced in 2019, was <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/news\/huawei-risc-v-ai-processors-ascend-us,40238.html\" data-mrf-recirculation=\"inline-link\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/news\/huawei-risc-v-ai-processors-ascend-us,40238.html\" rel=\"nofollow noopener\" target=\"_blank\">manufactured on TSMC\u2019s 7nm process<\/a> and delivered roughly 256 TFLOPS of FP16 performance at a quoted 350W. That put it in the same broad class as Nvidia\u2019s Volta-era accelerators, though without the same software ecosystem or interconnect maturity.<\/p>\n<p>You may like<\/p>\n<p>Sanctions that came in the years following Ascend\u2019s launch significantly changed the playing field, forcing subsequent Ascend generators onto SMIC\u2019s N+1 and N+2 processes, which are roughly comparable to older 7nm-class nodes without EUV. <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/tech-industry\/semiconductors\/huaweis-ascend-ai-chip-ecosystem-scales\" data-mrf-recirculation=\"inline-link\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/tech-industry\/semiconductors\/huaweis-ascend-ai-chip-ecosystem-scales\" rel=\"nofollow noopener\" target=\"_blank\">The Ascend 910C<\/a>, now the backbone of Huawei\u2019s latest clusters, is a dual-die package with two large chiplets combined into a single accelerator card. On paper, Huawei claims up to 780 TFLOPS of BF16 compute, but die area and power efficiency tell a more complicated story.<\/p>\n<p>Huawei suggests the 910C\u2019s combined silicon footprint is around 60% larger than Nvidia\u2019s H100, with lower performance per square millimeter and per watt. In isolation, that would be a losing proposition, but Huawei has leaned hard on interconnects and clustering. The company uses a proprietary high-speed fabric alongside standard PCIe and RoCE networking to bind hundreds or thousands of Ascend accelerators into a single logical training or inference system.<\/p>\n<p>This approach is evident in <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/tech-industry\/semiconductors\/huaweis-ascend-ai-chip-ecosystem-scales\" data-mrf-recirculation=\"inline-link\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/tech-industry\/semiconductors\/huaweis-ascend-ai-chip-ecosystem-scales\" rel=\"nofollow noopener\" target=\"_blank\">Huawei\u2019s claims around Atlas 900 and CloudMatrix systems<\/a>. Rather than competing card-for-card with Nvidia\u2019s H100 or AMD\u2019s MI300X, Huawei emphasizes aggregate throughput. A CloudMatrix 384 system, linking 384 Ascend 910C accelerators, has been positioned as competitive with Nvidia\u2019s large NVLink-based pods on selected workloads, particularly inference. But there\u2019s a trade-off here in terms of physical scale: where Nvidia can deliver multi-exaflop-class FP4 performance in a handful of racks, Huawei requires an order of magnitude more floor space, power delivery, and cooling.<\/p>\n<p>Inference is where Ascend looks strongest, and reports out of China indicate that 910C delivers roughly 60% of H100-class performance on inference tasks, <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/tech-industry\/artificial-intelligence\/deepseek-research-suggests-huaweis-ascend-910c-delivers-60-percent-nvidia-h100-inference-performance\" data-mrf-recirculation=\"inline-link\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/tech-industry\/artificial-intelligence\/deepseek-research-suggests-huaweis-ascend-910c-delivers-60-percent-nvidia-h100-inference-performance\" rel=\"nofollow noopener\" target=\"_blank\">but training remains more challenging<\/a>.<\/p>\n<p><a id=\"elk-5bd70c89-c714-4433-b53c-25f8595b4480\" class=\"paywall\" aria-hidden=\"true\" data-url=\"\" href=\"\" target=\"_blank\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\"\/>Scaling out as a design philosophy<\/p>\n<p id=\"057d9d60-01a9-4ff1-9901-2d2d604b1c37\">As for the Atlas 900 supernode, highlighted in Huawei\u2019s New Year message, it is probably best viewed as a piece of architectural showmanship rather than a product that\u2019s likely to come to the Chinese market any time soon. It reflects Huawei\u2019s belief that AI compute can be industrialized through standardized clusters built from domestically controlled components, even if each component lags the global leading-edge.<\/p>\n<p>This is where Huawei\u2019s background in telecom networking comes into play, though. The company has decades of experience building carrier-grade systems that prioritize reliability, deterministic performance, and large-scale orchestration. Ascend clusters apply that mindset to AI, with the emphasis on predictable scaling behavior and integration with Huawei\u2019s own AI frameworks rather than leading <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/tag\/benchmark\" data-auto-tag-linker=\"true\" data-mrf-recirculation=\"inline-link\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/tag\/benchmark\" rel=\"nofollow noopener\" target=\"_blank\">benchmarks<\/a>.<\/p>\n<p>That also explains why Huawei describes the supernode technology as a &#8220;more readily accessible&#8221; technology for forming a &#8220;solid AI computing backbone.&#8221; Huawei is not pitching Ascend as a drop-in replacement for CUDA, but an alternative stack, from silicon to interconnect to compiler, that customers adopt wholesale. That\u2019s something that could be attractive to Chinese cloud providers that are facing up to some pretty harsh procurement and compliance realities in the face of export restrictions and geopolitical uncertainty.<\/p>\n<p>You may like<\/p>\n<p><a id=\"elk-95b55d00-e1d3-4017-a78d-2fcb8c9316b9\" class=\"paywall\" aria-hidden=\"true\" data-url=\"\" href=\"\" target=\"_blank\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\"\/>Kunpeng and the supporting CPU layer<\/p>\n<p id=\"cd21fa56-99e5-4257-a310-edccfe7fabc8\">Ascend does not stand alone. <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/pc-components\/cpus\/huawei-preps-new-kunpeng-cpu-with-hbm-linux-patches-point-to-an-unannounced-kunpeng-arm-server-soc\" data-mrf-recirculation=\"inline-link\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/pc-components\/cpus\/huawei-preps-new-kunpeng-cpu-with-hbm-linux-patches-point-to-an-unannounced-kunpeng-arm-server-soc\" rel=\"nofollow noopener\" target=\"_blank\">Huawei\u2019s Kunpeng CPUs<\/a> provide the general-purpose compute layer for these systems, and they follow a similar trajectory. Kunpeng chips are Arm-based, built around Huawei\u2019s Taishan core designs. Earlier generations, such as Kunpeng 920, offered up to 64 Taishan V110 cores and targeted server and cloud workloads with respectable throughput but modest per-core performance.<\/p>\n<p>Meanwhile, recent reporting suggests that the upcoming Kunpeng 930 generation is scaling core counts aggressively, pointing to 120-core designs built from multiple chiplets, while Huawei\u2019s own roadmap references Kunpeng 950 and 960 variants with 192 cores and 384 threads. Per-core performance appears to be roughly in the Zen 3 class, which places Kunpeng behind current Xeon and EPYC parts but potentially competitive in highly parallel, throughput-oriented workloads.<\/p>\n<p>That\u2019s probably good enough for Huawei. Kunpeng\u2019s role is to feed data to accelerators, manage I\/O, and run infrastructure software in an environment where power and rack space are already dominated by Ascend clusters. Tight integration matters more than single-thread speed, and Arm gives Huawei architectural independence from x86 licensing and export risk.<\/p>\n<p>Taken together, Ascend and Kunpeng show us how China\u2019s AI hardware strategy has shifted from chasing individual best-in-class chips to assembling viable end-to-end platforms under constraint. <a data-analytics-id=\"inline-link\" href=\"https:\/\/www.tomshardware.com\/tech-industry\/artificial-intelligence\/china-starts-list-of-government-approved-ai-hardware-suppliers-cambricon-and-huawei-are-in-nvidia-is-not\" data-mrf-recirculation=\"inline-link\" data-before-rewrite-localise=\"https:\/\/www.tomshardware.com\/tech-industry\/artificial-intelligence\/china-starts-list-of-government-approved-ai-hardware-suppliers-cambricon-and-huawei-are-in-nvidia-is-not\" rel=\"nofollow noopener\" target=\"_blank\">Chinese government guidance<\/a> discouraging new purchases of Nvidia hardware, combined with domestic subsidies and procurement rules, creates a large guaranteed market for &#8220;good enough&#8221; alternatives.<\/p>\n<p>But &#8220;good enough&#8221; comes with obvious tradeoffs: Huawei\u2019s clusters consume more power, occupy more space, and rely on heavy overprovisioning to match the throughput of more advanced Western systems. But when push comes to shove, those costs are evidently acceptable in a market where sovereignty and long-term continuity outweigh efficiency.<\/p>\n<p id=\"7972a07a-402d-4d1c-a4c1-28edbebd9b71\">Follow<a data-analytics-id=\"inline-link\" href=\"https:\/\/news.google.com\/publications\/CAAqLAgKIiZDQklTRmdnTWFoSUtFSFJ2YlhOb1lYSmtkMkZ5WlM1amIyMG9BQVAB\" target=\"_blank\" data-url=\"https:\/\/news.google.com\/publications\/CAAqLAgKIiZDQklTRmdnTWFoSUtFSFJ2YlhOb1lYSmtkMkZ5WlM1amIyMG9BQVAB\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" data-mrf-recirculation=\"inline-link\" rel=\"nofollow noopener\"> Tom&#8217;s Hardware on Google News<\/a>, or<a data-analytics-id=\"inline-link\" href=\"https:\/\/google.com\/preferences\/source?q=\" target=\"_blank\" data-url=\"https:\/\/google.com\/preferences\/source?q=\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" data-mrf-recirculation=\"inline-link\" rel=\"nofollow noopener\"> add us as a preferred source<\/a>, to get our latest news, analysis, &amp; reviews in your feeds.<\/p>\n<p><a href=\"https:\/\/news.google.com\/publications\/CAAqLAgKIiZDQklTRmdnTWFoSUtFSFJ2YlhOb1lYSmtkMkZ5WlM1amIyMG9BQVAB\" id=\"6a9eff70-4c5d-42a9-b1d4-08017f63e1c9\" data-url=\"https:\/\/news.google.com\/publications\/CAAqLAgKIiZDQklTRmdnTWFoSUtFSFJ2YlhOb1lYSmtkMkZ5WlM1amIyMG9BQVAB\" target=\"_blank\" referrerpolicy=\"no-referrer-when-downgrade\" data-hl-processed=\"none\" rel=\"nofollow noopener\"><\/p>\n<p class=\"vanilla-image-block\" style=\"padding-top:31.51%;\">\n<p><img decoding=\"async\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/10\/7cUTDmN2PHNRiNBVqbKf56.png\" alt=\"Google Preferred Source\"   loading=\"lazy\" data-new-v2-image=\"true\" data-original-mos=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/10\/7cUTDmN2PHNRiNBVqbKf56.png\" data-pin-media=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/10\/7cUTDmN2PHNRiNBVqbKf56.png\" class=\"pull-left\"\/>\n<\/p>\n<p><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"Huawei used its New Year message to highlight progress across its Ascend AI and Kunpeng CPU ecosystems, pointing&hellip;\n","protected":false},"author":2,"featured_media":384896,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[256,254,255,64,63,105],"class_list":{"0":"post-384895","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-au","12":"tag-australia","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/384895","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/comments?post=384895"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/384895\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media\/384896"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media?parent=384895"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/categories?post=384895"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/tags?post=384895"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}