{"id":275378,"date":"2025-11-06T15:23:14","date_gmt":"2025-11-06T15:23:14","guid":{"rendered":"https:\/\/www.newsbeep.com\/us\/275378\/"},"modified":"2025-11-06T15:23:14","modified_gmt":"2025-11-06T15:23:14","slug":"inception-raises-50-million-to-build-diffusion-models-for-code-and-text","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/us\/275378\/","title":{"rendered":"Inception raises $50 million to build diffusion models for code and text"},"content":{"rendered":"<p id=\"speakable-summary\" class=\"wp-block-paragraph\">With so much money flooding into AI startups, it\u2019s a good time to be an AI researcher with an idea to test out. And if the idea is novel enough, it might be easier to get the resources you need as an independent company instead of inside one of the big labs.<\/p>\n<p class=\"wp-block-paragraph\">That\u2019s the story of Inception, a startup developing diffusion-based AI models that just raised $50 million in seed funding led by Menlo Ventures, with participation from Mayfield, Innovation Endeavors, Microsoft\u2019s M12 fund, Snowflake Ventures, Databricks Investment, and Nvidia\u2019s venture arm NVentures. Andrew Ng and Andrej Karpathy provided additional angel funding.<\/p>\n<p class=\"wp-block-paragraph\">The leader of the project is Stanford professor Stefano Ermon, whose research focuses on diffusion models \u2014 which generate outputs through iterative refinement rather than word-by-word. These models power image-based AI systems like Stable Diffusion, Midjourney and Sora. Having worked on those systems since before the AI boom made them exciting, Ermon is using Inception to apply the same models to a broader range of tasks.<\/p>\n<p class=\"wp-block-paragraph\">Together with the funding, the company released a new version of its Mercury model, designed for software development. Mercury has already been integrated into a number of development tools, including ProxyAI, Buildglare, and Kilo Code. Most importantly, Ermon says the diffusion approach will help Inception\u2019s models conserve on two of the most important metrics: latency (response time) and compute cost.<\/p>\n<p class=\"wp-block-paragraph\">\u201cThese diffusion-based LLMs are much faster and much more efficient than what everybody else is building today,\u201d Ermon says. \u201cIt\u2019s just a completely different approach where there is a lot of innovation that can still be brought to the table.\u201d<\/p>\n<p class=\"wp-block-paragraph\">Understanding the technical difference requires a bit of background. Diffusion models are structurally different from auto-regression models, which dominate text-based AI services. Auto-regression models like GPT-5 and Gemini work sequentially, predicting each next word or word fragment based on the previously processed material. Diffusion models, trained for image generation, take a more holistic approach, modifying the overall structure of a response incrementally until it matches the desired result.<\/p>\n<p class=\"wp-block-paragraph\">The conventional wisdom is to use auto-regression models for text applications, and that approach has been hugely successful for recent generations of AI models. But a growing body of research suggests diffusion models may perform better when a model is <a rel=\"nofollow noopener\" href=\"https:\/\/arxiv.org\/abs\/2505.15045\" target=\"_blank\">processing large quantities of text<\/a> or <a rel=\"nofollow noopener\" href=\"https:\/\/blog.ml.cmu.edu\/2025\/09\/22\/diffusion-beats-autoregressive-in-data-constrained-settings\/\" target=\"_blank\">managing data constraints<\/a>. As Ermon tells it, those qualities become a real advantage when performing operations over large codebases.<\/p>\n<p>Techcrunch event<\/p>\n<p>\n\t\t\t\t\t\t\t\t\tSan Francisco<br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t|<br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\tOctober 13-15, 2026\n\t\t\t\t\t\t\t<\/p>\n<p class=\"wp-block-paragraph\">Diffusion models also have more flexibility in how they utilize hardware, a particularly important advantage as the infrastructure demands of AI become clear. Where auto-regression models have to execute operations one after another, diffusion models can process many operations simultaneously, allowing for significantly lower latency in complex tasks.<\/p>\n<p class=\"wp-block-paragraph\">\u201cWe\u2019ve been benchmarked at over 1,000 tokens per second, which is way higher than anything that\u2019s possible using the existing autoregressive technologies,\u201d Ermon says, \u201cbecause our thing is built to be parallel. It\u2019s built to be really, really fast.\u201d<\/p>\n","protected":false},"excerpt":{"rendered":"With so much money flooding into AI startups, it\u2019s a good time to be an AI researcher with&hellip;\n","protected":false},"author":2,"featured_media":275379,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[45],"tags":[182,181,507,146692,39340,74],"class_list":{"0":"post-275378","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-diffusion","12":"tag-menlo-ventures","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/275378","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/comments?post=275378"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/posts\/275378\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media\/275379"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/media?parent=275378"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/categories?post=275378"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/us\/wp-json\/wp\/v2\/tags?post=275378"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}