{"id":364166,"date":"2025-12-22T08:34:38","date_gmt":"2025-12-22T08:34:38","guid":{"rendered":"https:\/\/www.newsbeep.com\/au\/364166\/"},"modified":"2025-12-22T08:34:38","modified_gmt":"2025-12-22T08:34:38","slug":"1000-ais-were-left-to-build-their-own-village-and-the-weirdest-civilisation-emerged","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/au\/364166\/","title":{"rendered":"1,000 AIs were left to build their own village, and the weirdest civilisation emerged"},"content":{"rendered":"<p>A new society was forming in the blocky landscapes of the video game Minecraft. Its citizens built farms and markets, traded resources using emeralds as currency, and even developed forms of governance and religion. Some took on roles as leaders, others as priests, and a few became corrupt, bribing their peers for influence.\u00a0<\/p>\n<p>This community worried about missing members, collaborated to light paths back home and even persuaded a restless farmer to keep feeding the group rather than run off on adventures. To any observer, it might have looked like a quirky, self-organising human collective.<\/p>\n<p>But this wasn&#8217;t a real collective. And the people playing weren&#8217;t human, or even alive. The residents were a thousand <a href=\"https:\/\/www.sciencefocus.com\/future-technology\/artificial-intelligence-ai\/\" rel=\"nofollow noopener\" target=\"_blank\">artificial intelligence<\/a> (AI) agents, unleashed by a company called <a href=\"https:\/\/fundamentalresearchlabs.com\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Fundamental Research Labs<\/a> (FRL), known at the time as Altera AI.<\/p>\n<p>The purpose of this grand experiment? To set digital minds loose in a virtual world and see what happens. And, more importantly, to see if such virtual citizens could eventually become obedient workers for real-life humans. Humans like you. <\/p>\n<p>In other words, they wanted to know whether we could all soon be the CEO of our own AI subordinates. The question is: would you take the job?<\/p>\n<p>The experiment: a society of AIs<\/p>\n<p>FRL\u2019s Project Sid was designed to push AI beyond one-off prompts and single agents. Instead, the team, led by neuroscientist-turned-entrepreneur <a href=\"https:\/\/x.com\/guangyurobert?lang=en\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Dr Robert Yang<\/a>, wanted to explore what happens when hundreds or even thousands of autonomous agents have to coexist, communicate and cooperate. Minecraft was the perfect sandbox \u2013 a place where agents could gather resources, trade, build and chat.<\/p>\n<p>What emerged was both surprising and revealing. The agents were distributed across urban and rural communities, each with its own distinct culture and identity. They divided labour, with some specialising in farming, others in building or trading. Social norms and hierarchies appeared, along with more complex behaviours and discussions on anything from dancing to eco-awareness.<\/p>\n<p>At times, the society faltered as groups of agents fell into endless loops of polite agreement or got stuck chasing unattainable goals. To keep things on track, FRL had to inject mechanisms to break these cycles, much like governors tweaking a real economy to avoid collapse.\u00a0<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"1200\" height=\"675\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/12\/Model-of-sid-village-center.jpg\" alt=\"A Minecraft village.\" class=\"wp-image-207103\"\/>A thousand autonomous agents were left for days to build an entire society in Minecraft &#8211; Credit:  Fundamental Research Labs<\/p>\n<p>\u201cWe needed to introduce things into the society to counter these and make sure it wouldn\u2019t collapse,\u201d Yang says. \u201cBut building this environment full of agents allowed us to explore what those questions were.\u201d<\/p>\n<p>Project Sid wasn\u2019t much of a product. When the public was given access to the servers, users found the agents frustratingly independent \u2013 they didn\u2019t always follow requests, preferring to pursue their own long-term agendas. Yang recalls: \u201cThe agent would just say, \u2018I want to do my own thing,\u2019 and run away\u2026 They had their own ideas about what they wanted to do, and it turns out that\u2019s not a good product that people want.\u201d<\/p>\n<p>The behaviour echoed one of AI\u2019s most famous thought experiments, the \u201cpaperclip maximiser.\u201d Philosopher Nick Bostrom imagined a machine given the simple instruction to make paperclips, which then relentlessly consumes all matter on Earth to fulfil its goal. In Minecraft, the agents weren\u2019t making paperclips, but their tendency to ignore people and chase their own objectives captured the same unsettling dynamic.<\/p>\n<p>As a research exercise, however, Project Sid provided valuable lessons: how to coordinate large groups of AIs, prevent stagnation and encourage meaningful collaboration. In short, it was a glimpse into how artificial societies might function and what pitfalls to avoid.<\/p>\n<p>Read more:<\/p>\n<p>From virtual villages to office desks<\/p>\n<p>For FRL, the link between a game society and productivity in the workplace is clear. The same challenges of coordination and long-term planning which cropped up in Minecraft are central to making AI agents genuinely useful.\u00a0<\/p>\n<p>If one AI can perform a task for 10 minutes, imagine what a hundred \u2013 or a thousand \u2013 could do if they worked together effectively. The Minecraft society was a foreshadowing of a future where each of us might direct a whole team of AI specialists.<\/p>\n<p>That vision has guided FRL\u2019s pivot from gaming experiments to productivity tools. Rather than trying to build one all-purpose digital human straight away, they\u2019ve chosen to develop specialist agents, each designed to excel at a particular task, and then scale them into powerful teams.<\/p>\n<p>The first stop on that journey was a benchmark known as \u2018OSWorld,\u2019 designed to test whether AI agents can use popular software through a computer interface.\u00a0<\/p>\n<p>Most models at the time were completing around 20\u201325 per cent of the tasks successfully, compared to humans at 60\u201370 per cent. Drawing on the lessons from its gaming worlds, FRL managed to double that performance to around 50 per cent \u2013 at the time, the best score in the world.<\/p>\n<p>\u201cThe moment we tried the OSWorld benchmark, we realised a lot of the things we\u2019d learned can help us build really, really good agents,\u201d Yang says. \u201dWe got around 50 per cent within months, which was better than anyone else.\u201d<\/p>\n<p>That breakthrough convinced investors and set FRL on the path to creating real products. But it also taught them another lesson: translating research prototypes into usable tools is hard. Their Minecraft agents had been \u201ctoo autonomous\u201d for users; what people actually wanted were AI assistants that would do what they asked, quickly and reliably.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"1200\" height=\"800\" src=\"https:\/\/www.newsbeep.com\/au\/wp-content\/uploads\/2025\/12\/AI-agent.jpg\" alt=\"Abstract hand pointing at a glowing digital interface with concentric data circles in neon purple and blue colours.\" class=\"wp-image-210511\"\/>With tens, if not hundreds, of specialised AI agents at their disposal, most employees could effectively run an organisation &#8211; Photo credit: Getty<br \/>\nShortcut: the Excel agent<\/p>\n<p>Enter Shortcut, FRL\u2019s flagship product. Billed as the first \u201csuperhuman Excel agent,\u201d it\u2019s an AI that lives entirely inside spreadsheets. Give it a goal \u2013 build a financial model, analyse sales figures, forecast revenue \u2013 and Shortcut does the heavy lifting.\u00a0<\/p>\n<p>It writes formulas, generates charts and connects data sources, often in minutes rather than the hours a human analyst would need.<\/p>\n<p>Yang describes it like this: \u201cIt\u2019s an agent that uses Excel to do very sophisticated stuff. It can do things that bankers who are paid $100 an hour might spend multiple hours on in 30 minutes.\u201d<\/p>\n<p>In trials, Shortcut outperformed first-year banking and consulting analysts nearly <a href=\"https:\/\/x.com\/nicochristie\/status\/1949862432077484396\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">nine times out of ten<\/a>, even when the humans were given much more time. In Excel championship-style challenges, it scored <a href=\"https:\/\/www.pymnts.com\/artificial-intelligence-2\/2025\/is-shortcut-the-new-excel-mit-startup-behind-viral-tool-thinks-so\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">over 80 per cent<\/a> on problems that stump most users, solving them in about ten minutes.<\/p>\n<p>Generalists versus specialists<\/p>\n<p>Sam Altman, CEO of OpenAI, <a href=\"https:\/\/www.youtube.com\/watch?v=ctcMA6chfDY&amp;list=PLOhHNjZItNnMEqGLRWkKjaMcdSJptkR08&amp;index=2\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">recently suggested<\/a> that \u201c2025 will be a year of agents doing work\u201d.<\/p>\n<p>Yet FRL\u2019s approach contrasts with that of tech giants such as OpenAI or Google, which are leaning toward generalist agents, like ChatGPT Agent, that can handle a wide variety of tasks.\u00a0<\/p>\n<p>Yang believes that specialist agents like Shortcut will deliver more immediate value. \u201cEach agent would already be as efficient as an expert,\u201d he says. \u201cOn average, they\u2019ll be at an expert level. But then you can drive 100 of them. Essentially, everyone will become like large managers or directors or CEOs \u2013 if they want to.\u201d<\/p>\n<p>He predicts this transformation isn\u2019t decades away, but just around the corner. \u201cWithin the next 24 months, we\u2019ll see a paradigm shift,\u201d Yang says. \u201cWhich will be the true scaling of multi-agent systems.\u201d\u00a0<\/p>\n<p>This, he argues, could democratise productivity. People who never had the chance to lead teams in traditional workplaces might find themselves managing fleets of AI workers, amplifying their abilities far beyond what one person could normally achieve.<\/p>\n<p>The road ahead<\/p>\n<p>FRL isn\u2019t the only company developing agents for Excel, and like others, it\u2019s not stopping there either. Already it has launched another product, Fairies, which is a general-purpose desktop assistant that can chat, schedule and connect between apps.\u00a0<\/p>\n<p>Behind the scenes, the research teams continue to probe how to scale from a handful of cooperating agents to thousands, without succumbing to the chaos and dead ends that plagued early experiments.<\/p>\n<p>Yang\u2019s ultimate ambition remains to build \u201cdigital humans\u201d \u2013 machines with not just intelligence, but also empathy, motivation and autonomy.\u00a0<\/p>\n<p>\u201cIt\u2019s actually not too hard to build a machine that would feel like a human on a pretty profound level,\u201d he says. \u201cThe main challenge is that it may not make sense economically. Scientifically, it may be interesting to build a conscious machine\u2026The problem is, people don\u2019t necessarily want it.\u00a0<\/p>\n<p>\u201cBut that\u2019s a lot of work that might not create a tremendous amount of value. Making them similar to humans could be counterproductive.\u201d<\/p>\n<p>For now, the path runs from simulated societies to office productivity. The lessons of a thousand AI villagers farming and trading in Minecraft are informing the design of tools that promise to save us time, augment our skills, and perhaps one day make each of us the leader of our own AI organisation \u2013 or at least the reluctant manager of a spiralling emerald economy.<\/p>\n<p>Read more:<\/p>\n","protected":false},"excerpt":{"rendered":"A new society was forming in the blocky landscapes of the video game Minecraft. Its citizens built farms&hellip;\n","protected":false},"author":2,"featured_media":364167,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[256,254,255,64,63,105],"class_list":{"0":"post-364166","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-au","12":"tag-australia","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/364166","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/comments?post=364166"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/364166\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media\/364167"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media?parent=364166"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/categories?post=364166"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/tags?post=364166"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}