{"id":326730,"date":"2026-03-07T12:45:08","date_gmt":"2026-03-07T12:45:08","guid":{"rendered":"https:\/\/www.newsbeep.com\/il\/326730\/"},"modified":"2026-03-07T12:45:08","modified_gmt":"2026-03-07T12:45:08","slug":"how-an-intern-helped-build-the-ai-that-shook-the-world","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/il\/326730\/","title":{"rendered":"How an intern helped build the AI that shook the world"},"content":{"rendered":"<p><img decoding=\"async\" class=\"Image\" alt=\"\" width=\"1350\" height=\"899\" src=\"https:\/\/www.newsbeep.com\/il\/wp-content\/uploads\/2026\/03\/SEI_288146249.jpg\"   loading=\"eager\" fetchpriority=\"high\" data-image-context=\"Article\" data-image-id=\"2518490\" data-caption=\"AlphaGo\u2019s victory braodcast on TV\" data-credit=\"Im Hun-jung\/Yonhap\/AP Photo via Getty Images\"\/><\/p>\n<p class=\"ArticleImageCaption__Title\">AlphaGo\u2019s victory braodcast on TV<\/p>\n<p class=\"ArticleImageCaption__Credit\">Im Hun-jung\/Yonhap\/AP Photo via Getty Images<\/p>\n<\/p>\n<p>In March 2016, Google DeepMind\u2019s artificial intelligence system AlphaGo shocked the world. In a stunning <a href=\"https:\/\/www.newscientist.com\/article\/2079871-im-in-shock-how-an-ai-beat-the-worlds-best-human-at-go\/\" rel=\"nofollow noopener\" target=\"_blank\">five-match series of Go<\/a>, the ancient Chinese board game, the AI beat the world\u2019s best player, Lee Sedol \u2013 a moment that was televised in front of millions and hailed by many as a historic moment in the development of artificial intelligence.<\/p>\n<p><a href=\"https:\/\/www.cs.toronto.edu\/~cmaddis\/\" rel=\"nofollow noopener\" target=\"_blank\">Chris Maddison<\/a>, now a professor of artificial intelligence at the University of Toronto, was then a master\u2019s student and helped get the project off the ground. It all began when <a href=\"https:\/\/www.newscientist.com\/article\/mg25133550-800-supersized-ais-are-truly-intelligent-machines-just-a-matter-of-scale\/\" rel=\"nofollow noopener\" target=\"_blank\">Ilya Sutskever<\/a>, who later went on to found OpenAI, got in touch\u2026<\/p>\n<p>Alex Wilkins: How did the idea for AlphaGo first come about?<\/p>\n<p>Chris Maddison: Ilya [Sutskever] gave me the following argument for why we should be working on Go. He said, Chris, do you think when an expert player looks at the Go board, they can pick the best move in half a second? If you think they can, then that means that you can learn a pretty good policy to pick the best move using a neural net.<\/p>\n<p>The reason is that half a second is about the time it takes for your <a href=\"https:\/\/www.newscientist.com\/article\/2106215-blind-people-use-brains-visual-cortex-to-help-do-maths\/\" rel=\"nofollow noopener\" target=\"_blank\">visual cortex<\/a> to do one forward pass [a round of processing], and we already knew from ImageNET [an important AI image-recognition competition] that we\u2019re pretty good at approximating things that only take one forward pass of your visual cortex.<\/p>\n<p>I bought that argument, so I decided to join [Google Brain] as an intern in the summer of 2014.<\/p>\n<p>How did AlphaGo develop from there?<\/p>\n<p>When I joined, there was another little team at DeepMind that I was going to work with, which was Aja Huang and David Silver, that had started working on Go. It was basically my charge to start building the <a href=\"https:\/\/www.newscientist.com\/article-topic\/neural-networks\/\" rel=\"nofollow noopener\" target=\"_blank\">neural networks<\/a>. It was a dream.<\/p>\n<p>There were a bunch of different approaches that we tried, and a lot of the initial things we tried failed. Eventually, I just got frustrated and tried the dumbest, simplest thing, which was to try to predict the next move that an expert would make in a given board position, training a neural network on a big corpus of expert <a href=\"https:\/\/www.newscientist.com\/games\/\" rel=\"nofollow noopener\" target=\"_blank\">games<\/a>. And that turned out to be the approach that really got us off the ground.<\/p>\n<p>By the end of the summer, we hosted a little match with DeepMind\u2019s Thore Graepel, who considered himself a decent Go player, and my networks beat him. DeepMind then started to be convinced that this was going to be a real thing and started putting resources towards it and building a big team around it.<\/p>\n<p>How difficult of a challenge was it seen beating Lee Sedol?<\/p>\n<p>I remember in the summer of 2014, we practically had Lee Sedol\u2019s portrait on our desk next to us. I\u2019m not a Go player, but Aja [Huang] is. Every time I would build a new network, it would get a little bit better, and I would turn to Aja and I\u2019d say, OK, we\u2019re a little bit better, how close are we to Lee Sedol? And Aja would turn to me and say, Chris, you don\u2019t understand. Lee Sedol is one stone from God.<\/p>\n<p>You left the AlphaGo team before the big event. Why?<\/p>\n<p>David [Silver] said we\u2019d like to keep you on and really drive this project to the next level, and, in retrospect, this was maybe one of the stupider decisions I made, I turned him down. I said I think I need to focus on my PhD, I\u2019m an academic at heart. I went back to my PhD and loosely consulted with the project from that point on. I\u2019m a little proud to say it took them a while to beat my neural networks. But then, ultimately, the artefact that played Lee Sedol was the product of a big <a href=\"https:\/\/www.newscientist.com\/article-topic\/engineering\/\" rel=\"nofollow noopener\" target=\"_blank\">engineering<\/a> effort and a big team.<\/p>\n<p>What was the atmosphere like in Seoul when AlphaGo won?<\/p>\n<p>Being there in Seoul at that moment was hard to express. It was emotional. It was intense. There was a sense of <a href=\"https:\/\/www.newscientist.com\/article\/2512121-the-3-best-ways-to-tackle-anxiety-according-to-a-leading-expert\/\" rel=\"nofollow noopener\" target=\"_blank\">anxiety<\/a>. You go in confident, but you never know. It\u2019s like a sports game. Statistically speaking, you\u2019re the better player, but you never know how it\u2019s going to shake out. I remember being in the hotel where we played the matches and looking out the window. We were at a high-enough level that you could look out onto one of the major city intersections. I realised there was a big screen, sort of like Times Square, that was showing our match. And then I looked along the sidewalks, and people were just lined up standing looking at the screen. I had heard numbers like hundreds of millions of people in China watched the first game, but I remember that moment as like, oh God, we\u2019ve really stopped East Asia in its tracks.<\/p>\n<p>How important has AlphaGo been for AI more generally?<\/p>\n<p>A lot has changed on a surface level about the world of <a href=\"https:\/\/www.newscientist.com\/article\/mg25934590-600-we-still-dont-really-understand-what-large-language-models-are\/\" rel=\"nofollow noopener\" target=\"_blank\">large language models<\/a> (LLMs), they are now quite different in some ways from AlphaGo, but actually there\u2019s an underlying technological thread that really hasn\u2019t changed.<\/p>\n<p>So the first part of the algorithm is to train a neural network to predict the next move. Today\u2019s LLMs begin with what we call pretraining to predict the next word, from a big corpus of human text found largely on the internet.<\/p>\n<p>For the second step in AlphaGo, we took the information from that human corpus that was compressed into these neural networks, and we refined it using reinforcement learning, to align the behaviour of the system towards the goal of winning games.<\/p>\n<p>When you learn to predict an expert\u2019s next move, they are trying to win, but that\u2019s not the only thing that explains the next move. Perhaps they don\u2019t understand what the best move is, perhaps they made a mistake, so you need to align the overall system with your true goal, which in the case of AlphaGo was winning.<\/p>\n<p>In large language models, it\u2019s the same after pretraining. The networks are not aligned with how we want to use them, and so we do a series of <a href=\"https:\/\/www.newscientist.com\/article\/2357017-deepmind-ai-is-as-fast-as-humans-at-solving-previously-unseen-tasks\/\" rel=\"nofollow noopener\" target=\"_blank\">reinforcement learning<\/a> steps that align the networks with our goals.<\/p>\n<p>In some ways, not much has changed.<\/p>\n<p>Does it tell us anything about where we can expect AIs to succeed?<\/p>\n<p>It has consequences in terms of what we choose to focus on. If you\u2019re worried about making progress on important problems, the key bottlenecks that you should be worried about are do you have enough data to do pretraining, and do you have reward signals to do post-training. If you don\u2019t have those ingredients, there\u2019s no amount of clever \u2013 you know, this <a href=\"https:\/\/www.newscientist.com\/definition\/algorithms\/\" rel=\"nofollow noopener\" target=\"_blank\">algorithm<\/a> versus that algorithm \u2013 that\u2019s going to get you off the ground.<\/p>\n<p>Did you feel any sympathy for Lee Sedol?<\/p>\n<p>Lee Sedol had been this idol over the summer of 2014, this unachievable milestone. To then suddenly be there in person, watching the matches, his stress, his anxiety, his realisation that this was a much worthier opponent than maybe he had thought going in, that was very stressful. You don\u2019t want to put someone in that position. When he lost the match, he apologised to humanity, and said, \u201cThis is my failing, not yours.\u201d That was tragic.<\/p>\n<p>There is also a custom in Go to review the match with your opponent. Someone wins or loses, but you review the match at the end, unwind the game and explore variations with each other. Lee Sedol couldn\u2019t do that because AlphaGo wasn\u2019t human, so instead he had his friends come in and review the match, but it\u2019s just not the same. There felt something heartbreaking about that.<\/p>\n<p>But I didn\u2019t appreciate all the <a href=\"https:\/\/www.newscientist.com\/article\/mg23431280-300-we-dont-need-to-lose-out-to-machines-says-the-man-who-did\/\" rel=\"nofollow noopener\" target=\"_blank\">man-versus-machine<\/a> narratives around the match, because a team of people built AlphaGo. That was the effort of a tribe building an artefact that could achieve excellence in a human game. It was ultimately the artefact that all our blood, sweat and tears went into.<\/p>\n<p>Do you think there is still a place for humans in the world as AI accomplishes more human thinking work?<\/p>\n<p>We are learning more about the game of Go, and if we think that game is beautiful, which we do, and AIs can teach us more about that beauty, there\u2019s a lot of inherent good in that as well. There\u2019s a difference between goals and purposes. The goal of the game of Go is to win, but that\u2019s not its only purpose \u2013 one purpose is to have fun. Board games are not destroyed by the presence of AI; chess is a thriving industry. We still appreciate the intrigue and the human achievement of that sport.<\/p>\n<p class=\"ArticleTopics__Heading\">Topics:<\/p>\n","protected":false},"excerpt":{"rendered":"AlphaGo\u2019s victory braodcast on TV Im Hun-jung\/Yonhap\/AP Photo via Getty Images In March 2016, Google DeepMind\u2019s artificial intelligence&hellip;\n","protected":false},"author":2,"featured_media":326731,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[345,343,344,85,46,125],"class_list":{"0":"post-326730","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-il","12":"tag-israel","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts\/326730","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/comments?post=326730"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts\/326730\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/media\/326731"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/media?parent=326730"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/categories?post=326730"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/tags?post=326730"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}