My colleagues and I are fascinated by the science of moving dots. So what are these dots? Well, it's all of us. And we're moving in our homes, in our offices, as we shop and travel throughout our cities and around the world. And wouldn't it be great if we could understand all this movement? If we could find patterns and meaning and insight in it. And luckily for us, we live in a time where we're incredibly good at capturing information about ourselves. So whether it's through sensors or videos, or apps, we can track our movement with incredibly fine detail.
我和我的同事对移动圆点 背后的科学非常着迷。 那么这些小圆点是什么呢? 就是我们自己。 我们在家里,办公室里来回走动, 也在世界各地旅行和购物。 如果我们能弄清这些移动, 并从中发现规律,意义并提出见解, 不是一件很棒的事吗? 很幸运的是, 我们生活在这么一个时代, 我们非常擅长捕捉关于自身的信息。 不管是通过传感器,视频,或软件应用, 我们都能详尽地追踪到个人移动的轨迹。
So it turns out one of the places where we have the best data about movement is sports. So whether it's basketball or baseball, or football or the other football, we're instrumenting our stadiums and our players to track their movements every fraction of a second. So what we're doing is turning our athletes into -- you probably guessed it -- moving dots.
这就让我们发现, 最佳的数据来源之一 就是体育运动。 因此无论是篮球、棒球、橄榄球或足球, 我们都可以在场馆内, 甚至运动员身上装上设备来追踪 他们每个时刻的运动数据。 所以我们要做的 ——你们大概已经猜到了—— 就是把运动员的移动 转化成圆点的移动。
So we've got mountains of moving dots and like most raw data, it's hard to deal with and not that interesting. But there are things that, for example, basketball coaches want to know. And the problem is they can't know them because they'd have to watch every second of every game, remember it and process it. And a person can't do that, but a machine can. The problem is a machine can't see the game with the eye of a coach. At least they couldn't until now. So what have we taught the machine to see?
所以我们收集了不计其数的移动小圆点, 就像多数原始数据一样, 难以处理,也没什么趣味。 但数据里面蕴藏着, 比如篮球教练想知道的事情。 但问题是,除非教练们把每场比赛里 每一秒数据都记下来再去思考, 否则他们没法从中得到想要的信息。 人类大脑无法做到这件事, 但机器没问题。 然而,机器没办法自己 以教练的视角去看一场比赛。 直到现在,我们做到了。 那么, 我们让机器去观察些什么呢?
So, we started simply. We taught it things like passes, shots and rebounds. Things that most casual fans would know. And then we moved on to things slightly more complicated. Events like post-ups, and pick-and-rolls, and isolations. And if you don't know them, that's okay. Most casual players probably do. Now, we've gotten to a point where today, the machine understands complex events like down screens and wide pins. Basically things only professionals know. So we have taught a machine to see with the eyes of a coach.
先从简单的开始。 我们先教会它传球、投篮和篮板球, 这类普通球迷也知道的事。 然后我们开始教它一些 稍复杂点的事情, 比如落位背打、挡拆和拉开单打。 你们如果不了解这些名词, 没关系。打球的人大都了如指掌。 迄今为止,我们已经能够让机器理解 下掩护和无球掩护这类复杂的, 只有专业人士才懂的战术。 于是我们已经教会电脑用 教练的视角去观察数据了。
So how have we been able to do this? If I asked a coach to describe something like a pick-and-roll, they would give me a description, and if I encoded that as an algorithm, it would be terrible. The pick-and-roll happens to be this dance in basketball between four players, two on offense and two on defense. And here's kind of how it goes. So there's the guy on offense without the ball the ball and he goes next to the guy guarding the guy with the ball, and he kind of stays there and they both move and stuff happens, and ta-da, it's a pick-and-roll.
我们是怎么做到的呢? 如果我让一个教练讲解挡拆, 我会得到一个定义, 如果我把这个定义编码成一个算法 估计会惨不忍睹。 挡拆就是四个球员之间的舞蹈, 两人进攻,两人防守。 大概是这么个过程: 一个没有带球的进攻球员 跑向持球的防守队员, 站在那里待一会儿, 然后他们一起移动(制造机会), 嗒哒,这就是挡拆。
(Laughter)
(笑声)
So that is also an example of a terrible algorithm. So, if the player who's the interferer -- he's called the screener -- goes close by, but he doesn't stop, it's probably not a pick-and-roll. Or if he does stop, but he doesn't stop close enough, it's probably not a pick-and-roll. Or, if he does go close by and he does stop but they do it under the basket, it's probably not a pick-and-roll. Or I could be wrong, they could all be pick-and-rolls. It really depends on the exact timing, the distances, the locations, and that's what makes it hard. So, luckily, with machine learning, we can go beyond our own ability to describe the things we know.
这也是个糟糕的算法实例。 如果那个干扰的球员—— 或者叫掩护者—— 只是跑过来干扰一下而不停下, 这可能就不是挡拆了。 就算他停下来, 但停的位置不够接近, 那也不算是挡拆。 或者,就算他足够近,而且停下来, 但他是在篮下完成的 那也不算挡拆。 或者我可能错了, 这些都是挡拆。 是否是挡拆要根据发生的时间、 球员间距、位置而定, 这些都很难去界定。 幸运的是,有了机器学习技术, 我们就能超越自身的能力 来描述我们已知的事物。
So how does this work? Well, it's by example. So we go to the machine and say, "Good morning, machine. Here are some pick-and-rolls, and here are some things that are not. Please find a way to tell the difference." And the key to all of this is to find features that enable it to separate. So if I was going to teach it the difference between an apple and orange, I might say, "Why don't you use color or shape?" And the problem that we're solving is, what are those things? What are the key features that let a computer navigate the world of moving dots? So figuring out all these relationships with relative and absolute location, distance, timing, velocities -- that's really the key to the science of moving dots, or as we like to call it, spatiotemporal pattern recognition, in academic vernacular. Because the first thing is, you have to make it sound hard -- because it is.
这个技术要如何实现呢? 举个例子: 我们对机器说, “早上好,机器。 这儿有些挡拆例子,还有一些不是。 你来找出不同点吧。” 这其中的关键是电脑能找出 区别两者的特征来。 所以如果我要教会机器 辨别苹果和橘子, 我可能会说: “不妨用颜色和形状来区分吧?” 而目前要解决的问题就是, 要区分事物的特征是什么? 电脑需要掌握的整个 移动圆点世界的关键特征是什么? 搞清楚所有这些相对位置、 绝对位置、距离、时机、 速率之间的关系—— 就是移动圆点科学的真正关键所在, 换成专业术语, 我们喜欢称之为:时空模式识别。 因为首先,你要让它听起来 很难懂,很专业—— 因为事实的确如此。
The key thing is, for NBA coaches, it's not that they want to know whether a pick-and-roll happened or not. It's that they want to know how it happened. And why is it so important to them? So here's a little insight. It turns out in modern basketball, this pick-and-roll is perhaps the most important play. And knowing how to run it, and knowing how to defend it, is basically a key to winning and losing most games. So it turns out that this dance has a great many variations and identifying the variations is really the thing that matters, and that's why we need this to be really, really good.
对于NBA教练们来说,判断是否是 挡拆并不是关键, 而这个挡拆是怎么发生的 才是他们关注的。 为何教练们如此关心这一点? 这儿我要解释一下。 在现代的篮球比赛中, 挡拆几乎是最重要的战术。 了解如何使用以及怎样防守挡拆, 基本上是比赛输赢的关键。 因此挡拆的步伐多种多样, 能够识别这些不同的形式 是非常重要的, 这就是为什么我们对 机器的智能性要求相当高。
So, here's an example. There are two offensive and two defensive players, getting ready to do the pick-and-roll dance. So the guy with ball can either take, or he can reject. His teammate can either roll or pop. The guy guarding the ball can either go over or under. His teammate can either show or play up to touch, or play soft and together they can either switch or blitz and I didn't know most of these things when I started and it would be lovely if everybody moved according to those arrows. It would make our lives a lot easier, but it turns out movement is very messy. People wiggle a lot and getting these variations identified with very high accuracy, both in precision and recall, is tough because that's what it takes to get a professional coach to believe in you. And despite all the difficulties with the right spatiotemporal features we have been able to do that.
举个例子。 这儿有两个进攻队员和 两个防守队员, 他们准备开始实施挡拆。 那么持球人既可以选择利用挡拆, 也可以放弃挡拆, 他的队友可以拆向篮下, 或撤到一个无人盯防的空位。 防守持球者的人可以上前绕过掩护, 或者从后方绕过掩护。 而他的队友则可以探出补防,或保持 近距离防守,亦或者向后消极防守。 两个防守球员也可以换防,或者包夹。 一开始的时候我也不是很懂这些, 如果每个人都能沿着箭头方向移动, 事情就好办多了。 这会让我们的工作简单很多, 但往往这些移动非常杂乱。 球场上会发生很多突然的变动, 要在查准率和查全率方面 准确识别这些变化 是相当困难的, 但只有这样, 才能让专业教练相信你的技术。 尽管在准确的时空特性识别上 困难重重, 我们还是成功地做到了。
Coaches trust our ability of our machine to identify these variations. We're at the point where almost every single contender for an NBA championship this year is using our software, which is built on a machine that understands the moving dots of basketball. So not only that, we have given advice that has changed strategies that have helped teams win very important games, and it's very exciting because you have coaches who've been in the league for 30 years that are willing to take advice from a machine. And it's very exciting, it's much more than the pick-and-roll. Our computer started out with simple things and learned more and more complex things and now it knows so many things. Frankly, I don't understand much of what it does, and while it's not that special to be smarter than me, we were wondering, can a machine know more than a coach? Can it know more than person could know? And it turns out the answer is yes.
教练相信我们的机器 能够识别这些变化。 目前,我们已经推出了 相关的识别软件,几乎每个 觊觎今年NBA总冠军的球队, 都在使用我们的这款软件, 其功能就是通过机器 识别篮球领域的移动。 不仅如此, 我们还对如何改善战术提供建议, 并帮助球队赢得过重要的比赛。 能够让联盟中执教30年的 老教练愿意听取 机器提供的意见,这太让人激动了。 不仅仅局限于挡拆, 更让我们兴奋的是 我们让电脑从简单的事情着手, 逐渐学会了更复杂的事物, 如今它已经掌握了丰富的知识。 老实说,我不大明白它是怎么做到的, 不过就算比我聪明也没什么特别的, 但我们在想, 机器能否比教练懂得更多呢? 它能比人类懂得更多吗? 事实上,答案是肯定的。
The coaches want players to take good shots. So if I'm standing near the basket and there's nobody near me, it's a good shot. If I'm standing far away surrounded by defenders, that's generally a bad shot. But we never knew how good "good" was, or how bad "bad" was quantitatively. Until now.
教练想让球员投出好球。 所以如果我站在篮筐旁边, 周围没人,这就是好的投篮时机。 如果我站得远,而且被对方包围住, 通常来讲这球投不进。 但我们无法定量衡量这个“好”有多好, “差”有多差, 但现在不同了。
So what we can do, again, using spatiotemporal features, we looked at every shot. We can see: Where is the shot? What's the angle to the basket? Where are the defenders standing? What are their distances? What are their angles? For multiple defenders, we can look at how the player's moving and predict the shot type. We can look at all their velocities and we can build a model that predicts what is the likelihood that this shot would go in under these circumstances? So why is this important? We can take something that was shooting, which was one thing before, and turn it into two things: the quality of the shot and the quality of the shooter. So here's a bubble chart, because what's TED without a bubble chart?
同样,我们能做的就是利用时空特性 来分析每次投篮。 我们可以看到:在哪里投篮? 投篮的角度是多少? 防守方的站位? 他们间的距离, 以及角度如何? 防守球员不止一名的情况下, 我们能够通过观察球员的移动 来预测投篮类型。 我们可以根据他们的速度 建立一个模型, 预测在这些情况下,进球的可能性。 为什么这一点很重要? 因为我们可以通过分析投篮 这一单一行为得到 不同以往的两种信息: 投篮的质量,以及投手的质量。 我们可以看一下这个气泡图, 没有气泡图,还算什么TED呢?
(Laughter)
(笑声)
Those are NBA players. The size is the size of the player and the color is the position. On the x-axis, we have the shot probability. People on the left take difficult shots, on the right, they take easy shots. On the [y-axis] is their shooting ability. People who are good are at the top, bad at the bottom. So for example, if there was a player who generally made 47 percent of their shots, that's all you knew before. But today, I can tell you that player takes shots that an average NBA player would make 49 percent of the time, and they are two percent worse. And the reason that's important is that there are lots of 47s out there. And so it's really important to know if the 47 that you're considering giving 100 million dollars to is a good shooter who takes bad shots or a bad shooter who takes good shots. Machine understanding doesn't just change how we look at players, it changes how we look at the game.
这些气泡都是NBA球员。 大小代表球员的体型, 颜色代表他们的位置。 x轴代表投篮的命中率。 靠左的球员偏向勉强投篮, 靠右的球员会在有空当时才出手。 Y轴代表的是投篮质量。 好投手在上面,较差的在下面。 举个例子,有一个球员的 投篮命中率是47%, 以前你只能知道这么多。 但如今,我能告诉你NBA球员投篮的 平均命中率是49%, 他还低了两个百分点。 因为我们要在众多47%的 球员中选择一个。 那么重点就在于要搞清楚 让你支付了一大笔美金的人 到底是个经常勉强投篮的神投手, 还是一个愿意空位出手的差投手。 机器分析不只改变了 我们对球员的看法, 也改变了我们看待比赛的方式。
So there was this very exciting game a couple of years ago, in the NBA finals. Miami was down by three, there was 20 seconds left. They were about to lose the championship. A gentleman named LeBron James came up and he took a three to tie. He missed. His teammate Chris Bosh got a rebound, passed it to another teammate named Ray Allen. He sank a three. It went into overtime. They won the game. They won the championship. It was one of the most exciting games in basketball. And our ability to know the shot probability for every player at every second, and the likelihood of them getting a rebound at every second can illuminate this moment in a way that we never could before. Now unfortunately, I can't show you that video. But for you, we recreated that moment at our weekly basketball game about 3 weeks ago.
几年前有一场很激烈的NBA总决赛, 迈阿密落后三分,只剩20秒了。 他们将要失去总冠军了。 一位叫勒布朗詹姆斯的年轻人 上去想投个三分追平。 但他没投中。 他的队友克里斯波什拿到篮板, 传给另一个队友雷阿伦。 他投中了个三分,比赛进入加时。 最后他们赢了比赛,得了总冠军。 这是篮球比赛中 最激动人心的时刻之一。 而我们能知道每个球员在每一刻的 投篮命中率 以及抢到篮板的可能性, 这种能力是前所未有的。 有点可惜, 我无法给大家展示这个精彩片段。 但为了在座的各位,我们在三周前的 篮球周赛上重塑了那经典一刻。
(Laughter)
(笑声)
And we recreated the tracking that led to the insights. So, here is us. This is Chinatown in Los Angeles, a park we play at every week, and that's us recreating the Ray Allen moment and all the tracking that's associated with it. So, here's the shot. I'm going to show you that moment and all the insights of that moment. The only difference is, instead of the professional players, it's us, and instead of a professional announcer, it's me. So, bear with me.
我们也重新加入了 电脑追踪数据的演示。 这就是我和同事们, 在洛杉矶的唐人街, 我们每周都会去打球的公园, 我们在重塑雷阿伦时刻, 所有的轨迹都与之相关。 就是这个投篮。 你们会看到这一经典时刻, 以及这一刻背后都发生了什么。 唯一的不同就是 我们取代了专业球员, 而我取代了专业讲解员。 大家请见谅。
Miami. Down three. Twenty seconds left. Jeff brings up the ball. Josh catches, puts up a three!
迈阿密。 落后三分。 还有20秒。 杰夫带球。 约什接球,三分出手!
[Calculating shot probability]
[计算命中率]
[Shot quality]
[投篮质量]
[Rebound probability]
[篮板球概率]
Won't go!
进不了!
[Rebound probability]
[篮板球概率]
Rebound, Noel. Back to Daria.
诺尔的篮板。 传回给达丽亚。
[Shot quality]
[投篮质量]
Her three-pointer -- bang! Tie game with five seconds left. The crowd goes wild.
球进了——三分! 打平了,还剩5秒。 观众们沸腾了!
(Laughter)
(笑声)
That's roughly how it happened.
真实情况大概就是这样。
(Applause)
(掌声)
Roughly.
差不多。
(Applause) That moment had about a nine percent chance of happening in the NBA and we know that and a great many other things. I'm not going to tell you how many times it took us to make that happen.
(掌声) 在NBA有9%的概率 会发生这样的时刻, 我们知道的还有很多。 我是不会告诉你们 我们尝试了多少次才成功的。
(Laughter)
(笑声)
Okay, I will! It was four.
好吧,我还是说吧,四次。
(Laughter)
(笑声)
Way to go, Daria.
达丽亚,三分球还得努力啊。
But the important thing about that video and the insights we have for every second of every NBA game -- it's not that. It's the fact you don't have to be a professional team to track movement. You do not have to be a professional player to get insights about movement.
但那段视频以及我们对 每场NBA比赛的细微观察 并不是重点。 事实上,你无需组建 一个专业团队才能追踪移动。 你也无需成为专业运动员 去理解那些移动。
In fact, it doesn't even have to be about sports because we're moving everywhere. We're moving in our homes, in our offices, as we shop and we travel throughout our cities and around our world. What will we know? What will we learn? Perhaps, instead of identifying pick-and-rolls, a machine can identify the moment and let me know when my daughter takes her first steps. Which could literally be happening any second now.
而且,这不仅限于运动, 因为我们无时不刻不在移动。 我们在家里, 在办公室里来回走动, 我们也在世界各地 各个城市 购物旅行。 我们能发现什么? 我们能学到什么? 或许,除了识别挡拆, 机器还能识别某些时刻, 让我知道我女儿何时 迈出她的第一步。 她现在随时都有可能学会走路。
Perhaps we can learn to better use our buildings, better plan our cities. I believe that with the development of the science of moving dots, we will move better, we will move smarter, we will move forward.
或许我们能合理地利用我们的建筑物, 更加好地规划我们的城市。 我相信随着移动圆点这一科学的发展, 我们能更好地移动, 更智能地移动,一路向前。
Thank you very much.
谢谢大家。
(Applause)
(掌声)