Peter Donnelly: How juries are fooled by statistics

As other speakers have said, it's a rather daunting experience -- a particularly daunting experience -- to be speaking in front of this audience. But unlike the other speakers, I'm not going to tell you about the mysteries of the universe, or the wonders of evolution, or the really clever, innovative ways people are attacking the major inequalities in our world. Or even the challenges of nation-states in the modern global economy. My brief, as you've just heard, is to tell you about statistics -- and, to be more precise, to tell you some exciting things about statistics. And that's -- (Laughter) -- that's rather more challenging than all the speakers before me and all the ones coming after me. (Laughter) One of my senior colleagues told me, when I was a youngster in this profession, rather proudly, that statisticians were people who liked figures but didn't have the personality skills to become accountants. (Laughter) And there's another in-joke among statisticians, and that's, "How do you tell the introverted statistician from the extroverted statistician?" To which the answer is, "The extroverted statistician's the one who looks at the other person's shoes." (Laughter) But I want to tell you something useful -- and here it is, so concentrate now. This evening, there's a reception in the University's Museum of Natural History. And it's a wonderful setting, as I hope you'll find, and a great icon to the best of the Victorian tradition. It's very unlikely -- in this special setting, and this collection of people -- but you might just find yourself talking to someone you'd rather wish that you weren't. So here's what you do. When they say to you, "What do you do?" -- you say, "I'm a statistician." (Laughter) Well, except they've been pre-warned now, and they'll know you're making it up. And then one of two things will happen. They'll either discover their long-lost cousin in the other corner of the room and run over and talk to them. 
Or they'll suddenly become parched and/or hungry -- and often both -- and sprint off for a drink and some food. And you'll be left in peace to talk to the person you really want to talk to.

他の講演者の方も話されましたがこの観客の前で話すのは手ごわい経験ですしかし私がお話しするのは他の方たちの様に宇宙の謎や進化の神秘世界の不平等を解消する革新的方法やまた現代のグローバル経済における― 国家的課題などではありません私は統計学について－より正確に言うと統計学のワクワクするような話をしますそれは — （笑）他の講演者たちよりも随分と努力が必要です（笑）私が若輩者だった頃先輩が誇らしげに教えてくれたのは統計学者は数字が得意だけれども会計士になれるほど人格者ではないことです（笑）もう１つ統計学者が内輪で言っているジョークが「性格が内向的な統計学者と外向的な統計学者を見分けるには？」答えは「外向的統計学者は相手の身なりをよく見ている」です（笑）今から役立つことを伝えたいのでよく聞いてください今夜大学の自然史博物館でパーティーがありますお気に召せばいいですが ― 素晴らしい場所で伝統あるビクトリア時代の象徴ですそんな会場にこんな方々と一緒にいてもこの人とは話したくないという相手もいるかもしれませんこうすれば良いのです「ご職業は？」と聞かれたら「統計学者です」と答えてください（笑）ここでネタをばらしてしまったので今回は見え見えになってしまいましたが普通次のどちらかのことが起きます久しぶりのイトコがあそこにいるので話してきますといって去るか突然のどの渇きや空腹が襲ってきて飲み物や食べ物を急いで取りに行くのですあなたは落ち着いて本当に話したい人の元へと向かえます

It's one of the challenges in our profession to try and explain what we do. We're not top on people's lists for dinner party guests and conversations and so on. And it's something I've never really found a good way of doing. But my wife -- who was then my girlfriend -- managed it much better than I've ever been able to. Many years ago, when we first started going out, she was working for the BBC in Britain, and I was, at that stage, working in America. I was coming back to visit her. She told this to one of her colleagues, who said, "Well, what does your boyfriend do?" Sarah thought quite hard about the things I'd explained -- and she concentrated, in those days, on listening. (Laughter) Don't tell her I said that. And she was thinking about the work I did developing mathematical models for understanding evolution and modern genetics. So when her colleague said, "What does he do?" She paused and said, "He models things." (Laughter) Well, her colleague suddenly got much more interested than I had any right to expect and went on and said, "What does he model?" Well, Sarah thought a little bit more about my work and said, "Genes." (Laughter) "He models genes."

統計学者が何をするか説明するのは努力を要することの１つです統計学者はパーティーや会談の主賓としては招待されません良い説明方法はいまだに見つかっていません私の妻はまだガールフレンドだった頃に私よりも上手くその質問を切り抜けたことがありました付き合い始めた時彼女はイギリスで BBCに勤めていました私はその頃アメリカで働いていました私が彼女を訪ねに来たときのことです「彼氏の職業は？」と聞かれたとき彼女は同僚にこう答えたのですサラは私の説明を一生懸命思いだそうとしました当時の彼女は私の言うことをちゃんと聞いていましたから（笑）これは内緒にしておいてください私の仕事は進化と現代的遺伝学を理解するために数学的モデルを発展させることだと彼女は考えていましたですから彼女は同僚から「彼氏の職業は？」と聞かれた時間をおいてこう言いました「彼はモデルをするの」（笑）予期しなかったのですがその同僚は突然興味津々になり続けてこう言ったのです「なんのモデルをしているの？」でサラはちょっと考えて答えました「ジーンズ（遺伝子）よ」（笑）「彼はジーンズのモデルをするの」

That is my first love, and that's what I'll tell you a little bit about. What I want to do more generally is to get you thinking about the place of uncertainty and randomness and chance in our world, and how we react to that, and how well we do or don't think about it. So you've had a pretty easy time up till now -- a few laughs, and all that kind of thing -- in the talks to date. You've got to think, and I'm going to ask you some questions. So here's the scene for the first question I'm going to ask you. Can you imagine tossing a coin successively? And for some reason -- which shall remain rather vague -- we're interested in a particular pattern. Here's one -- a head, followed by a tail, followed by a tail.

これで本当に彼女が好きになりましたね統計学者の仕事の話を続けましょうより一般的な事例を挙げて皆さんに世の中の不確定で不規則で偶然な出来事を考えてもらいそれにどう反応するか適切に考えることができるかを検討してほしいのですなのでここで今までのデートの話で笑うような気楽な時間は終了です皆さんにはいくつか問題を出したいと思います第一問こういう状況です繰り返しコインを投げますある理由があって― それには特に触れませんが私たちはある特徴的パターンに興味を持ちますこれがそのパターンですコインの表が出て次に裏・裏

So suppose we toss a coin repeatedly. Then the pattern, head-tail-tail, that we've suddenly become fixated with happens here. And you can count: one, two, three, four, five, six, seven, eight, nine, 10 -- it happens after the 10th toss. So you might think there are more interesting things to do, but humor me for the moment. Imagine this half of the audience each get out coins, and they toss them until they first see the pattern head-tail-tail. The first time they do it, maybe it happens after the 10th toss, as here. The second time, maybe it's after the fourth toss. The next time, after the 15th toss. So you do that lots and lots of times, and you average those numbers. That's what I want this side to think about.

コインを何度も繰り返し投げることにしますすると注目している表・裏・裏のパターンがここで起こります数えられますね 1 2 3 4 5 6 7 8 9 10 10回目のコイントスの後の結果です他にも興味深いことがと思うかもしれませんでもちょっとお待ちください観客を半分に分けて表・裏・裏のパターンが出るまで各々がコインを投げると思ってください１度目にはこのように10回投げた後の結果はこうなり２度目は４回のトスで起こるかもしれませんその次は 15回目のトスの後でそれを何度も何度も行って平均回数を出してくださいそれがこちら側の半分の人にやってもらいたいことです

The other half of the audience doesn't like head-tail-tail -- they think, for deep cultural reasons, that's boring -- and they're much more interested in a different pattern -- head-tail-head. So, on this side, you get out your coins, and you toss and toss and toss. And you count the number of times until the pattern head-tail-head appears and you average them. OK? So on this side, you've got a number -- you've done it lots of times, so you get it accurately -- which is the average number of tosses until head-tail-tail. On this side, you've got a number -- the average number of tosses until head-tail-head.

もう半数の観客は表・裏・裏が好きじゃありません彼らは深遠な文化的理由からそんなのつまらないと思い他のパターンの方に興味を持ちました表・裏・表ですこちらの方たちはコインを取り出してトスを何回も繰り返して表・裏・表が出るまで投げた回数を数えてその平均を出してくださいいいですね？こちらの方たちはコイントスを繰り返して表・裏・裏が出るまでの平均回数を正確に導きだしてくださいこちらの皆さんは同様に表・裏・表の平均回数を出して下さい

So here's a deep mathematical fact -- if you've got two numbers, one of three things must be true. Either they're the same, or this one's bigger than this one, or this one's bigger than that one. So what's going on here? So you've all got to think about this, and you've all got to vote -- and we're not moving on. And I don't want to end up in the two-minute silence to give you more time to think about it, until everyone's expressed a view. OK. So what you want to do is compare the average number of tosses until we first see head-tail-head with the average number of tosses until we first see head-tail-tail.

数学的事実は以下の通りです２つの平均回数が導き出せたら次の３つの内１つが真実のはずです２つとも同じ数かこちら側の数が多いか反対側の数が多いかさてどうなるでしょう？皆さんがこの問題を理解して投票して欲しいと思いますそれまで次へは進みません２分間静かに考えて全員が答えを出して下さいねもっと時間が必要だという状況にはしたくありませんでは最初に表・裏・表が出たコイントス回数と表・裏・裏が出た回数を比べましょう

Who thinks that A is true -- that, on average, it'll take longer to see head-tail-head than head-tail-tail? Who thinks that B is true -- that on average, they're the same? Who thinks that C is true -- that, on average, it'll take less time to see head-tail-head than head-tail-tail? OK, who hasn't voted yet? Because that's really naughty -- I said you had to. (Laughter) OK. So most people think B is true. And you might be relieved to know even rather distinguished mathematicians think that. It's not. A is true here. It takes longer, on average. In fact, the average number of tosses till head-tail-head is 10 and the average number of tosses until head-tail-tail is eight. How could that be? Anything different about the two patterns? There is. Head-tail-head overlaps itself. If you went head-tail-head-tail-head, you can cunningly get two occurrences of the pattern in only five tosses. You can't do that with head-tail-tail. That turns out to be important.

Aが真実だと思う人はいますか？「平均で表・裏・表の方が表・裏・裏より回数が多い」です Bが真実だと思う人は？「平均回数は同じ」 Cが真実だと思う人は？「平均で表・裏・表の方が表・裏・裏より回数が少ない」まだ投票してない人はいますか？それはだめですよ（笑）ほとんどの人が Bを真実だと思っていますので超優秀な数学者もそう考えると知れば少しは安心ですよねところが Aが真実なのですこちらの平均回数の方が多いのです実は表・裏・表が出るまで平均回数は10回で表・裏・裏の平均は８回ですどうしてこうなったのでしょう？２つのパターンに違いはあるのか？表・裏・表はそれ自身に重なっているのです表・裏・表・裏・表と出たらたった５回のトスでそのパターンが２回発生しています表・裏・裏ではそんなことは起こりませんそれが肝です
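
The two averages quoted above, 10 tosses for head-tail-head and eight for head-tail-tail, can be checked directly. The sketch below is an illustration added to this transcript, not something shown in the talk. It assumes a fair coin. `expected_wait` applies the standard overlap rule (for every length k at which the start of the pattern matches its end, add 2 to the power k), and `simulated_wait` simply tosses a simulated coin many times. Both function names are made up for this example.

```python
import random


def expected_wait(pattern):
    """Exact average number of fair-coin tosses until `pattern` first appears.

    Overlap rule: for every length k where the first k symbols of the
    pattern equal its last k symbols, add 2**k.
    """
    n = len(pattern)
    return sum(2 ** k for k in range(1, n + 1) if pattern[:k] == pattern[n - k:])


def simulated_wait(pattern, trials=20_000, seed=0):
    """Average tosses until `pattern` first appears, estimated by simulation."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        recent = ""  # the last len(pattern) tosses seen so far
        tosses = 0
        while recent != pattern:
            recent = (recent + rng.choice("HT"))[-len(pattern):]
            tosses += 1
        total += tosses
    return total / trials


if __name__ == "__main__":
    for p in ("HTH", "HTT"):
        print(p, expected_wait(p), round(simulated_wait(p), 2))
```

Head-tail-head matches itself at lengths 1 and 3, giving 2 + 8 = 10. Head-tail-tail matches itself only at length 3, giving 8. The self-overlap is exactly what costs the extra two tosses.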

There are two ways of thinking about this. I'll give you one of them. So imagine -- let's suppose we're doing it. On this side -- remember, you're excited about head-tail-tail; you're excited about head-tail-head. We start tossing a coin, and we get a head -- and you start sitting on the edge of your seat because something great and wonderful, or awesome, might be about to happen. The next toss is a tail -- you get really excited. The champagne's on ice just next to you; you've got the glasses chilled to celebrate. You're waiting with bated breath for the final toss. And if it comes down a head, that's great. You're done, and you celebrate. If it's a tail -- well, rather disappointedly, you put the glasses away and put the champagne back. And you keep tossing, to wait for the next head, to get excited.

そこには２つの考え方がありますその１つを説明しましょう先ほどやったことを思い出してくださいこちら側の皆さんは表・裏・裏を期待していました反対側は表・裏・表を期待していましたコインを投げたら表が出ました皆さんは椅子に座り直します何か凄くて素晴らしくてステキなことが起こりそうだからです次のトスは裏です嬉しいですね氷の上のシャンパンがそばにありますお祝いの冷えたシャンパングラスがあります息をのんで最後のトスを待ちます次に表が出たら素晴らしい！やった！お祝いだ！裏だったら少々ガッカリしてシャンパングラスを退けシャンパンを返却しますそして次の表が出るまで興奮するためのコイントスを続けます

On this side, there's a different experience. It's the same for the first two parts of the sequence. You're a little bit excited with the first head -- you get rather more excited with the next tail. Then you toss the coin. If it's a tail, you crack open the champagne. If it's a head you're disappointed, but you're still a third of the way to your pattern again. And that's an informal way of presenting it -- that's why there's a difference. Another way of thinking about it -- if we tossed a coin eight million times, then we'd expect a million head-tail-heads and a million head-tail-tails -- but the head-tail-heads could occur in clumps. So if you want to put a million things down amongst eight million positions and you can have some of them overlapping, the clumps will be further apart. It's another way of getting the intuition.

こちらは違う経験です最初の２つの結果は同じです最初に表が出た時は少し興奮します次に裏が出たらもっと興奮しますそしてコインを投げます裏が出たらシャンパンを開けますもし表が出たらガッカリですそれでもパターンの３分の１は達成しているのですくだけた感じの説明でしたが２つのパターンが違うはこのためですもう１つの考え方はもし 800万回コイントスをして表・裏・表も表・裏・裏も 100万回出ると予測しますが表・裏・表は塊で出ることが可能です 800万ヶ所に 100万個のものを置きたいならそのいくつかは重なることもできますすると塊はもっと離れることになりますこれが直感的に理解するもう１つの方法なのです
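
The "clumps" argument can also be tried out on a simulated run of tosses. This is a minimal sketch under the assumption of a fair coin. The helper names `occurrence_positions` and `clump_summary` are invented for this illustration. It counts every occurrence of a pattern, overlaps included, and reports the smallest distance between two consecutive occurrences.

```python
import random
import re


def occurrence_positions(sequence, pattern):
    """Start positions of every occurrence of `pattern`, overlaps included."""
    # A lookahead matches without consuming text, so overlapping hits are found.
    return [m.start() for m in re.finditer(f"(?={re.escape(pattern)})", sequence)]


def clump_summary(pattern, n_tosses=200_000, seed=1):
    """Count occurrences of `pattern` in a random toss sequence and measure gaps."""
    rng = random.Random(seed)
    sequence = "".join(rng.choices("HT", k=n_tosses))
    positions = occurrence_positions(sequence, pattern)
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return {
        "count": len(positions),
        # Each starting position matches with probability 1 / 2**len(pattern).
        "expected_count": (n_tosses - len(pattern) + 1) / 2 ** len(pattern),
        "smallest_gap": min(gaps),
    }


if __name__ == "__main__":
    for p in ("HTH", "HTT"):
        print(p, clump_summary(p))
```

Both patterns turn up about equally often, roughly one position in eight. But two head-tail-heads can start just two tosses apart, while two head-tail-tails are always at least three apart. Since the totals are the same, the clumped pattern has to leave longer empty stretches elsewhere.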

What's the point I want to make? It's a very, very simple example, an easily stated question in probability, which every -- you're in good company -- everybody gets wrong. This is my little diversion into my real passion, which is genetics. There's a connection between head-tail-heads and head-tail-tails in genetics, and it's the following. When you toss a coin, you get a sequence of heads and tails. When you look at DNA, there's a sequence of not two things -- heads and tails -- but four letters -- As, Gs, Cs and Ts. And there are little chemical scissors, called restriction enzymes which cut DNA whenever they see particular patterns. And they're an enormously useful tool in modern molecular biology. And instead of asking the question, "How long until I see a head-tail-head?" -- you can ask, "How big will the chunks be when I use a restriction enzyme which cuts whenever it sees G-A-A-G, for example? How long will those chunks be?"

お伝えしたいポイントはこの問題が確率におけるとても単純で簡潔な例題でありここにいる皆さんまでもが間違いを犯すものだということです私が本当に興味を持っている遺伝学にも同じようなことがあります遺伝学でも表・裏・表と表・裏・裏に関連があります以下の通りですコインを投げると表・裏の順番が発生します DNAを観察すると順番がありますがそれは表・裏の２つではなく A G C Tの４文字からなるものですそしてそこには「制限酵素」と呼ばれる小さい化学的ハサミがありますこのハサミはあるパターンに遭遇するとそこでDNAを切ります現代分子生物学でこのハサミは非常に便利な道具ですそして「表・裏・表が出るまでの長さは？」という質問ではなく「G-A-A-Gパターンが出た時に制御酵素で切るとしてその塊の長さは？」と― 質問できるわけです

That's a rather trivial connection between probability and genetics. There's a much deeper connection, which I don't have time to go into and that is that modern genetics is a really exciting area of science. And we'll hear some talks later in the conference specifically about that. But it turns out that unlocking the secrets in the information generated by modern experimental technologies, a key part of that has to do with fairly sophisticated -- you'll be relieved to know that I do something useful in my day job, rather more sophisticated than the head-tail-head story -- but quite sophisticated computer modelings and mathematical modelings and modern statistical techniques. And I will give you two little snippets -- two examples -- of projects we're involved in in my group in Oxford, both of which I think are rather exciting. You know about the Human Genome Project. That was a project which aimed to read one copy of the human genome. The natural thing to do after you've done that -- and that's what this project, the International HapMap Project, which is a collaboration between labs in five or six different countries. Think of the Human Genome Project as learning what we've got in common, and the HapMap Project is trying to understand where there are differences between different people.

これは確率と遺伝学の間の些細な問題ですが説明する時間が無いのですがそこにはもっと深い関連がありますだから現代遺伝学は本当にワクワクする科学分野なのですこの後にも同じことについての TEDトークがありますよ現代の実験技術から生まれた情報で解明した結果の重要部分はかなり洗練されています皆さんご安心ください私の日常の仕事は表裏よりももっと高等で有益なことですとても複雑なコンピューターモデリングと数学的モデリングと統計学的モデリングをしていますでは皆さんにオックスフォード大学の私の研究チームが参加している２つのプロジェクトを少しご説明します２つともとても面白いですよヒトゲノム計画はご存知でしょうそれは一人分のゲノム全体を読み解くことを目的としていましたそれが完了したので次は国際HapMap計画ですこれは５～６カ国の研究室が共同で行っていますヒトゲノム計画では人類共通の遺伝情報について解析しましたが HapMap計画は民族集団の間にある違いを解明しようとしています

Why do we care about that? Well, there are lots of reasons. The most pressing one is that we want to understand how some differences make some people susceptible to one disease -- type-2 diabetes, for example -- and other differences make people more susceptible to heart disease, or stroke, or autism and so on. That's one big project. There's a second big project, recently funded by the Wellcome Trust in this country, involving very large studies -- thousands of individuals, with each of eight different diseases, common diseases like type-1 and type-2 diabetes, and coronary heart disease, bipolar disease and so on -- to try and understand the genetics. To try and understand what it is about genetic differences that causes the diseases. Why do we want to do that? Because we understand very little about most human diseases. We don't know what causes them. And if we can get in at the bottom and understand the genetics, we'll have a window on the way the disease works, and a whole new way about thinking about disease therapies and preventative treatment and so on. So that's, as I said, the little diversion on my main love.

何故それが必要なのでしょうか？その理由は沢山あります最も緊急な課題はどの遺伝子の違いが２型糖尿病や心臓病脳卒中自閉症などの疾患を発症しやすくさせるかということを解明することですこれが１つの大きなプロジェクトです２番目の大きなプロジェクトは最近ウェルカム・トラスト（研究者支援団体）から研究費提供を受けています１型および２型糖尿病冠動脈性心疾患双極性障害など頻度の高い８つの疾患のそれぞれの患者が何千人も協力してその遺伝子を解析するという大がかりなものですその疾患を引き起こす遺伝子の違いを解析するのですなぜそんなことをしたいのか？なぜならヒトの疾患についてほとんど解明されていないからです疾患の原因を知らないのですもしも人類が遺伝学についてその基本を理解したなら病気の仕組みが理解できて治療や予防的措置などについての考え方が一新するでしょう前にも言ったようにこれが私の情熱の一端です

Back to some of the more mundane issues of thinking about uncertainty. Here's another quiz for you -- now suppose we've got a test for a disease which isn't infallible, but it's pretty good. It gets it right 99 percent of the time. And I take one of you, or I take someone off the street, and I test them for the disease in question. Let's suppose there's a test for HIV -- the virus that causes AIDS -- and the test says the person has the disease. What's the chance that they do? The test gets it right 99 percent of the time. So a natural answer is 99 percent. Who likes that answer? Come on -- everyone's got to get involved. Don't think you don't trust me anymore. (Laughter) Well, you're right to be a bit skeptical, because that's not the answer. That's what you might think. It's not the answer, and it's not because it's only part of the story. It actually depends on how common or how rare the disease is. So let me try and illustrate that. Here's a little caricature of a million individuals. So let's think about a disease that affects -- it's pretty rare, it affects one person in 10,000. Amongst these million individuals, most of them are healthy and some of them will have the disease. And in fact, if this is the prevalence of the disease, about 100 will have the disease and the rest won't. So now suppose we test them all. What happens? Well, amongst the 100 who do have the disease, the test will get it right 99 percent of the time, and 99 will test positive. Amongst all these other people who don't have the disease, the test will get it right 99 percent of the time. It'll only get it wrong one percent of the time. But there are so many of them that there'll be an enormous number of false positives. Put that another way -- of all of them who test positive -- so here they are, the individuals involved -- less than one in 100 actually have the disease. 
So even though we think the test is accurate, the important part of the story is there's another bit of information we need.

もっとありふれた「不確かさ」について考える問題に戻りましょう皆さんにもう１つクイズがありますあなたはある病気に対して完全ではないがかなり良い検査を受けましたその検査は99%正確です私は皆さんの内の１人もしくは通行人から数人を選んでその検査をしたとします例えばHIV（エイズウィルス）の検査だとしましょうそして検査結果は陽性（感染あり）だったとします彼らが本当にHIVに罹っている可能性は？ 99%正確なテストですよ 99%と答えるのが当たり前ですねそうだと思う人は？皆さん参加して下さいよ！誰一人として私を信用していないとは思いませんが（笑）皆さんは「少し疑った方が良いかも　答えは違うのです」そう思っているかも知れません答えは違います何故なら話はまだ一部だからです実は罹患率の高さでこの答えは変わってきます詳しく説明しましょうここに100万人を表した図があります１万人に１人しか罹らないとても罹患率の低い病気を考えましょう 100万人のうちほとんどは健康でわずかの人数がその患者です先ほどの罹患率で言えば 100人だけが病気ですでは全員を検査するとしてどうなるでしょう？病気にかかっている100人の内で 99％正確な検査なので 99人の検査が陽性となります残りの病気じゃない人たちにも 99％正確な検査ですので 1%に間違った結果が出ます結果多くの数の人たちが偽陽性になってしまうのですこうも考えられます― 陽性の結果が出た全員の内で ―こちらの人たちです― 実際の患者は 100分の1よりも低い確率ですですから正確だと思える検査でもそのほとんどの場合でもっと情報が必要なのです
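
The head count in the example above can be reproduced in a few lines. This is a sketch added for illustration. It assumes, as the talk does, that the same 99 percent accuracy applies both to people who have the disease and to people who do not. The function name `screening_counts` is hypothetical. Exact fractions are used so that no rounding creeps in.

```python
from fractions import Fraction


def screening_counts(population, prevalence, accuracy):
    """Return (true positives, false positives, share of positives who are sick)."""
    sick = population * prevalence
    healthy = population - sick
    true_positives = sick * accuracy            # sick people the test gets right
    false_positives = healthy * (1 - accuracy)  # healthy people the test gets wrong
    share_sick = true_positives / (true_positives + false_positives)
    return true_positives, false_positives, share_sick


# Numbers from the talk: a million people, one in 10,000 affected, 99% accurate test.
tp, fp, share = screening_counts(1_000_000, Fraction(1, 10_000), Fraction(99, 100))
print(tp, fp, share)  # 99 9999 1/102
```

So 99 true positives sit among 9,999 false positives, and only one positive result in 102 belongs to someone who actually has the disease, which is the "less than one in 100" quoted above.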

Here's the key intuition. What we have to do, once we know the test is positive, is to weigh up the plausibility, or the likelihood, of two competing explanations. Each of those explanations has a likely bit and an unlikely bit. One explanation is that the person doesn't have the disease -- that's overwhelmingly likely, if you pick someone at random -- but the test gets it wrong, which is unlikely. The other explanation is that the person does have the disease -- that's unlikely -- but the test gets it right, which is likely. And the number we end up with -- that number which is a little bit less than one in 100 -- is to do with how likely one of those explanations is relative to the other. Each of them taken together is unlikely.

これがキーなのです検査で陽性と出た時にやらなければないけないことはその妥当性やもっともらしさ（尤度）を対立する２つの仮説から評価することですその仮説にはそれぞれ少しずつ成立する時としない時がありますランダムに１人を選んだ場合一方の仮説ではその人が病気でない尤度は非常に高いが検査結果が間違い（偽陽性）である尤度は低いもう一方の仮説はその人が病気である尤度は低いが検査結果が正しい（真陽性）尤度は高いというものです最終的に統計学者が出すのはその可能性が100分の1より低いかどうかつまりどちらの仮説が他方より高い尤度をもつかということですいずれの仮説も総合的には尤度が低いのです
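
Written out as a worked equation, weighing the two explanations against each other looks like this. It uses the numbers of the disease example (prevalence one in 10,000, test right 99 percent of the time). The notation is added for this illustration, with + standing for a positive test.

```latex
\[
\frac{P(\text{disease} \mid +)}{P(\text{no disease} \mid +)}
  = \frac{P(\text{disease}) \, P(+ \mid \text{disease})}
         {P(\text{no disease}) \, P(+ \mid \text{no disease})}
  = \frac{0.0001 \times 0.99}{0.9999 \times 0.01}
  = \frac{1}{101}
\]
```

Each product pairs a likely bit with an unlikely bit, so both are small. Only their ratio matters: odds of 1 to 101 correspond to a probability of 1/102, a little less than one in 100.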

Here's a more topical example of exactly the same thing. Those of you in Britain will know about what's become rather a celebrated case of a woman called Sally Clark, who had two babies who died suddenly. And initially, it was thought that they died of what's known informally as "cot death," and more formally as "Sudden Infant Death Syndrome." For various reasons, she was later charged with murder. And at the trial, her trial, a very distinguished pediatrician gave evidence that the chance of two cot deaths, innocent deaths, in a family like hers -- which was professional and non-smoking -- was one in 73 million. To cut a long story short, she was convicted at the time. Later, and fairly recently, acquitted on appeal -- in fact, on the second appeal. And just to set it in context, you can imagine how awful it is for someone to have lost one child, and then two, if they're innocent, to be convicted of murdering them. To be put through the stress of the trial, convicted of murdering them -- and to spend time in a women's prison, where all the other prisoners think you killed your children -- is a really awful thing to happen to someone. And it happened in large part here because the expert got the statistics horribly wrong, in two different ways.

もっと話題になるような例を出してみましょうイギリス人ならサリー・クラークの有名な事例をご存知でしょう彼女には赤ん坊が２人いましたが突然亡くなってしまいました当初その２人は「コット・デス」つまり新生児突然死症候群で亡くなったと考えられていましたしかしいろいろあってサリーは殺人者にさせられたのです裁判ではとても著名な小児科医がこう証言しました「サリーの様に専門的職業を持ちかつ非喫煙者の家庭にコット・デスが非犯罪的に２回も起こる確率は 7,300万分の１である」端折りますがサリーは有罪判決を受けましたその後つい最近になって控訴審で無罪になりましたその人の身になって考えてみて下さい我が子を２人もたて続けに亡くした人が２人を殺したとして有罪になるこの事件が犯罪でなかったとしたらどれだけひどいことでしょう裁判を通しての精神的重圧や殺人と判決されること女性刑務所で過ごす間他の犯罪者に子どもを殺したと思われることは当事者にとって本当に悲劇と言いようがありませんそんなことが実際に起こったのです何故ならその専門家は２つの方法で統計を間違って解釈したのです

So where did he get the one in 73 million number? He looked at some research, which said the chance of one cot death in a family like Sally Clark's is about one in 8,500. So he said, "I'll assume that if you have one cot death in a family, the chance of a second child dying from cot death aren't changed." So that's what statisticians would call an assumption of independence. It's like saying, "If you toss a coin and get a head the first time, that won't affect the chance of getting a head the second time." So if you toss a coin twice, the chance of getting a head twice are a half -- that's the chance the first time -- times a half -- the chance a second time. So he said, "Here, I'll assume that these events are independent. When you multiply 8,500 together twice, you get about 73 million." And none of this was stated to the court as an assumption or presented to the jury that way. Unfortunately here -- and, really, regrettably -- first of all, in a situation like this you'd have to verify it empirically. And secondly, it's palpably false. There are lots and lots of things that we don't know about sudden infant deaths. It might well be that there are environmental factors that we're not aware of, and it's pretty likely to be the case that there are genetic factors we're not aware of. So if a family suffers from one cot death, you'd put them in a high-risk group. They've probably got these environmental risk factors and/or genetic risk factors we don't know about. And to argue, then, that the chance of a second death is as if you didn't know that information is really silly. It's worse than silly -- it's really bad science. Nonetheless, that's how it was presented, and at trial nobody even argued it. That's the first problem. The second problem is, what does the number of one in 73 million mean? 
So after Sally Clark was convicted -- you can imagine, it made rather a splash in the press -- one of the journalists from one of Britain's more reputable newspapers wrote that what the expert had said was, "The chance that she was innocent was one in 73 million." Now, that's a logical error. It's exactly the same logical error as the logical error of thinking that after the disease test, which is 99 percent accurate, the chance of having the disease is 99 percent. In the disease example, we had to bear in mind two things, one of which was the possibility that the test got it right or not. And the other one was the chance, a priori, that the person had the disease or not. It's exactly the same in this context. There are two things involved -- two parts to the explanation. We want to know how likely, or relatively how likely, two different explanations are. One of them is that Sally Clark was innocent -- which is, a priori, overwhelmingly likely -- most mothers don't kill their children. And the second part of the explanation is that she suffered an incredibly unlikely event. Not as unlikely as one in 73 million, but nonetheless rather unlikely. The other explanation is that she was guilty. Now, we probably think a priori that's unlikely. And we certainly should think in the context of a criminal trial that that's unlikely, because of the presumption of innocence. And then if she were trying to kill the children, she succeeded. So the chance that she's innocent isn't one in 73 million. We don't know what it is. It has to do with weighing up the strength of the other evidence against her and the statistical evidence. We know the children died. What matters is how likely or unlikely, relative to each other, the two explanations are. And they're both implausible. There's a situation where errors in statistics had really profound and really unfortunate consequences. 
In fact, there are two other women who were convicted on the basis of the evidence of this pediatrician, who have subsequently been released on appeal. Many cases were reviewed. And it's particularly topical because he's currently facing a disrepute charge at Britain's General Medical Council.

その小児科医は7,300万分の1という数字をどこから出したのでしょう？彼が読んだいくつかの研究にはサリーと似たような家庭内で起こるコット・デスは約8,500分の1とあったのですですから彼はこう言いました「家庭内のコット・デスが一度起きた場合と２度目のコット・デスが起こる確率は変わらないと仮定する」統計学者はこれを「事象が独立である」と言い「コイントスをして最初に表が出ても２回目も表が出る確率に影響しない」と言うことですつまりコインを２回トスして２回とも表になる可能性は 1回目の確率の50%で 0.5 × 0.5になるのですだから彼はこう言いました「２つの出来事は独立していると仮定する 8,500を二乗すれば 7,300万になる」それが仮定だとは裁判で語られませんでしたし陪審員にもそのように伝えていませんでしたとても残念ですまず最初にこの状況ではその仮定が経験的に妥当か確かめるべきでした第二にそれは明白な誤りです新生児の突然死には解明されていないことが山ほどありますまだ発見されていない環境因子があるかもしれませんしまだ発見されていない遺伝学的因子により引き起こされた可能性も高いのですですからコット・デスが起こった家族はハイリスク群に属するかも知れませんそこにはまだ知られていない環境的危険因子があったりその上遺伝学的危険因子があるかもしれないのですこういう情報を知らないかのように２番目の死亡の確率を語るのは本当に愚かなことです愚かであるよりも実に悪質な科学ですそれなのにあんなことが裁判で示され誰もそのことを議論しなかったそれが最初の問題です 2番目の問題は7,300万分の１という数字の意味するところですサリー・クラークが有罪になった後それが報道で波紋を呼んだというのは想像に難くありませんイギリスで影響力のある新聞社の記者はこう書きました「専門家が言うことには― 『この女が無罪である確率は 7,300万分の１』とのこと」そうこれは論理的エラーですこの論理的エラーは先ほどの99%確実な検査なら病気に罹っている確率も99%だという論理的エラーと全く同じものですその例から覚えておくべきことは２つです１つはその検査が正しいか正しくないかの可能性もう１つはその人が病気にかかっている可能性の推測この状況では全く同じことですそこにも２段階の説明が必要です２つの異なった事象の尤度がどれほどかまた関連して起こる尤度はどうでしょう？１つ目の事象はサリーが無罪であることそれは常識的に圧倒的に高尤度ですほとんどの母親は我が子を殺しません２つ目の事象は彼女がこの非常に低尤度な出来事に遭遇したこと 7,300万分の１の数字程ではありませんがいずれにしても起きにくいことです対する事象はサリーが有罪ということです今ならそれが普通に考えて低尤度だと思うでしょう刑事裁判として尤度が低いと考えるべきですなぜなら推定無罪の原則があるからですもしも彼女が我が子を殺そうとしたのなら成功しましたサリーが無罪である可能性は 7300万分の１ではありませんそれがどういう数字になるのか分りませんサリーを有罪とする根拠の確からしさとその統計学的根拠で決まります分かっているのは子どもたちが死んだことです争点は２人の死―２つの事象―にはどれほど関連がありうるかということですこの２つは両方ともありえないことですそこには本当に理解しがたく悲劇的結果を生んだ統計学に関してのエラーでしたこの小児科医の論拠が採用されて他にも２人の女性が有罪にされましたが裁判によって結果的に釈放されています多くの事件が再調査されましたこの小児科医は現在イギリス医学会議で査問にかけられていることが話題になっています
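
The arithmetic behind the headline figure, and how heavily it leans on the independence assumption, can be sketched as follows. The one-in-8,500 figure is the one quoted in the talk. The parameter `relative_risk_after_first` and the value 10 passed to it are purely hypothetical, chosen only to show the effect. They are not an estimate of the real risk, which the talk says is unknown.

```python
from fractions import Fraction

SINGLE_DEATH = Fraction(1, 8_500)  # chance of one cot death, as quoted in the talk


def chance_of_two(single, relative_risk_after_first=1):
    """P(two deaths) = P(first) * P(second | first).

    relative_risk_after_first = 1 is the independence assumption;
    any other value is a hypothetical illustration, not data.
    """
    return single * (single * relative_risk_after_first)


independent = chance_of_two(SINGLE_DEATH)
print(independent)  # 1/72250000

# HYPOTHETICAL: suppose a first death made a second one ten times more likely.
dependent = chance_of_two(SINGLE_DEATH, relative_risk_after_first=10)
print(dependent)  # 1/7225000
```

Squaring the rounded 8,500 gives about 72 million, in line with the "about 73 million" quoted in court. Any dependence between the two deaths scales the answer by the same factor, so the headline number is only as good as the untested assumption behind it.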

So just to conclude -- what are the take-home messages from this? Well, we know that randomness and uncertainty and chance are very much a part of our everyday life. It's also true -- and, although, you, as a collective, are very special in many ways, you're completely typical in not getting the examples I gave right. It's very well documented that people get things wrong. They make errors of logic in reasoning with uncertainty. We can cope with the subtleties of language brilliantly -- and there are interesting evolutionary questions about how we got here. We are not good at reasoning with uncertainty. That's an issue in our everyday lives. As you've heard from many of the talks, statistics underpins an enormous amount of research in science -- in social science, in medicine and indeed, quite a lot of industry. All of quality control, which has had a major impact on industrial processing, is underpinned by statistics. It's something we're bad at doing. At the very least, we should recognize that, and we tend not to. To go back to the legal context, at the Sally Clark trial all of the lawyers just accepted what the expert said. So if a pediatrician had come out and said to a jury, "I know how to build bridges. I've built one down the road. Please drive your car home over it," they would have said, "Well, pediatricians don't know how to build bridges. That's what engineers do." On the other hand, he came out and effectively said, or implied, "I know how to reason with uncertainty. I know how to do statistics." And everyone said, "Well, that's fine. He's an expert." So we need to understand where our competence is and isn't. Exactly the same kinds of issues arose in the early days of DNA profiling, when scientists, and lawyers and in some cases judges, routinely misrepresented evidence. Usually -- one hopes -- innocently, but misrepresented evidence. Forensic scientists said, "The chance that this guy's innocent is one in three million." 
Even if you believe the number, just like the 73 million to one, that's not what it meant. And there have been celebrated appeal cases in Britain and elsewhere because of that.

ではまとめますこのことから何を学びましたか？そうです不規則や不確定偶然は日常的によくあることだということですまた皆さんは多くの場合で集団としてとても特別なのです皆さんがこれらの例を理解できないのは当たり前のことです人々が物事を間違って解釈することは実証済みです人は不確実な理由付けで論理的エラーを犯します言語の微妙さへの対処は得意なのですがそこにどのようにして到達したかについては興味深い進化的問題があります私たちは不確かさについて論証することが苦手なので日々の生活での難問となります多くのTEDトークでわかるように統計学は広範な科学研究を裏付けますその分野は社会科学や医学だけでなく多くの産業にも渡ります生産過程に大きな影響を与えてきた品質管理は全て統計に裏付けられていますそれを理解するのは私たちが不得意とするところです私たちはそれを無視しがちですが最低限認識はすべきですサリー・クラークの裁判に立ち返ると全ての法律家が専門家の言いなりになったのですですからある小児科医が陪審員に「私は橋の建設方法を知っていますこの先に橋を作りましたからその橋を通って帰宅してください」と言ったらきっとこう返事するでしょう「小児科医が橋の建設だって？それはエンジニアがすることだ」それなのに彼のこんな発言は説得力を発揮しました「不確かさの扱いかたを知っています私は統計を理解しているのですから」すると皆はこう言ったのです「結構ですね彼は専門家ですから」ですから私たちは自分はなにが得意かを理解する必要があります全く同じ様な問題がDNA鑑定の初期に発生しました科学者も法律家も時には裁判官たちまでも何度も証拠を間違えて提示したのですたいてい悪意はなく ―そう願います誤った証拠を提示したのです犯罪学者がこう言いました「無罪である確率は300万分の1だ」 7300万分の１の数字同様その数字自体を信じたとしてもそういう意味ではないのですおかげでイギリスやほかの国でもよく知られた控訴例が続出しています

And just to finish in the context of the legal system. It's all very well to say, "Let's do our best to present the evidence." But more and more, in cases of DNA profiling -- this is another one -- we expect juries, who are ordinary people -- and it's documented they're very bad at this -- we expect juries to be able to cope with the sorts of reasoning that goes on. In other spheres of life, if people argued -- well, except possibly for politics -- but in other spheres of life, if people argued illogically, we'd say that's not a good thing. We sort of expect it of politicians and don't hope for much more. In the case of uncertainty, we get it wrong all the time -- and at the very least, we should be aware of that, and ideally, we might try and do something about it. Thanks very much.

法律制度の話として締めくくりますと「証拠を提出するのに最善を尽くしましょう」とはよく言われますが DNA鑑定のような場合何度も同じようなことが起こります陪審員は一般人ですし検証は苦手だと実証されているのに私たちは陪審員が繰り返し出て来る論証法に対処できることを期待してしまいます多分政治に関する場合を除いてある生活側面では論理的に議論し他の側面では論理的でない議論をしたらそれは良くないことだと思うでしょう政治家には起こることかもしれませんがそれ以外で起こってほしくはありませんしかし不確かさを扱う場合私たちはいつも間違いを犯します私たちは最低限それに気づく必要があります理想を言えば何か策を講じられればよいのですがありがとうございました


Related talks

Hans Rosling: The best stats you've ever seen

Michael Shermer: Why people believe weird things

Emily Oster: Flip your thinking on AIDS in Africa

Robert Full: Learning from the gecko's tail

Aubrey de Grey: A roadmap to end aging

E.O. Wilson: Advice to a young scientist
