Peter Donnelly: How juries are fooled by statistics

As other speakers have said, it's a rather daunting experience -- a particularly daunting experience -- to be speaking in front of this audience. But unlike the other speakers, I'm not going to tell you about the mysteries of the universe, or the wonders of evolution, or the really clever, innovative ways people are attacking the major inequalities in our world. Or even the challenges of nation-states in the modern global economy. My brief, as you've just heard, is to tell you about statistics -- and, to be more precise, to tell you some exciting things about statistics. And that's -- (Laughter) -- that's rather more challenging than all the speakers before me and all the ones coming after me. (Laughter) One of my senior colleagues told me, when I was a youngster in this profession, rather proudly, that statisticians were people who liked figures but didn't have the personality skills to become accountants. (Laughter) And there's another in-joke among statisticians, and that's, "How do you tell the introverted statistician from the extroverted statistician?" To which the answer is, "The extroverted statistician's the one who looks at the other person's shoes." (Laughter) But I want to tell you something useful -- and here it is, so concentrate now. This evening, there's a reception in the University's Museum of Natural History. And it's a wonderful setting, as I hope you'll find, and a great icon to the best of the Victorian tradition. It's very unlikely -- in this special setting, and this collection of people -- but you might just find yourself talking to someone you'd rather wish that you weren't. So here's what you do. When they say to you, "What do you do?" -- you say, "I'm a statistician." (Laughter) Well, except they've been pre-warned now, and they'll know you're making it up. And then one of two things will happen. They'll either discover their long-lost cousin in the other corner of the room and run over and talk to them. Or they'll suddenly become parched and/or hungry -- and often both -- and sprint off for a drink and some food. And you'll be left in peace to talk to the person you really want to talk to.

Come gli altri conferenzieri hanno detto, è un'esperienza piuttosto emozionante... davvero molto emozionante... parlare di fronte a quest'audience. Ma a differenza degli altri studiosi, non vi parlerò dei misteri dell'universo, o del miracolo dell'evoluzione, o dei modi innovativi e saggi con cui oggi le persone combattono le più grandi ingiustizie del nostro mondo. Non parlerò neppure dei cambiamenti degli stati nazionali nella nuova economia globale. Come avete appena sentito, il mio compito oggi è dirvi qualcosa sulle statistiche... e, per essere precisi, dire qualcosa di eccitante sulle statistiche. E questo... (Risate) è addirittura più impegnativo di tutti gli oratori che anno parlato prima e che parleranno dopo di me. (Risate) Quando ero ancora un novellino in questo campo, uno dei miei colleghi più anziani mi disse, piuttosto orgoglioso, che gli esperti di statistica sono persone a cui piacciono i numeri ma che non posseggono le abilità sociali giuste per diventare contabili. (Risate) Ci sarebbe un'altra battuta, cioè "Da cosa si riconosce un esperto di statistica introverso da uno estroverso?" La risposta è questa, "L'estroverso è quello che guarda le scarpe di chi gli è di fronte." (Risate) Ma ora vorrei raccontarvi qualcosa di utile... eccoci, provate a concentrarvi. Stasera ci sarà un ricevimento al Museo Universitario di Storia Naturale. E' un luogo meraviglioso, spero possa piacere anche a voi, un'icona della migliore tradizione vittoriana. E' piuttosto improbabile... in questo luogo particolare, e con questo tipo di persone... ma potreste ritrovarvi a parlare con qualcuno anche se sperereste il contrario. Ed ecco quello che fareste- Quando vi chiederanno "Di cosa ti occupi?"... risponderete: "Di statistica". (Risate) Beh, a parte il fatto che ora saranno preavvisati, quindi sapranno che cosa gli svelerete. E ora la scelta è fra due possibilità. Potrebbero avvistare il cugino di quinto grado dall'altra parte della stanza e correre da lui per salutarlo. Oppure potrebbe immediatamente sentirsi assetati e/o affamati... di solito entrambe le cose... e lanciarsi alla ricerca di qualcosa da bere e da mangiare. A questo punto, sarete lasciati in pace a chiacchierare con la persona con cui volevate davvero parlare.

It's one of the challenges in our profession to try and explain what we do. We're not top on people's lists for dinner party guests and conversations and so on. And it's something I've never really found a good way of doing. But my wife -- who was then my girlfriend -- managed it much better than I've ever been able to. Many years ago, when we first started going out, she was working for the BBC in Britain, and I was, at that stage, working in America. I was coming back to visit her. She told this to one of her colleagues, who said, "Well, what does your boyfriend do?" Sarah thought quite hard about the things I'd explained -- and she concentrated, in those days, on listening. (Laughter) Don't tell her I said that. And she was thinking about the work I did developing mathematical models for understanding evolution and modern genetics. So when her colleague said, "What does he do?" She paused and said, "He models things." (Laughter) Well, her colleague suddenly got much more interested than I had any right to expect and went on and said, "What does he model?" Well, Sarah thought a little bit more about my work and said, "Genes." (Laughter) "He models genes."

Una delle vere sfide nel nostro lavoro è provare a spiegare quello che facciamo. Non siamo nella lista d'onore per essere invitati ad una cena o per fare conversazione. Queste sono cose in cui non sono mai riuscito troppo bene. Ma mia moglie... che è stata anche la mia fidanzata... affrontava la questione molto meglio di quanto abbia mai fatto io. Molti anni fa, quando abbiamo iniziato ad uscire insieme, lavorava per la BBC in Inghilterra mentre io, in quel periodo, lavoravo in America. Tornavo per andare a trovarla. Ecco quello che ha risposto ad un suo collega che le aveva chiesto "Beh, di che cosa si occupa il tuo ragazzo?" Sarah si mise a pensare a fondo alle cose che le avevo detto... e si concentrò, in quel periodo, ad ascoltarmi. (Risate) Ma non raccontatele che ve l'ho detto. Ragionò sul lavoro che stavo facendo per sviluppare modelli matematici per la comprensione dell'evoluzione e della genetica moderna. Quindi quando il suo collega le chiese "Di cosa si occupa?" aspettò un momento e rispose "Modella le cose". (Risate) Bene, il collega tutto d'un tratto diventò molto più interessato di quanto avrei mai sospettato e continuò chiedendo "E cosa modella?" Ora, Sarah ci pensò ancora un attimo e disse "I geni". (Risate) "Modella i geni."

That is my first love, and that's what I'll tell you a little bit about. What I want to do more generally is to get you thinking about the place of uncertainty and randomness and chance in our world, and how we react to that, and how well we do or don't think about it. So you've had a pretty easy time up till now -- a few laughs, and all that kind of thing -- in the talks to date. You've got to think, and I'm going to ask you some questions. So here's the scene for the first question I'm going to ask you. Can you imagine tossing a coin successively? And for some reason -- which shall remain rather vague -- we're interested in a particular pattern. Here's one -- a head, followed by a tail, followed by a tail.

Questo è stato il mio primo amore, e di cui vi parlerò per un po'. In generale, mi piacerebbe che ragionassimo insieme su come il nostro mondo sia un luogo di incertezza, casualità e opportunità e quale sia la nostra reazione riguardo, e come pensiamo o non pensiamo a questo. Avete passato dei bei momenti... qualche risata, questo tipo di cose... durante questa conferenza. Dovrete riflettere, e io vi farò qualche domanda. Ecco lo scenario per la mia prima domanda. Riuscite ad immaginarvi mentre lanciate una moneta di continuo? E per qualche ragione... che rimarrà piuttosto vaga... siamo interessati ad un aspetto in particolare. Ad esempio... testa, poi croce, poi ancora croce.

So suppose we toss a coin repeatedly. Then the pattern, head-tail-tail, that we've suddenly become fixated with happens here. And you can count: one, two, three, four, five, six, seven, eight, nine, 10 -- it happens after the 10th toss. So you might think there are more interesting things to do, but humor me for the moment. Imagine this half of the audience each get out coins, and they toss them until they first see the pattern head-tail-tail. The first time they do it, maybe it happens after the 10th toss, as here. The second time, maybe it's after the fourth toss. The next time, after the 15th toss. So you do that lots and lots of times, and you average those numbers. That's what I want this side to think about.

Immaginiamo di lanciare la moneta. A questo punto il modello testa-croce-croce su cui ci eravamo improvvisamente concentrati si ripete. E voi potete contare: uno, due, tre, quattro, cinque, sei, sette, otto, nove, dieci... Si ripete dopo il decimo lancio. Ora, potreste pensare che ci siano cose decisamente più interessanti da fare, ma fatemi felice per un istante. Immaginate che ognuno in questa metà del pubblico tiri fuori delle monete e le lanci fino a vedere la sequenza testa-croce-croce. La prima volta che si presenta potrebbe essere dopo 10 tiri, come nel nostro caso. La seconda forse dopo il quarto. La volta successiva dopo 15. Quindi lo fate un sacco di volte, e otterreste questi numeri. E' questo di cui chiedo a questa parte del pubblico di pensare.

The other half of the audience doesn't like head-tail-tail -- they think, for deep cultural reasons, that's boring -- and they're much more interested in a different pattern -- head-tail-head. So, on this side, you get out your coins, and you toss and toss and toss. And you count the number of times until the pattern head-tail-head appears and you average them. OK? So on this side, you've got a number -- you've done it lots of times, so you get it accurately -- which is the average number of tosses until head-tail-tail. On this side, you've got a number -- the average number of tosses until head-tail-head.

Diciamo che all'altra metà la combinazione testa-croce-croce non piaccia... a causa di profonde ragioni culturali pensano sia noiosa... e trovano decisamente più interessante un'altra combinazione... testa-croce-testa. Quindi, da questa parte, tirate fuori monete per poi lanciare e lanciare e ancora lanciare. Poi contate il numero di lanci prima che si verifichi testa-croce-testa e fate la media. D'accordo? Da quest'altra parte, avete un numero... l'avete ripetuto molte volte, quindi è preciso... che è la media di lanci prima di ottenere testa-croce-croce. Invece il numero che ottenete voi... la media di lanci per testa-croce-testa.

So here's a deep mathematical fact -- if you've got two numbers, one of three things must be true. Either they're the same, or this one's bigger than this one, or this one's bigger than that one. So what's going on here? So you've all got to think about this, and you've all got to vote -- and we're not moving on. And I don't want to end up in the two-minute silence to give you more time to think about it, until everyone's expressed a view. OK. So what you want to do is compare the average number of tosses until we first see head-tail-head with the average number of tosses until we first see head-tail-tail.

E qui abbiamo un'importante verità matematica... quando avete due numeri, uno di questi tre fatti è vero per forza. O sono uguali, o il primo è maggiore, o è il secondo a essere maggiore. E nel nostro caso cosa succede? Ora, pensate a questo, e poi dovrete votare... e non andremo avanti senza farlo. E non voglio neanche cadere nei due minuti di silenzio per darvi altro tempo per pensarci, finché tutti non avranno espresso il loro parere. Bene. Quello che dovreste fare è paragonare la media del numero di lanci fino ad ottenere testa-croce-testa con la media di testa-croce-croce.

Who thinks that A is true -- that, on average, it'll take longer to see head-tail-head than head-tail-tail? Who thinks that B is true -- that on average, they're the same? Who thinks that C is true -- that, on average, it'll take less time to see head-tail-head than head-tail-tail? OK, who hasn't voted yet? Because that's really naughty -- I said you had to. (Laughter) OK. So most people think B is true. And you might be relieved to know even rather distinguished mathematicians think that. It's not. A is true here. It takes longer, on average. In fact, the average number of tosses till head-tail-head is 10 and the average number of tosses until head-tail-tail is eight. How could that be? Anything different about the two patterns? There is. Head-tail-head overlaps itself. If you went head-tail-head-tail-head, you can cunningly get two occurrences of the pattern in only five tosses. You can't do that with head-tail-tail. That turns out to be important.

Chi pensa che la proposta A sia corretta... cioè, che secondo la media, impiegheremo più lanci per testa-croce-testa che per testa-croce-croce? E chi pensa che la corretta sia la B... secondo la media, sono due numeri uguali? Chi crede che C sia giusta... secondo la media, ci si impiega di meno per ottenere testa-croce-testa che testa-croce-croce? Bene, e chi è che non ha ancora votato? Siete davvero disubbidienti... vi ho detto che dovete votare. (Risate) Allora, la maggioranza delle persone pensano che sia corretta la proposta B. A spero siate sollevati dal fatto che anche i matematici più illustri pensano sia così. Solo che non è vero. La risposta corretta è A. Considerando la media, si impiegano tempi diversi. Infatti, la media di lanci prima di testa-croce-testa è 10 mentre la media per testa-croce-croce è 8. Come può essere possibile? C'è qualcosa di diverso nelle due combinazioni? In effetti, sì. Testa-croce-testa si sovrappone. Se vi trovate testa-croce-testa-croce-testa, riuscirete a trovare facilmente per due volte la combinazione in solo cinque lanci. Ma non ci riuscirete con testa-croce-croce. E questo alla fine è importante.

There are two ways of thinking about this. I'll give you one of them. So imagine -- let's suppose we're doing it. On this side -- remember, you're excited about head-tail-tail; you're excited about head-tail-head. We start tossing a coin, and we get a head -- and you start sitting on the edge of your seat because something great and wonderful, or awesome, might be about to happen. The next toss is a tail -- you get really excited. The champagne's on ice just next to you; you've got the glasses chilled to celebrate. You're waiting with bated breath for the final toss. And if it comes down a head, that's great. You're done, and you celebrate. If it's a tail -- well, rather disappointedly, you put the glasses away and put the champagne back. And you keep tossing, to wait for the next head, to get excited.

Ci sono due interpretazioni possibili per questo fatto. Ora ve ne spiego una. Immaginiamo... diciamo che state facendo la stessa cosa di prima. Da questa parte... vi ricordate, vi interessa testa-croce-croce, e a voi testa-croce-testa. Lanciamo una moneta, e viene fuori testa... ora siete aggrappati al bordo delle vostre poltroncine perché qualcosa di magico e meraviglioso, di splendido, potrebbe accadere. Ora lanciamo di nuovo e esce croce... siete tutti molto emozionati. Lo champagne è nella ghiacciaia vicino a voi, e i bicchieri sono pronti per brindare. Aspettate l'ultimo lancio trattenendo il respiro. E se esce testa, è perfetto. Ce l'avrete fatta e inizierete a festeggiare. Se invece è croce... beh, con disappunto dovrete mettere via i bicchieri e riporre lo champagne. E continuerete a lanciare, aspettando testa, per emozionarvi di nuovo.

On this side, there's a different experience. It's the same for the first two parts of the sequence. You're a little bit excited with the first head -- you get rather more excited with the next tail. Then you toss the coin. If it's a tail, you crack open the champagne. If it's a head you're disappointed, but you're still a third of the way to your pattern again. And that's an informal way of presenting it -- that's why there's a difference. Another way of thinking about it -- if we tossed a coin eight million times, then we'd expect a million head-tail-heads and a million head-tail-tails -- but the head-tail-heads could occur in clumps. So if you want to put a million things down amongst eight million positions and you can have some of them overlapping, the clumps will be further apart. It's another way of getting the intuition.

Da quest'altra parte, l'esperienza sarà diversa. E' identica per i primi due lanci. Sarete emozionati all'uscita di testa... lo sarete ancora di più quando esce croce. Ora lancio la moneta. Se esce croce, stapperete lo champagne. Se esce testa vi dispiacerà, ma sarete comunque a un terzo della strada per ottenere la combinazione giusta. Questo è un modo informale per presentare il problema... è qui che c'è la differenza. Ecco un'altro modo di pensarla... se lanciamo una moneta otto milioni di volte, ci possiamo aspettare un milione di testa-croce-testa e un milione di testa-croce-croce... ma testa-croce-testa può presentarsi in gruppi. Quindi, se volete provare a mettere un milione di cose in otto milioni di posizioni che possono essere sovrapposte, i gruppi saranno più lontani gli uni dagli altri. Questo è un altro modo per comprendere il fenomeno.

What's the point I want to make? It's a very, very simple example, an easily stated question in probability, which every -- you're in good company -- everybody gets wrong. This is my little diversion into my real passion, which is genetics. There's a connection between head-tail-heads and head-tail-tails in genetics, and it's the following. When you toss a coin, you get a sequence of heads and tails. When you look at DNA, there's a sequence of not two things -- heads and tails -- but four letters -- As, Gs, Cs and Ts. And there are little chemical scissors, called restriction enzymes which cut DNA whenever they see particular patterns. And they're an enormously useful tool in modern molecular biology. And instead of asking the question, "How long until I see a head-tail-head?" -- you can ask, "How big will the chunks be when I use a restriction enzyme which cuts whenever it sees G-A-A-G, for example? How long will those chunks be?"

Dov'è che voglio arrivare? E' un esempio davvero molto semplice, un problema di facile soluzione con le probabilità, e che tutti... siete in buona compagnia... tutti sbagliano. Questa è una piccola variante della mia vera passione, la genetica. C'è un legame tra testa-croce-testa e testa-croce-croce in genetica, ora ve la spiego. Lanciando una moneta, si ha una sequenza di teste e i croci. Quando pensate al DNA non c'è una sequenza di due elementi... testa e croce... ma di quattro lettere... A, G, C e T. E ci sono delle piccole forbici, gli enzimi di restrizione, che tagliano il DNA ogni volta che trovano delle combinazioni particolari. Sono degli strumenti di enorme importanza nella moderna biologia molecolare. Quindi, piuttosto che chiederci "Quanti lanci impiegherò per avere testa-croce-testa?" potremo domandare "Quanto grosse saranno le parti se uso un enzima di restrizione che taglia il DNA ogni volta che trova, ad esempio, G-A-A-G? E quanto lungo sarà?"

That's a rather trivial connection between probability and genetics. There's a much deeper connection, which I don't have time to go into and that is that modern genetics is a really exciting area of science. And we'll hear some talks later in the conference specifically about that. But it turns out that unlocking the secrets in the information generated by modern experimental technologies, a key part of that has to do with fairly sophisticated -- you'll be relieved to know that I do something useful in my day job, rather more sophisticated than the head-tail-head story -- but quite sophisticated computer modelings and mathematical modelings and modern statistical techniques. And I will give you two little snippets -- two examples -- of projects we're involved in in my group in Oxford, both of which I think are rather exciting. You know about the Human Genome Project. That was a project which aimed to read one copy of the human genome. The natural thing to do after you've done that -- and that's what this project, the International HapMap Project, which is a collaboration between labs in five or six different countries. Think of the Human Genome Project as learning what we've got in common, and the HapMap Project is trying to understand where there are differences between different people.

Questa è una connessione abbastanza superficiale tra probabilità e genetica. Ce n'è una molto più profonda, su cui non ho il tempo di dilungarmi ed è per questo che la genetica moderna è un'area della scienza così eccitante. Più tardi potremo sentire degli interventi specifici su questo problema. Si è anche scoperto che scoprire i segreti delle informazioni generate dagli esperimenti tecnologici si basa fondamentalmente su forme più sofisticate... credo sarete sollevati nel sapere che faccio anche qualcosa di utile nel mio lavoro, di più sofisticato del trucco con testa-croce-testa... sui modelli sofisticati al computer, sui modelli matematici e sulle moderne tecniche statistiche. Vi darò due flash... insomma, due esempi... sui progetti di cui mi occupo con il mio gruppo di Oxford, entrambi piuttosto interessanti. Conoscerete tutti il Human Genome Project. Era un progetto che aveva lo scopo di decifrare il genoma umano. La cosa ovvia da fare dopo esserci riusciti... è la nascita di un altro progetto, l'International HapMap Project, una collaborazione tra i laboratori di cinque o sei diversi paesi. Provate ad immaginare lo Humane Genome Project come la comprensione di cosa abbiamo in comune, e lo HapMap Project come un tentativo di capire dove si trovino le differenze tra persone diverse.

Why do we care about that? Well, there are lots of reasons. The most pressing one is that we want to understand how some differences make some people susceptible to one disease -- type-2 diabetes, for example -- and other differences make people more susceptible to heart disease, or stroke, or autism and so on. That's one big project. There's a second big project, recently funded by the Wellcome Trust in this country, involving very large studies -- thousands of individuals, with each of eight different diseases, common diseases like type-1 and type-2 diabetes, and coronary heart disease, bipolar disease and so on -- to try and understand the genetics. To try and understand what it is about genetic differences that causes the diseases. Why do we want to do that? Because we understand very little about most human diseases. We don't know what causes them. And if we can get in at the bottom and understand the genetics, we'll have a window on the way the disease works, and a whole new way about thinking about disease therapies and preventative treatment and so on. So that's, as I said, the little diversion on my main love.

Perché ci interessano queste cose? Sapete, ci sono molte ragioni. Quella più importante è che vogliamo capire come alcune differenze rendano alcune persone predisposte ad alcune malattie... il diabete di tipo 2, per esempio... e altre differenze portino predisposizione a malattie cardiache, ictus o autismo. E' un progetto davvero straordinario. E ce n'è anche un secondo, fondato da poco in questo stato dal Wellcome Trust, che coinvolge un altissimo numero di studi... migliaia di persone, che soffrono di uno fra otto differenti sindromi, malattie comuni come il diabete di primo e secondo tipo, disfunzioni coronariche, disturbo bipolare e altre... e cerchiamo di individuarne le cause genetiche. Individuare e capire quali sono le differenze genetiche che causano queste malattie. Perché vogliamo farlo? Perché sappiamo davvero molto poco sulla maggior parte delle malattie umane. Non sappiamo cosa le provochi. Se riuscissimo ad andare a monte e conoscerne la genesi, saremo un passo in avanti nella comprensione di come si sviluppano. E quindi avremo un modo del tutto nuovo di programmare le terapie e i trattamenti preventivi. E questo, come vi dicevo prima, è una piccola digressione verso il mio primo amore.

Back to some of the more mundane issues of thinking about uncertainty. Here's another quiz for you -- now suppose we've got a test for a disease which isn't infallible, but it's pretty good. It gets it right 99 percent of the time. And I take one of you, or I take someone off the street, and I test them for the disease in question. Let's suppose there's a test for HIV -- the virus that causes AIDS -- and the test says the person has the disease. What's the chance that they do? The test gets it right 99 percent of the time. So a natural answer is 99 percent. Who likes that answer? Come on -- everyone's got to get involved. Don't think you don't trust me anymore. (Laughter) Well, you're right to be a bit skeptical, because that's not the answer. That's what you might think. It's not the answer, and it's not because it's only part of the story. It actually depends on how common or how rare the disease is. So let me try and illustrate that. Here's a little caricature of a million individuals. So let's think about a disease that affects -- it's pretty rare, it affects one person in 10,000. Amongst these million individuals, most of them are healthy and some of them will have the disease. And in fact, if this is the prevalence of the disease, about 100 will have the disease and the rest won't. So now suppose we test them all. What happens? Well, amongst the 100 who do have the disease, the test will get it right 99 percent of the time, and 99 will test positive. Amongst all these other people who don't have the disease, the test will get it right 99 percent of the time. It'll only get it wrong one percent of the time. But there are so many of them that there'll be an enormous number of false positives. Put that another way -- of all of them who test positive -- so here they are, the individuals involved -- less than one in 100 actually have the disease. So even though we think the test is accurate, the important part of the story is there's another bit of information we need.

Ma torniamo ad un modo più ortodosso di pensare all'incertezza. Ed ecco a voi un altro quiz... ora supponiamo di avere un esame per una malattia non infallibile, ma abbastanza preciso. Risulta affidabile per il 99 percento dei casi. E ora scelgo uno di voi, o qualcuno che passa per la strada, e lo sottopongo alle analisi per la malattia in questione. Diciamo che è un test per l'HIV... il virus che causa l'AIDS... e secondo l'esame risulta che la persona è infetta. Qual è la probabilità che sia vero? L'esame è corretto il 99 percento delle volte. Quindi la risposta più ovvia è il 99 percento. A chi piace questa risposta? Coraggio... tutti devono essere coinvolti. Non voglio pensare che non vi fidate di me. (Risate) Beh, avete anche ragione a essere un po' scettici, perché quella non era la risposta. E' quello che avreste potuto pensare. E non è la risposta perché vi ho raccontato solo una parte della storia. Infatti, molto dipende da quanto comune o rara sia la malattia. Permettetemi di farvi vedere. Questa è una piccola caricatura di un milione di individui. E ora pensiamo a una malattia che colpisce... diciamo che è abbastanza rara, colpisce una persona su 10000. Quindi tra questo milione di persone la maggior parte è sana e solo qualcuno è malato. In effetti, se la percentuale della malattia fosse quella che vi ho detto, circa cento ne sarebbero colpiti e il resto no. Ora supponiamo di sottoporli tutti alle analisi. Cosa succede? Vedete, tra i 100 che sono malati l'esame sarà corretto solo il 99 percento delle volte, quindi 99 volte sarà risulterà positivo. Tra le altre persone che non sono affette dalla malattia il test continuerà ad essere corretto il 99 percento di volte. Sarà sbagliato solo per l'un percento. Ma ci sono talmente tanti sani rispetto ai malati che risulterà un numero enorme di falsi positivi. Guardiamo il problema da un altro punto di vista... di tutti quelli il cui esame risulta positivo... insomma, tra tutti quelli coinvolti... meno di uno su 100 è malato. Quindi anche se crediamo che un test sia accurato, la cosa importante è che c'è un altra informazione di cui abbiamo bisogno.

Here's the key intuition. What we have to do, once we know the test is positive, is to weigh up the plausibility, or the likelihood, of two competing explanations. Each of those explanations has a likely bit and an unlikely bit. One explanation is that the person doesn't have the disease -- that's overwhelmingly likely, if you pick someone at random -- but the test gets it wrong, which is unlikely. The other explanation is that the person does have the disease -- that's unlikely -- but the test gets it right, which is likely. And the number we end up with -- that number which is a little bit less than one in 100 -- is to do with how likely one of those explanations is relative to the other. Each of them taken together is unlikely.

E questo è il punto di svolta. Quello che dobbiamo fare, una volta che un test risulta positivo, è di considerare la credibilità, la probabilità secondo due diverse impostazioni. Ognuna di esse è più o meno credibile. Una possibilità è che la persona non sia malata... decisamente probabile, prendendo delle persona a caso... ma che il test sia sbagliato, il che è improbabile. L'altra spiegazione è che la persona è davvero malato... cosa improbabile... e che il test sia corretto, che è probabile. Alla fine il numero che troviamo... che sarà leggermente inferiore rispetto a uno su 100... sarà basato su quanto probabile è una spiegazione rispetto all'altra. Prese insieme, comunque, sono entrambi improbabili.

Here's a more topical example of exactly the same thing. Those of you in Britain will know about what's become rather a celebrated case of a woman called Sally Clark, who had two babies who died suddenly. And initially, it was thought that they died of what's known informally as "cot death," and more formally as "Sudden Infant Death Syndrome." For various reasons, she was later charged with murder. And at the trial, her trial, a very distinguished pediatrician gave evidence that the chance of two cot deaths, innocent deaths, in a family like hers -- which was professional and non-smoking -- was one in 73 million. To cut a long story short, she was convicted at the time. Later, and fairly recently, acquitted on appeal -- in fact, on the second appeal. And just to set it in context, you can imagine how awful it is for someone to have lost one child, and then two, if they're innocent, to be convicted of murdering them. To be put through the stress of the trial, convicted of murdering them -- and to spend time in a women's prison, where all the other prisoners think you killed your children -- is a really awful thing to happen to someone. And it happened in large part here because the expert got the statistics horribly wrong, in two different ways.

Ed ecco un esempio più attuale della stessa identica cosa. Molti di voi in Inghilterra sapranno di un caso ormai famoso di una donna di nome Sally Clark, i cui due bambini morirono improvvisamente. Inizialmente, si pensò che fossero morti per quello che famigliarmente è chiamata "morte in culla", o, più formalmente, Sindrome della Morte Bianca. Per diversi motivi, in seguito è stata processata per omicidio. E al processo, il suo processo, un pediatra spiegò che la possibilità di due morti in culla, due morti innocenti, in una famiglia come la sua... di professionisti non fumatori... era di una su 73 milioni. Per farla breve, a quel punto venne condannata. Più tardi, in realtà recentemente, è stata prosciolta in appello... in effetti, in secondo appello. E solo per farvi capire la situazione, immaginate quanto possa essere terribile perdere un bambino, e poi un altro, e anche essendo innocenti venire condannati per omicidio. Dover sopportare lo stress del processo, essere condannati... passare il tempo in una prigione femminile, dove tutte le altre pensano che tu abbia ucciso i tuoi figli... è una cosa terrificante. E in questo caso è accaduta soprattutto perché l'esperto ha interpretato i dati statistici in modo assolutamente sbagliato, e per due motivi.

So where did he get the one in 73 million number? He looked at some research, which said the chance of one cot death in a family like Sally Clark's is about one in 8,500. So he said, "I'll assume that if you have one cot death in a family, the chance of a second child dying from cot death aren't changed." So that's what statisticians would call an assumption of independence. It's like saying, "If you toss a coin and get a head the first time, that won't affect the chance of getting a head the second time." So if you toss a coin twice, the chance of getting a head twice are a half -- that's the chance the first time -- times a half -- the chance a second time. So he said, "Here, I'll assume that these events are independent. When you multiply 8,500 together twice, you get about 73 million." And none of this was stated to the court as an assumption or presented to the jury that way. Unfortunately here -- and, really, regrettably -- first of all, in a situation like this you'd have to verify it empirically. And secondly, it's palpably false. There are lots and lots of things that we don't know about sudden infant deaths. It might well be that there are environmental factors that we're not aware of, and it's pretty likely to be the case that there are genetic factors we're not aware of. So if a family suffers from one cot death, you'd put them in a high-risk group. They've probably got these environmental risk factors and/or genetic risk factors we don't know about. And to argue, then, that the chance of a second death is as if you didn't know that information is really silly. It's worse than silly -- it's really bad science. Nonetheless, that's how it was presented, and at trial nobody even argued it. That's the first problem. The second problem is, what does the number of one in 73 million mean? So after Sally Clark was convicted -- you can imagine, it made rather a splash in the press -- one of the journalists from one of Britain's more reputable newspapers wrote that what the expert had said was, "The chance that she was innocent was one in 73 million." Now, that's a logical error. It's exactly the same logical error as the logical error of thinking that after the disease test, which is 99 percent accurate, the chance of having the disease is 99 percent. In the disease example, we had to bear in mind two things, one of which was the possibility that the test got it right or not. And the other one was the chance, a priori, that the person had the disease or not. It's exactly the same in this context. There are two things involved -- two parts to the explanation. We want to know how likely, or relatively how likely, two different explanations are. One of them is that Sally Clark was innocent -- which is, a priori, overwhelmingly likely -- most mothers don't kill their children. And the second part of the explanation is that she suffered an incredibly unlikely event. Not as unlikely as one in 73 million, but nonetheless rather unlikely. The other explanation is that she was guilty. Now, we probably think a priori that's unlikely. And we certainly should think in the context of a criminal trial that that's unlikely, because of the presumption of innocence. And then if she were trying to kill the children, she succeeded. So the chance that she's innocent isn't one in 73 million. We don't know what it is. It has to do with weighing up the strength of the other evidence against her and the statistical evidence. We know the children died. What matters is how likely or unlikely, relative to each other, the two explanations are. And they're both implausible. There's a situation where errors in statistics had really profound and really unfortunate consequences. In fact, there are two other women who were convicted on the basis of the evidence of this pediatrician, who have subsequently been released on appeal. Many cases were reviewed. And it's particularly topical because he's currently facing a disrepute charge at Britain's General Medical Council.

Dov'è che aveva trovato uno su 73 milioni? Aveva letto alcune ricerche, secondo cui la possibilità di morte in culla in una famiglia come quella di Sally Clark era una su 8500. Quindi si era detto "Supponendo di avere una morte in culla in una famiglia, allora la possibilità che un altro bambino muoia è identica." E questo è quello che gli esperti in statistica chiamano assunto di indipendenza. Sarebbe un po' come dire "Se lanci una moneta e la prima volta esce testa, questo non avrà alcun peso nella possibilità di avere testa anche la seconda volta". Quindi se lanciate per due volte, la probabilità di avere per due volte testa è di metà... la possibilità di avere testa... e un'altra metà... per la seconda volta testa. Quindi ha pensato "Ehi, pensiamo... credo che questi eventi siano indipendenti. E quando moltiplichiamo 8500 per se stesso si ottiene circa 73 milioni". E niente di tutto questo è stato spiegato alla corte come un'ipotesi o presentato in questo modo alla giuria. Sfortunatamente in questo caso... e, in realtà, purtroppo... in primo luogo, in situazioni simili c'è bisogno di verificare le teorie in modo empirico. E, comunque,era una statistica evidentemente falsa. Ci sono milioni di cose che non sappiamo sulle morti bianche. Possono concorrere fattori ambientali di cui non sappiamo nulla, e ci sono buone probabilità che ci siano anche fattori genetici che noi non conosciamo. Quindi se una famiglia ha subito una morte in culla, la si considererebbe parte di un gruppo ad alto rischio. Probabilmente erano presenti fattori ambientali di rischio e/o fattori genetici di cui non sappiamo niente. E, quindi, sostenere che c'é un valore statistico su una seconda morte senza avere queste informazioni è molto stupido. Anzi, è peggio che stupido... è scienza di livello bassissimo. Inoltre, al processo sono state presentate queste prove e nessuno ne ha dubitato. Questo è il primo punto. Il secondo problema è questo: cosa significa il numero 73 milioni? Dopo che Sally Clark venne condannata... ve lo immaginerete, è stata una manna dei giornali... un giornalista di una delle testate inglesi più affidabili scrisse che quello che l'esperto aveva detto era "C'è una possibilità su 73 milioni che sia innocente." Ora, questo è un errore di logica. E' lo stesso identico errore di pensare che dopo un esame medico, affidabile al 99 percento, le possibilità di avere la malattia sono del 99 percento. Nel caso della malattia, dobbiamo tenere a mente due fattori, uno dei quali è la possibilità che il test sia o non sia corretto. L'altro è la possibilità, a priori, che il paziente sia o non sia malato. Nel contesto giudiziario si è presentato lo stesso problema. Due fattori erano da considerare... due elementi della spiegazione. E vorremmo sapere quanto probabili, o relativamente probabili, siano le due spiegazioni. Il primo è che Sally Clark fosse innocente... cosa che, a priori, è decisamente probabile... la maggior parte delle madri non uccide i propri bambini. Il secondo, invece, è che sia stata vittima di un evento incredibilmente improbabile. Non così tanto improbabile quanto uno su 73 milioni, ma comunque piuttosto raro. L'altra possibile spiegazione è che fosse colpevole. Ora, molto probabilmente crediamo a priori che non sia così. E sicuramente dovremmo pensarlo in un processo, ricordandoci della presunzione di innocenza. Allora, se avesse cercato di uccidere i suoi bambini, ci era riuscita. Quindi la probabilità che fosse innocente non era di uno su 73 milioni. Non sappiamo quale fosse. Per saperlo sarebbe servito sommare le altre prove a suo carico e la prova fornita dalla statistica. Sappiamo che i bambini sono morti. Quello che importa è quanto siano realistiche o meno le due possibili spiegazioni. E, in realtà, sono entrambe irrealistiche. Questa è una situazione in cui gli errori di statistica hanno avuto pesanti e davvero infelici conseguenze. In fatti, ci sono state altre due donne condannate sulla base delle prove fornite dal pediatra, e che sono state in seguito scagionate. Molti altri casi sono stati riesaminati. Tutto questo è particolarmente attuale, perché in questo periodo lui sta subendo un processo di discredito presso il Britain's General Medical Council.

So just to conclude -- what are the take-home messages from this? Well, we know that randomness and uncertainty and chance are very much a part of our everyday life. It's also true -- and, although, you, as a collective, are very special in many ways, you're completely typical in not getting the examples I gave right. It's very well documented that people get things wrong. They make errors of logic in reasoning with uncertainty. We can cope with the subtleties of language brilliantly -- and there are interesting evolutionary questions about how we got here. We are not good at reasoning with uncertainty. That's an issue in our everyday lives. As you've heard from many of the talks, statistics underpins an enormous amount of research in science -- in social science, in medicine and indeed, quite a lot of industry. All of quality control, which has had a major impact on industrial processing, is underpinned by statistics. It's something we're bad at doing. At the very least, we should recognize that, and we tend not to. To go back to the legal context, at the Sally Clark trial all of the lawyers just accepted what the expert said. So if a pediatrician had come out and said to a jury, "I know how to build bridges. I've built one down the road. Please drive your car home over it," they would have said, "Well, pediatricians don't know how to build bridges. That's what engineers do." On the other hand, he came out and effectively said, or implied, "I know how to reason with uncertainty. I know how to do statistics." And everyone said, "Well, that's fine. He's an expert." So we need to understand where our competence is and isn't. Exactly the same kinds of issues arose in the early days of DNA profiling, when scientists, and lawyers and in some cases judges, routinely misrepresented evidence. Usually -- one hopes -- innocently, but misrepresented evidence. Forensic scientists said, "The chance that this guy's innocent is one in three million." Even if you believe the number, just like the 73 million to one, that's not what it meant. And there have been celebrated appeal cases in Britain and elsewhere because of that.

E ora, per concludere... qual è il succo da ricordarci di tutto il discorso? Beh, ciascuno di noi sa che il caso, l'incertezza e la probabilità fanno parte della vita di tutti i giorni. Ma è anche vero... e, benché, come gruppo, siate eccezionali, siete stati assolutamente nella norma a sbagliare gli esercizi che vi ho proposto. E' documentato molto bene che le persone spesso sbagliano. Spesso ragionando sull'improbabilità si fanno errori di logica. Riusciamo a destreggiarci brillantemente con le sottigliezze della lingua... e ci sono domande interessantissimi sull'evoluzione riguardo il perché ne siamo capaci. Ma non siamo altrettanto bravi nel campo dell'incertezza. E questo può essere un problema nella vita quotidiana. Come avrete sentito da molti dei conferenzieri, le statistiche sostengono un altissimo numero di ricerche scientifiche... nelle scienze sociali, in medicina e, alla fine, anche in gran parte delle ricerche industriali. Tutti i controlli sulla qualità, che sono di grandissima rilevanza nel processo industriale, si basano sulle statistiche. Ma è qualcosa in cui riusciamo male. In conclusione, ci dovremmo ricordare di questo fatto, ma tendiamo a non farlo. Tornando al campo giuridico, durante il processo a Sally Clark tutti gli avvocati non fecero altro che accettare quello che era stato detto dall'esperto. Ma se il pediatra se ne fosse uscito dicendo alla giuria dicendo "Ehi, so come costruire un ponte. Ne ho appena costruito uno qui in strada. Potete passarci sopra per tornare a casa in macchina" gli avrebbero risposto "Beh, i pediatri non sanno costruire i ponti. Quello è un compito per gli ingegneri." Ma d'altra parte lui è davvero saltato fuori dicendo, o comunque volendo dire "So come operare con l'incertezza. So come interpretare le statistiche." E tutti hanno risposto: "Ah, bene. Ascoltiamolo, è un esperto." Abbiamo tutti bisogno di capire cos'è e cosa non è nostra competenza. Lo stesso problema si presenta con le moderne tecnologie di profiling con il DNA, quando scienziati, avvocati e in qualche caso anche i giudici sistematicamente travisano il significato delle prove. Di solito... al meno si spera... in buona fede, ma le interpretano male. L'esperto forense dice "La probabilità che quest'uomo sia innocente è una su tre milioni." E anche se fate fede al numero, come nel caso di uno su 73 milioni, non è questo il significato di una cifra. E ci sono stati casi a cui si è ricorso all'appello in Inghilterra come ovunque per questo motivo.

And just to finish in the context of the legal system. It's all very well to say, "Let's do our best to present the evidence." But more and more, in cases of DNA profiling -- this is another one -- we expect juries, who are ordinary people -- and it's documented they're very bad at this -- we expect juries to be able to cope with the sorts of reasoning that goes on. In other spheres of life, if people argued -- well, except possibly for politics -- but in other spheres of life, if people argued illogically, we'd say that's not a good thing. We sort of expect it of politicians and don't hope for much more. In the case of uncertainty, we get it wrong all the time -- and at the very least, we should be aware of that, and ideally, we might try and do something about it. Thanks very much.

L'ultima cosa riguardo la legislatura. Va benissimo dire "Facciamo del nostro meglio per presentare le prove". Ma sempre più spesso, nel caso del DNA... per fare un altro esempio... pretendiamo che i membri della giuria, persone ordinarie... ed è documentato che non sono preparatissimi a questo... pretendiamo che siano in grado di seguire ogni sorta di ragionamento sul DNA. In altri campi, quando le persone discutono... beh, forse a parte la politica. Ma in tutti gli altri campi, se le persone sostengono cose senza senso, sappiamo che non sta andando bene. Possiamo aspettarcelo dai politici e neanche speriamo in qualcosa di meglio. Nel caso dell'incertezza, ci sbagliamo tutte le volte... e, alla fin fine, dobbiamo esserne consapevoli. L'ideale sarebbe provare a migliorare questa situazione. Grazie mille.

Peter Donnelly: How juries are fooled by statistics

Peter Donnelly: How juries are fooled by statistics

Related talks

Hans Rosling: The best stats you've ever seen

Michael Shermer: Why people believe weird things

Emily Oster: Flip your thinking on AIDS in Africa

Robert Full: Learning from the gecko's tail

Aubrey de Grey: A roadmap to end aging

E.O. Wilson: Advice to a young scientist

Related talks

Hans Rosling: The best stats you've ever seen

Michael Shermer: Why people believe weird things

Emily Oster: Flip your thinking on AIDS in Africa

Robert Full: Learning from the gecko's tail

Aubrey de Grey: A roadmap to end aging

E.O. Wilson: Advice to a young scientist