Nicholas Christakis: How social networks predict epidemics

For the last 10 years, I've been spending my time trying to figure out how and why human beings assemble themselves into social networks. And the kind of social network I'm talking about is not the recent online variety, but rather, the kind of social networks that human beings have been assembling for hundreds of thousands of years, ever since we emerged from the African savannah. So, I form friendships and co-worker and sibling and relative relationships with other people who in turn have similar relationships with other people. And this spreads on out endlessly into a distance. And you get a network that looks like this. Every dot is a person. Every line between them is a relationship between two people -- different kinds of relationships. And you can get this kind of vast fabric of humanity, in which we're all embedded.

Negli ultimi 10 anni, ho passato il tempo a tentare di capire come e perché gli esseri umani si riuniscono in reti sociali. E il tipo di social network di cui sto parlando non è la più recente varietà online, ma piuttosto, il tipo di social network che gli esseri umani hanno sempre creato per centinaia di migliaia di anni, sin da quando siamo emersi dalla savana africana. Perciò, creo amicizie, rapporti di lavoro, di fratellanza o di parentele con altre persone che a loro volta creano relazioni simili con altre persone. E questo si propaga all'infinito. E ottenete una rete che somiglia a questa Ogni puntino rappresenta una persona. Ogni linea che li collega rappresenta un legame tra due persone -- diversi tipi di relazione. E si ottiene questo fitto tessuto umano, in cui siamo tutti collegati.

And my colleague, James Fowler and I have been studying for quite sometime what are the mathematical, social, biological and psychological rules that govern how these networks are assembled and what are the similar rules that govern how they operate, how they affect our lives. But recently, we've been wondering whether it might be possible to take advantage of this insight, to actually find ways to improve the world, to do something better, to actually fix things, not just understand things. So one of the first things we thought we would tackle would be how we go about predicting epidemics.

Assieme al mio collega, James Fowler, abbiamo studiato questo fenomeno a lungo per capirne le regole matematiche, sociali, biologiche e psicologiche quelle regole che governano la creazione di queste reti e quali sono le regole che ne governano le dinamiche, come influenzano le nostre vite E recentemente, ci siamo chiesti se potrebbe essere possibile sfruttare questa prospettiva a nostro vantaggio per trovare nuovi modi per migliorare il mondo, per fare le cose meglio, per riparare i danni, non solo capire le cose. Una delle prime cose che abbiamo deciso di affrontare è stato come avremmo potuto prevedere le epidemie.

And the current state of the art in predicting an epidemic -- if you're the CDC or some other national body -- is to sit in the middle where you are and collect data from physicians and laboratories in the field that report the prevalence or the incidence of certain conditions. So, so and so patients have been diagnosed with something, or other patients have been diagnosed, and all these data are fed into a central repository, with some delay. And if everything goes smoothly, one to two weeks from now you'll know where the epidemic was today. And actually, about a year or so ago, there was this promulgation of the idea of Google Flu Trends, with respect to the flu, where by looking at people's searching behavior today, we could know where the flu -- what the status of the epidemic was today, what's the prevalence of the epidemic today.

E l'attuale stato dell'arte nella previsione delle epidemie -- se lavorate per la CDC o un altro ente nazionale -- consiste nel sedersi al centro e raccogliere dati dai medici e dai laboratori sul campo che riportano la prevalenza o l'incidenza di determinate condizioni. In questo modo, ad alcuni pazienti viene diagnosticata una malattia [qui], e ad altri un'altra malattia [qui], e tutti i dati vengono raccolti in un archivio, con un certo ritardo. Se va tutto liscio, tra una, due settimane da oggi, scoprirete lo stato dell'epidemia di oggi. E in effetti, circa un anno fa, era stata presentata l'idea di "Google Flu Trends", per l'influenza, in cui si proponeva di rilevare l'insorgenza dei focolai, analizzando il comportamento delle persone tentando di capire in questo modo lo stato dell'epidemia, la prevalenza epidemica in tempo reale.

But what I'd like to show you today is a means by which we might get not just rapid warning about an epidemic, but also actually early detection of an epidemic. And, in fact, this idea can be used not just to predict epidemics of germs, but also to predict epidemics of all sorts of kinds. For example, anything that spreads by a form of social contagion could be understood in this way, from abstract ideas on the left like patriotism, or altruism, or religion to practices like dieting behavior, or book purchasing, or drinking, or bicycle-helmet [and] other safety practices, or products that people might buy, purchases of electronic goods, anything in which there's kind of an interpersonal spread. A kind of a diffusion of innovation could be understood and predicted by the mechanism I'm going to show you now.

Ma ciò che voglio mostrarvi oggi è uno strumento attraverso il quale potremmo non solo muoverci repentinamente in caso di rilevamento dell'epidemia ma addirittura rilevare con largo anticipo lo scoppio dei focolai. E, in effetti, questa idea potrebbe essere utilizzata non solo per prevedere le epidemie virali o batteriche ma prevedere ogni sorda di dinamica epidemica Per esempio, qualsiasi cosa che si propaga per contagio sociale potrebbe essere compresa in questo modo, dalle idee astratte sulla sinistra come il patriottismo, l'altruismo, la religione a pratiche sociali come le diete, l'acquisto di libri, il bere, l'uso del casco in bicicletta e altre procedure di sicurezza, o prodotti da far comprare ai consumatori, l'acquisto di prodotti elettronici, qualsiasi cosa legata ai rapporti interpersonali. Un tipo di diffusione dell'innovazione potrebbe essere compreso e previsto attraverso un meccanismo che sto per mostrarvi

So, as all of you probably know, the classic way of thinking about this is the diffusion-of-innovation, or the adoption curve. So here on the Y-axis, we have the percent of the people affected, and on the X-axis, we have time. And at the very beginning, not too many people are affected, and you get this classic sigmoidal, or S-shaped, curve. And the reason for this shape is that at the very beginning, let's say one or two people are infected, or affected by the thing and then they affect, or infect, two people, who in turn affect four, eight, 16 and so forth, and you get the epidemic growth phase of the curve. And eventually, you saturate the population. There are fewer and fewer people who are still available that you might infect, and then you get the plateau of the curve, and you get this classic sigmoidal curve. And this holds for germs, ideas, product adoption, behaviors, and the like. But things don't just diffuse in human populations at random. They actually diffuse through networks. Because, as I said, we live our lives in networks, and these networks have a particular kind of a structure.

Allora, come probabilmente sapete tutti, è molto comune pensare alla diffusione dell'innovazione o alla cosidetta "curva di adozione". Qui sull'asse Y, abbiamo la percentuale di persone che ne sono influenzate, e sull'asse delle X, abbiamo il tempo. E all'inizio, non sono in tanti ad essere colpiti, ottenete la classica sigmoidale, o curva ad "S". E la ragione è che questa è la forma tipica degli stadi iniziali. diciamo che una o due persone vengono colpite, o infettate, da questa cosa. e poi loro colpiscono o contagiano altre due persone, che a loro volta ne colpiscono quattro, otto, 16 e così via, e raggiungiamo la fase di crescita epidemica sulla curva. E alla fine, l'intera popolazione viene saturata. Ci sono sempre meno persone che rimangono e possono essere infettate, e quindi si attiva all'appiattimento della curva, e si raggiunge la classica curva sigmoidale. Questo vale per i germi, le idee, l'acquisto di prodotti, comportamenti sociali, e così via. Ma le cose non si diffondono solo in maniera casuale nella società. Si diffondono attraverso le reti. Perché, come ho detto, viviamo le nostre vite all'interno di reti e queste reti hanno un particolare tipo di struttura.

Now if you look at a network like this -- this is 105 people. And the lines represent -- the dots are the people, and the lines represent friendship relationships. You might see that people occupy different locations within the network. And there are different kinds of relationships between the people. You could have friendship relationships, sibling relationships, spousal relationships, co-worker relationships, neighbor relationships and the like. And different sorts of things spread across different sorts of ties. For instance, sexually transmitted diseases will spread across sexual ties. Or, for instance, people's smoking behavior might be influenced by their friends. Or their altruistic or their charitable giving behavior might be influenced by their coworkers, or by their neighbors. But not all positions in the network are the same.

Se date un'occhiata ad una rete di questo tipo... Queste sono 105 persone. E le linee rappresentano... i puntini sono le persone, e le linee rappresentano le relazioni di amicizia. Potete vedere che le persone occupano una posizione diversa all'interno di questa rete. E ci sono diversi tipi di relazioni tra le persone. Ci possono essere amicizie, relazioni di parentela, relazioni matrimoniali o lavorative, relazioni di vicinato e così via. E altre relazioni che si propagano attraverso una rete di connessioni. Per esempio, le malattie veneree si propagheranno attraverso relazioni sessuali O, per esempio, la tendenza della gente a fumare potrebbe essere determinata dagli amici. E questo vale per l'altruismo e la carità che possono essere influenzate dai colleghi di lavoro o dai vicini. Ma non tutte le posizioni nella rete sono uguali.

So if you look at this, you might immediately grasp that different people have different numbers of connections. Some people have one connection, some have two, some have six, some have 10 connections. And this is called the "degree" of a node, or the number of connections that a node has. But in addition, there's something else. So, if you look at nodes A and B, they both have six connections. But if you can see this image [of the network] from a bird's eye view, you can appreciate that there's something very different about nodes A and B. So, let me ask you this -- I can cultivate this intuition by asking a question -- who would you rather be if a deadly germ was spreading through the network, A or B? (Audience: B.) Nicholas Christakis: B, it's obvious. B is located on the edge of the network. Now, who would you rather be if a juicy piece of gossip were spreading through the network? A. And you have an immediate appreciation that A is going to be more likely to get the thing that's spreading and to get it sooner by virtue of their structural location within the network. A, in fact, is more central, and this can be formalized mathematically. So, if we want to track something that was spreading through a network, what we ideally would like to do is to set up sensors on the central individuals within the network, including node A, monitor those people that are right there in the middle of the network, and somehow get an early detection of whatever it is that is spreading through the network.

Se guardate qui, potrete capire immediatamente che persone diverse hanno un numero diverso di relazioni. Alcune persone hanno una connessione, altre ne hanno due altre ne hanno sei, alcune dieci connessioni. Questo viene chiamato "grado" di un nodo, ossia, il numero di connessioni che ha ogni nodo. Ma c'è qualcosa di più. Perciò, se guardate i nodi A e B, entrambi hanno 6 connessioni. Ma se guardate questa immagine [della rete] dall'alto, si può notare che c'è una grossa differenza tra il nodo A e quello B. Vi chiedo... posso stimolarvi l'intuito ponendovi questa domanda... chi preferireste essere se un virus letale si propagasse nella rete, A o B? (Audience: B) Nicholas Christakis: B, è ovvio. B si trova ai margini della rete. Allora, chi vorreste essere se venisse messa in giro un'indiscrezione per la rete? A. E capite immediatamente che A avrà maggiori possibilità di venire a conoscenza della diceria prima di altri proprio grazie alla sua posizione strutturale all'interno della rete. A, infatti, è più centrale, e questo può essere formalizzato matematicamente. Perciò, se vogliamo rintracciare qualcosa che si propaga per la rete, idealmente cercheremo di posizionare i nostri sensori sugli indivisui centrali della rete, incluso il nodo A, monitoreremo queste persone che stanno al centro della rete, e in un certo modo otterremo prima la rilevazione di qualunque cosa si stia propagando per la rete.

So if you saw them contract a germ or a piece of information, you would know that, soon enough, everybody was about to contract this germ or this piece of information. And this would be much better than monitoring six randomly chosen people, without reference to the structure of the population. And in fact, if you could do that, what you would see is something like this. On the left-hand panel, again, we have the S-shaped curve of adoption. In the dotted red line, we show what the adoption would be in the random people, and in the left-hand line, shifted to the left, we show what the adoption would be in the central individuals within the network. On the Y-axis is the cumulative instances of contagion, and on the X-axis is the time. And on the right-hand side, we show the same data, but here with daily incidence. And what we show here is -- like, here -- very few people are affected, more and more and more and up to here, and here's the peak of the epidemic. But shifted to the left is what's occurring in the central individuals. And this difference in time between the two is the early detection, the early warning we can get, about an impending epidemic in the human population.

E cioè, se vedete che loro contraggono un germe od ottengono un'informazione, saprete, abbastanza presto, che anche agli altri accadrà lo stesso che si tratti di germi o informazioni. E questo sarà meglio che monitorare se persone scelte a caso, a prescindere dalla struttura della popolazione. E infatti, se poteste farlo, otterrete qualcosa del genere. Sulla sinistra, abbiamo la curva di adozione a forma di "S", Nella linea tartteggiata, mostriamo quale potrebbe essere l'adozione delle persone prese a caso, e nella linea a sinistra, più a sinistra, mostriamo quale sarebbe l'adozione degli individui in posizione centrale nella rete. Sull'asse delle Y è riportata l'accumulazione del contagio, e sull'asse delle X il tempo. E sulla destra, mostriamo alcuni dati ma in questo caso con l'incidenza giornaliera. E qui mostriamo... ecco, qui... che molte poche persone vengono infettate e sempre di più fino a qui, e a questo punto si parla di epidemia. ma spostato a sinistra avviene quel che già rileviamo per gli individui centrali. E questa differenza di tempistica tra i due è il fattore di diagnosi o allarme precoce che possiamo ottenere in relazione alle epidemie imminenti sulla popolazione.

The problem, however, is that mapping human social networks is not always possible. It can be expensive, not feasible, unethical, or, frankly, just not possible to do such a thing. So, how can we figure out who the central people are in a network without actually mapping the network? What we came up with was an idea to exploit an old fact, or a known fact, about social networks, which goes like this: Do you know that your friends have more friends than you do? Your friends have more friends than you do, and this is known as the friendship paradox. Imagine a very popular person in the social network -- like a party host who has hundreds of friends -- and a misanthrope who has just one friend, and you pick someone at random from the population; they were much more likely to know the party host. And if they nominate the party host as their friend, that party host has a hundred friends, therefore, has more friends than they do. And this, in essence, is what's known as the friendship paradox. The friends of randomly chosen people have higher degree, and are more central than the random people themselves.

il problema, ad ogni modo, sta nella difficoltà di mappare e reti sociali umane. Può essere dispendioso, [molto difficile], non etico, o francamente, semplicemente impossibile. Quindi, come possiamo capire che sono gli individui al centro della rete che ci permettono di non mappare l'intera rete? Abbiamo escogitato un modo sfruttando un dato che già avevamo o un evento legato alla rete che già conoscevamo, in questo modo: Sapete che i vostri amici hanno più amici di voi? Ne hanno di più E questo è chiamato il paradosso dell'amicizia. Immaginate una persona molto popolare nella rete... come uno che ha organizzato una festa con centinaia di persone... e un misantropo che ha solo un'amico, e prendete una persona a caso dalla rete; è molto più semplice che questa persona conosca l'organizzatore della festa. E se si ha questa persona come amico, avendo 100 amici, egli ne ha più di voi. E questo, in pratica, è quel che viene definito il paradosso dell'amicizia. gli amici di persone scelte a caso hanno un grado più alto, sono più centrali, delle stesse persone scelte a caso

And you can get an intuitive appreciation for this if you imagine just the people at the perimeter of the network. If you pick this person, the only friend they have to nominate is this person, who, by construction, must have at least two and typically more friends. And that happens at every peripheral node. And in fact, it happens throughout the network as you move in, everyone you pick, when they nominate a random -- when a random person nominates a friend of theirs, you move closer to the center of the network. So, we thought we would exploit this idea in order to study whether we could predict phenomena within networks. Because now, with this idea we can take a random sample of people, have them nominate their friends, those friends would be more central, and we could do this without having to map the network.

e si può intuire chiaramente se considerate gli individui che stanno ai margini della rete. Se scegliete questa persona, l'unica persona che può nominare è questa, che, per costruzione, deve avere almeno due amici, e in generale, di più. E questo avviene per ogni nodo periferico. E in fatti, questo avviene su tutta la rete, man mano che si procede verso l'interno, chiunque scegliate, quando nominano qualcuno a caso... quando una persona a caso nomina un suo amico, si riesce a penetrare all'interno della rete. Perciò, abbiamo pensato di sfruttare questa idea in maniera da studiare se fosse possibile prevedere dei fenomeni della rete. Perché ora, con questa idea, possiamo prendere un campione di persone a caso dalla rete, farci dare i nomi dei loro amici, che saranno più centrali rispetto alla rete, e possiamo farlo senza mappare tutta la rete,

And we tested this idea with an outbreak of H1N1 flu at Harvard College in the fall and winter of 2009, just a few months ago. We took 1,300 randomly selected undergraduates, we had them nominate their friends, and we followed both the random students and their friends daily in time to see whether or not they had the flu epidemic. And we did this passively by looking at whether or not they'd gone to university health services. And also, we had them [actively] email us a couple of times a week. Exactly what we predicted happened. So the random group is in the red line. The epidemic in the friends group has shifted to the left, over here. And the difference in the two is 16 days. By monitoring the friends group, we could get 16 days advance warning of an impending epidemic in this human population.

E abbiamo testato questa idea con l'influenza H1N1 all'Harvard College nell'autunno e l'inverno del 2009, qualche mese fa. Abbiamo preso 1300 studenti a caso, ci siamo fatti dire i nomi dei loro amici, e abbiamo seguito sia gli studenti che gli amici giornalmente per vedere se avrebbero contratto o meno la malattia. E l'abbiamo fatto passivamente, guardando se si recassero o meno all'infermeria dell'università e anche attivamente, facendoci spedire email un paio di volte la settimana. è accaduto esattamente quanto abbiamo previsto. perciò il gruppo scelto a caso è rappresentato dalla linea rossa l'epidemia degli amici si è sviluppata a sinistra, qui. E la differenza di tempistica è stata di 16 giorni. monitorando il gruppo di amici, abbiamo potuto avere un rilevamento con 16 giorni di anticipo sullo scoppio dell'epidemia tra la popolazione.

Now, in addition to that, if you were an analyst who was trying to study an epidemic or to predict the adoption of a product, for example, what you could do is you could pick a random sample of the population, also have them nominate their friends and follow the friends and follow both the randoms and the friends. Among the friends, the first evidence you saw of a blip above zero in adoption of the innovation, for example, would be evidence of an impending epidemic. Or you could see the first time the two curves diverged, as shown on the left. When did the randoms -- when did the friends take off and leave the randoms, and [when did] their curve start shifting? And that, as indicated by the white line, occurred 46 days before the peak of the epidemic. So this would be a technique whereby we could get more than a month-and-a-half warning about a flu epidemic in a particular population.

Ora, in più, se voi foste analisti impegnati nello studio dell'epidemia o nella valutazione del grado di adozione di un nuovo prodotto potreste prendere un campione a caso nella popolazione, far nominare i loro amici e seguire gli amici assieme al gruppo preso a caso. Tra gli amici, il primo campanello di allarme è stata una luce intermittente sopra lo zero nell'adozione dell'innovazione, per esempio, che ci segnalava l'arrivo dell'ondata epidemica. O potete vedere quando le due curve si allontanano, come mostrato sulla sinistra. Quand'è che gli amici si allontanano dal gruppo campione, e quando le curve iniziano a divergere? Questo è successo, come indicato dalla linea bianca, dopo 46 giorni prima del picco epidemico. Perciò questa è la tecnica grazie alla quale possiamo avere un avvertimento dell'epidemia un mese e mezzo prima su una specifica popolazione.

I should say that how far advanced a notice one might get about something depends on a host of factors. It could depend on the nature of the pathogen -- different pathogens, using this technique, you'd get different warning -- or other phenomena that are spreading, or frankly, on the structure of the human network. Now in our case, although it wasn't necessary, we could also actually map the network of the students.

Direi che la misura dell'anticipo che possiamo avere dipende da una serie di fattori. Può dipendere dalla natura del patogeno -- di diversi patogeni, con questa tecnica si otterrebbe una previsione diversa... o altri fenomeni che si propagano o, francamente, potrebbe dipendere dalla struttura della rete Ora, nel nostro caso, anche se non era necessario, potremmo mappare la rete degli studenti.

So, this is a map of 714 students and their friendship ties. And in a minute now, I'm going to put this map into motion. We're going to take daily cuts through the network for 120 days. The red dots are going to be cases of the flu, and the yellow dots are going to be friends of the people with the flu. And the size of the dots is going to be proportional to how many of their friends have the flu. So bigger dots mean more of your friends have the flu. And if you look at this image -- here we are now in September the 13th -- you're going to see a few cases light up. You're going to see kind of blooming of the flu in the middle. Here we are on October the 19th. The slope of the epidemic curve is approaching now, in November. Bang, bang, bang, bang, bang -- you're going to see lots of blooming in the middle, and then you're going to see a sort of leveling off, fewer and fewer cases towards the end of December. And this type of a visualization can show that epidemics like this take root and affect central individuals first, before they affect others.

Perciò, questa è una mappa di 714 studenti e delle loro amicizie. E tra un minuto, attiverò questa mappa. vedremo come la mappa si modifica con cadenza giornaliera per 120 giorni. I puntini rossi saranno i casi di influenza, e i puntini gialli saranno gli amici di chi ha contratto la malattia. E la dimensione dei puntini sarà proporzionale al numero degli amici che hanno l'influenza. Perciò i puntini più grandi significano che molti tuoi amici hanno l'influenza. E se guardate questa immagine -- qui siamo al 13 di settembre -- vedrete pochi casi che si accendono. Vedrete lo scoppio della malattia al centro. Questo è il 19 ottobre. La curva epidemica si avvicina adesso, a novembre. Bang, bang, bang, bang, bang! si accendono le luci al centro, e poi si giunge ad uno stallo, con sempre meno casi alla fine di Dicembre. Questo tipo di visualizzazione può mostrare come l'epidemia insorge e infetta gli individui centrali, prima degli altri.

Now, as I've been suggesting, this method is not restricted to germs, but actually to anything that spreads in populations. Information spreads in populations, norms can spread in populations, behaviors can spread in populations. And by behaviors, I can mean things like criminal behavior, or voting behavior, or health care behavior, like smoking, or vaccination, or product adoption, or other kinds of behaviors that relate to interpersonal influence. If I'm likely to do something that affects others around me, this technique can get early warning or early detection about the adoption within the population. The key thing is that for it to work, there has to be interpersonal influence. It cannot be because of some broadcast mechanism affecting everyone uniformly.

Ora, come vi ho detto, questo metodo non si limita alle malattie, ma può valere per qualsiasi evento nella popolazione. L'informazione si propaga nelle popolazioni. Le norme si propagano. I comportamenti si propagano. E per comportamenti, intendo sia quelli criminali, che i comportamenti elettorali, o i comportamenti sanitari, come il fumo, la vaccinazione, o l'adozione di prodotti, e altri tipi di comportamenti che sono legati ai rapporti interpersonale. Se io faccio qualcosa che potrebbe influenzare chi mi sta intorno, questa tecnica può darmi un vantaggio nel rilevamento, di un'adozione all'interno di una popolazione. il fattore chiave è che, perché funzioni, ci deve essere una forma di influenza interpersonale. Non può avvenire per meccanismi di divulgazione che influenzano tutti uniformemente.

Now the same insights can also be exploited -- with respect to networks -- can also be exploited in other ways, for example, in the use of targeting specific people for interventions. So, for example, most of you are probably familiar with the notion of herd immunity. So, if we have a population of a thousand people, and we want to make the population immune to a pathogen, we don't have to immunize every single person. If we immunize 960 of them, it's as if we had immunized a hundred [percent] of them. Because even if one or two of the non-immune people gets infected, there's no one for them to infect. They are surrounded by immunized people. So 96 percent is as good as 100 percent. Well, some other scientists have estimated what would happen if you took a 30 percent random sample of these 1000 people, 300 people and immunized them. Would you get any population-level immunity? And the answer is no. But if you took this 30 percent, these 300 people and had them nominate their friends and took the same number of vaccine doses and vaccinated the friends of the 300 -- the 300 friends -- you can get the same level of herd immunity as if you had vaccinated 96 percent of the population at a much greater efficiency, with a strict budget constraint.

Questi ragionamenti possono essere sfruttati -- rispetto alle reti -- anche in altri modi, per esempio, per individuare le persone da trattare. Per esempio, conoscerete probabilmente il concetto di immunità del gregge. Se avete una popolazione di mille persone, e vogliamo rendere quella popolazione immune ad un patogeno, non dobbiamo immunizzate tutti. Se ne immunizziamo 960 è come se ne avessimo immunizzati il 100%. Perché anche se i non immuni contraggono la malattia, non hanno nessuno da infettare. Sono circondati da persone immuni. Perciò il 96% è buono quanto il 100% Alcuni scienziati hanno stimato ciò che succederebbe se prendessimo il 30% di persone a caso di queste 1000, e le immunizzassimo. Quindi 300. Otterremmo un'immunità di qualche tipo a livello di popolazione? La risposta è no. Ma se prendessimo il 30%, queste 300 persone, e ci facessimo indicare i loro amici e vaccinassimo oltre alle 300 persone anche 300 dei loro amici, si otterrebbe lo stesso livello di immunità del gregge come se ne avessimo vaccinato il 96% con un'efficienza maggiore, ed un budget molto limitato.

And similar ideas can be used, for instance, to target distribution of things like bed nets in the developing world. If we could understand the structure of networks in villages, we could target to whom to give the interventions to foster these kinds of spreads. Or, frankly, for advertising with all kinds of products. If we could understand how to target, it could affect the efficiency of what we're trying to achieve. And in fact, we can use data from all kinds of sources nowadays [to do this].

Idee simili possono essere utilizzate, per esempio, per valutare la distribuzione di prodotti come le zanzariere nei paesi in via di sviluppo. Potremmo capire la struttura della rete dei villaggi, individuare gli individui che potrebbero promuovere questa idea per convincere gli altri. Francamente, questo vale per qualsiasi prodotto. Se capiamo come individuare il bersaglio, potremmo avere un grosso impatto in termini di efficienza per raggiungere il nostro scopo E in effetti, possiamo usare i dati che ci arrivano dalle fonti più disparate, oggi, [per farlo].

This is a map of eight million phone users in a European country. Every dot is a person, and every line represents a volume of calls between the people. And we can use such data, that's being passively obtained, to map these whole countries and understand who is located where within the network. Without actually having to query them at all, we can get this kind of a structural insight. And other sources of information, as you're no doubt aware are available about such features, from email interactions, online interactions, online social networks and so forth. And in fact, we are in the era of what I would call "massive-passive" data collection efforts. They're all kinds of ways we can use massively collected data to create sensor networks to follow the population, understand what's happening in the population, and intervene in the population for the better. Because these new technologies tell us not just who is talking to whom, but where everyone is, and what they're thinking based on what they're uploading on the Internet, and what they're buying based on their purchases. And all this administrative data can be pulled together and processed to understand human behavior in a way we never could before.

Questa è una mappa di 8 milioni di utenti telefonici in un paese europeo. Ogni puntino è una persona, e ogni linea rappresenta un volume di chiamate tra le persone. E possiamo usare questi dati, che abbiamo ottenuto passivamente, per mappare l''intero paese e capire chi è posizionato all'interno della rete. Senza chiedere direttamente a loro, possiamo ottenere questo tipo di comprensione strutturale. E altre fonti di informazione, come sicuramente saprete, sono disponibili su meccanismi del genere, dall'interazione via email alle interazioni online, ai social network, e così via. E in effetti, siamo nell'era che io definirei di raccolta "massiva-passiva" di informazioni. Ci sono moltissimi modi per raccogliere enormi quantità di informazioni in maniera passiva per creare delle reti di sensori per seguire la popolazione, capire cosa accade nella popolazione, e intervenire sulla popolazione per il bene comune. perché tutte queste nuove tecnologie ci dicono non solo chi parla con chi ma dove si trovano, cosa pensano in base a quello che caricano su internet, e quel che comprano in base ai loro acquisti. E tutti questi dati amministrativi possono essere messi assieme ed elaborati per capire il comportamento umano in una maniera senza precedenti.

So, for example, we could use truckers' purchases of fuel. So the truckers are just going about their business, and they're buying fuel. And we see a blip up in the truckers' purchases of fuel, and we know that a recession is about to end. Or we can monitor the velocity with which people are moving with their phones on a highway, and the phone company can see, as the velocity is slowing down, that there's a traffic jam. And they can feed that information back to their subscribers, but only to their subscribers on the same highway located behind the traffic jam! Or we can monitor doctors prescribing behaviors, passively, and see how the diffusion of innovation with pharmaceuticals occurs within [networks of] doctors. Or again, we can monitor purchasing behavior in people and watch how these types of phenomena can diffuse within human populations.

Perciò, ad esempio, potremmo usare i dati sull'acquisto di carburante per i TIR I camionisti viaggiano per lavoro e comprano il carburante. E vediamo una luce che si accende quando un camionista compra del carburante, e vediamo che la recessione sta per finire. O possiamo monitorare la velocità con cui le persone si muovono con i loro cellulari in autostrada e le compagnie telefoniche vedono quando la velocità diminuisce, che c'è un ingorgo. Possono mandare l'informazione ai loro clienti, ma solo ai clienti su quella particolare autostrada che stanno per arrivare ad un ingorgo! O possiamo monitorare il comportamento dei dottori nelle ricette mediche, passivamente, e vedere la diffusione dell'innovazione dei prodotti para-farmaceutici che avviene all'interno della rete dei dottori. O ancora, possiamo monitorare i comportamenti di acquisto delle pesone, e guardare come tutti questi tipi di fenomeni si diffondono tra le popolazioni.

And there are three ways, I think, that these massive-passive data can be used. One is fully passive, like I just described -- as in, for instance, the trucker example, where we don't actually intervene in the population in any way. One is quasi-active, like the flu example I gave, where we get some people to nominate their friends and then passively monitor their friends -- do they have the flu, or not? -- and then get warning. Or another example would be, if you're a phone company, you figure out who's central in the network and you ask those people, "Look, will you just text us your fever every day? Just text us your temperature." And collect vast amounts of information about people's temperature, but from centrally located individuals. And be able, on a large scale, to monitor an impending epidemic with very minimal input from people. Or, finally, it can be more fully active -- as I know subsequent speakers will also talk about today -- where people might globally participate in wikis, or photographing, or monitoring elections, and upload information in a way that allows us to pool information in order to understand social processes and social phenomena.

E ci sono tre modi, io credo, i cui possiamo usare questi dati massivo-passivi. Uno è totalmente passivo come ho già descritto -- come nell'esempio dei camionisti, in cui non si interviene all'interno della popolazione. Uno è semi-attivo, come nel caso dell'influenza, in cui le persone fanno i nomi dei loro amici e poi monitoriamo passivamente gli amici... per vedere se hanno l'influenza, e avere un certo preavviso. Un altro esempio, potrebbe essere, se siete una compagnia telefonia, individuate chi è centrale alla rete, e chiedete a questa gente, "Invia un sms con il livello di febbre che hai ogni giorno. Solo un semplice sms." Si raccolgono i dati sulla febbre delle persone ma da individui al centro della rete. E sarete in grado, su larga scala, di monitorare l'epidemia imminente con un imput minimo O, in ultima istanza, si può essere pienamente attivi -- come diranno gli altri relatori oggi -- laddove le persone partecipano ai wiki, fotografando, monitorando le elezioni, o caricando informazioni che ci permettono di monitorare e capire alcuni processi sociali o fenomeni sociali.

In fact, the availability of these data, I think, heralds a kind of new era of what I and others would like to call "computational social science." It's sort of like when Galileo invented -- or, didn't invent -- came to use a telescope and could see the heavens in a new way, or Leeuwenhoek became aware of the microscope -- or actually invented -- and could see biology in a new way. But now we have access to these kinds of data that allow us to understand social processes and social phenomena in an entirely new way that was never before possible. And with this science, we can understand how exactly the whole comes to be greater than the sum of its parts. And actually, we can use these insights to improve society and improve human well-being.

In effetti, la reperibilità di questi dati, credo, può portare ad una nuova era che io ed altri amiamo definire "scienza sociale computazionale" E' un po' come quando Galileo inventò -- beh non l'inventò -- iniziò ad utilizzare il telescopio per guardare i cieli in un modo diverso, o Leeuwenhoek si rese conto delle capacità del microscopio -- beh lo inventò in effetti -- e fu in grado di guardare alla biologia in maniera diversa. E ora abbiamo accesso ai dati più disparati che ci permettono di capire i processi sociali e i fenomeni sociali in maniera completamente diversa dal passato. E con questa scienza, possiamo capire esattamente come l'insieme diventa più grande della somma delle parti. E in effetti, possiamo usare queste conoscenze per migliorare la società e migliorare il benessere dell'umanità.

Thank you.

Grazie.

Thank you.

Grazie.

Nicholas Christakis: How social networks predict epidemics

Nicholas Christakis: How social networks predict epidemics

Related talks

Nicholas Christakis: The hidden influence of social networks

Dan Dennett: Dangerous memes

Laurie Garrett: Lessons from the 1918 flu

Gary Slutkin: Let's treat violence like a contagious disease

Andreas Raptopoulos: No roads? There's a drone for that

Eric Berlow and Sean Gourley: Mapping ideas worth spreading

Related talks

Nicholas Christakis: The hidden influence of social networks

Dan Dennett: Dangerous memes

Laurie Garrett: Lessons from the 1918 flu

Gary Slutkin: Let's treat violence like a contagious disease

Andreas Raptopoulos: No roads? There's a drone for that

Eric Berlow and Sean Gourley: Mapping ideas worth spreading