Nicholas Christakis: How social networks predict epidemics

على مدى السنوات العشر الماضية، لقد كنت امضي وقتي في محاولة معرفة كيف و لماذا بدأ البشر بتجميع أنفسهم على شكل شبكات إجتماعية و نوع الشبكة الإجتماعية التي أتحدث عنها ليست شبكات الأنترنت المتنوعة لكن بدلاً من ذلك تلك الشبكات الإجتماعية التي بدأ البشر في تجميعها على مدى مئات الاف السنين منذ أن خرجنا من السافانا الأفريقية. لذلك فقد شكلت صداقات و زمالة و أخوة و علاقات قرابة مع أشخاص آخرين هم بالتالي لديهم علاقات مشابهة مع أشخاص آخرين و هذا ينتشر بشكل لا متناهي إلى مدى و تحصل على شبكة تشبه هذه كل نقطة هي شخص كل خط بينهم هو علاقة بين شخصين أنواع مختلفة من العلاقات. و تحصل هذا النسيج الواسع من البشرية الذي نحن جميعاً جزأ لا يتجزأ منه.

For the last 10 years, I've been spending my time trying to figure out how and why human beings assemble themselves into social networks. And the kind of social network I'm talking about is not the recent online variety, but rather, the kind of social networks that human beings have been assembling for hundreds of thousands of years, ever since we emerged from the African savannah. So, I form friendships and co-worker and sibling and relative relationships with other people who in turn have similar relationships with other people. And this spreads on out endlessly into a distance. And you get a network that looks like this. Every dot is a person. Every line between them is a relationship between two people -- different kinds of relationships. And you can get this kind of vast fabric of humanity, in which we're all embedded.

زميلي "جيمس فاولر" و أنا كنا ندرس لوقت لابأس به ما هي القوانين الرياضية و الإجتماعية و البيلوجية و النفسية التي تحكم كيفية تجمع هذه الشبكات و ما هي القوانين المشابهة التي تحكم كيفية عملها، و كيف تؤثر في حياتنا. كنا نتسائل مؤخراً إن كان بالإمكان الإستفادة من هذا الفهم العميق و إيجاد طرق لتحسين العالم لعمل أشياء أفضل و إصلاح الأشياء، و ليس فقط لفهمها. لذلك واحدة من أوائل الأشياء التي فكرنا في علاجها ستكون كيف لنا أن نتوقع الأوبئة

And my colleague, James Fowler and I have been studying for quite sometime what are the mathematical, social, biological and psychological rules that govern how these networks are assembled and what are the similar rules that govern how they operate, how they affect our lives. But recently, we've been wondering whether it might be possible to take advantage of this insight, to actually find ways to improve the world, to do something better, to actually fix things, not just understand things. So one of the first things we thought we would tackle would be how we go about predicting epidemics.

الطريقة الحالية في توقع الأوبئة هي إن كنت في مركز السيطرة على الأمراض أو جهات وطنية أخرى هي أن تجلس في المنتصف حيث أنت و تجمع معلومات من الأطباء و المختبرات في المجال الذي أبلغ عنه أن هنالك إنتشار أو حدوث حالات معينة وهكذا ، وهكذا حتى يتم تشخيص المرضى الذين يعانون من شيء ، أو مرضى آخرين يتم تشخيصهم و كل هذه المعلومات يتم تزويدها للمستودع المركزي مع بعض التأخير و على فرض أن جميع الأمور سارت بشكل سلس من أسبوع إلى أسبوعين من الآن ستعلم أين كان الوباء اليوم. في الواقع، قبل سنة أو نحو ذلك كان هناك هذا النوع من نشر في نفس المفهوم في "Google Flu Trends" (إتجاهات الإنفلونزا فى جوجل) فيما يتعلق بالإنفلونزا حيث، بمراقبة سلوك البحث لدى الناس اليوم يمكننا أن نعرف أين الإنفلونزا ماذا كانت حالة الوباء اليوم ما مدى إنتشار الوباء اليوم

And the current state of the art in predicting an epidemic -- if you're the CDC or some other national body -- is to sit in the middle where you are and collect data from physicians and laboratories in the field that report the prevalence or the incidence of certain conditions. So, so and so patients have been diagnosed with something, or other patients have been diagnosed, and all these data are fed into a central repository, with some delay. And if everything goes smoothly, one to two weeks from now you'll know where the epidemic was today. And actually, about a year or so ago, there was this promulgation of the idea of Google Flu Trends, with respect to the flu, where by looking at people's searching behavior today, we could know where the flu -- what the status of the epidemic was today, what's the prevalence of the epidemic today.

لكن ما أود أن أريكم إياه اليوم هي الوسيلة التي تمكننا ليس فقط من التحذير السريع حول الوباء لكن في الواقع الكشف المبكر حول الوباء و في الواقع، هذه الفكرة يمكن إستخدامها ليس فقط في توقع الأوبئة المتعلقة بالجراثيم لكن أيضاَ في توقع الأوبئة من جميع الأنواع على سبيل المثال، أي شيء ينتشر بسرعة على شكل عدوى إجتماعية يمكن فهمه على هذا النحو من الأفكار المجردة على اليسار مثل الوطنية، أو الإيثار، أو الدين إلى الممارسات مثل سلوك أتباع نظام غذائي، أو شراء كتاب أو الشرب، أو خوذة الدراجة ، و ممارسات السلامة الأخرى أو المنتجات التي قد يشتريها الناس شراء السلع الإلكترونية أي شيء له علاقة بالإنتشار الشخصي نوع من نشر الإبتكار يمكن أن يفهم و يتوقع بواسطة الألية التي سوف أريكم إياها الآن

But what I'd like to show you today is a means by which we might get not just rapid warning about an epidemic, but also actually early detection of an epidemic. And, in fact, this idea can be used not just to predict epidemics of germs, but also to predict epidemics of all sorts of kinds. For example, anything that spreads by a form of social contagion could be understood in this way, from abstract ideas on the left like patriotism, or altruism, or religion to practices like dieting behavior, or book purchasing, or drinking, or bicycle-helmet [and] other safety practices, or products that people might buy, purchases of electronic goods, anything in which there's kind of an interpersonal spread. A kind of a diffusion of innovation could be understood and predicted by the mechanism I'm going to show you now.

لذلك، كما تعلمون جميعاً على الأرجح الطريقة التقليدية في التفكير حول هذه هو نشر الإبداع أو " منحنى التبني" فهنا على محور الـ(ص)، لدينا نسبة الأشخاص المصابين و على محور الـ(س) لدينا الوقت و في البداية ليس هنالك أشخاص كثيرون مصابون و تحصل على هذا الشكل السيني التقليدي أو منحنى على شكل "S" و السبب في هذا الشكل هو أنه في البداية دعنا نقول أن شخص أو شخصين هم متأثرون ، أو مصابون بالشيء و من ثم أثروا أو أصابوا شخصين الذين بدورهما يؤثران على أربعة ، ثمانية ، ستة عشر و هكذا دواليك و بذلك تحصل على مرحلة نمو الوباء للمنحنى و في نهاية المطاف ، يتشبع السكان و يكون هنالك أشخاص أقل و أقل لازالوا متاحين لإحتمال إصابتهم بالعدوى ثم تحصل على إستقرار للمنحنى ثم بعد ذلك تحصل على الشكل السيني التقليدي و هذا ينطوي على الجراثيم، و الأفكار و تبني المنتجات ، و السلوك و الأمور المشابهة لكن الأشياء لا تنتشر في التجمعات البشرية بشكل عشوائي في واقع الحال تنتشر من خلال شبكات لأنه كما قلت إننا نعيش حياتنا في شبكات و هذه الشبكات لديها نوع خاص من البنية

So, as all of you probably know, the classic way of thinking about this is the diffusion-of-innovation, or the adoption curve. So here on the Y-axis, we have the percent of the people affected, and on the X-axis, we have time. And at the very beginning, not too many people are affected, and you get this classic sigmoidal, or S-shaped, curve. And the reason for this shape is that at the very beginning, let's say one or two people are infected, or affected by the thing and then they affect, or infect, two people, who in turn affect four, eight, 16 and so forth, and you get the epidemic growth phase of the curve. And eventually, you saturate the population. There are fewer and fewer people who are still available that you might infect, and then you get the plateau of the curve, and you get this classic sigmoidal curve. And this holds for germs, ideas, product adoption, behaviors, and the like. But things don't just diffuse in human populations at random. They actually diffuse through networks. Because, as I said, we live our lives in networks, and these networks have a particular kind of a structure.

الآن، إذا نظرت إلى شبكة كهذه هذه شبكة ١٠٥ أشخاص النقاط هم الأشخاص و الخطوط تمثل علاقات الصداقة يمكن لك أن ترى أشخاص يحتلون مواقع مختلفة داخل الشبكة و هنالك أنواع مختلفة من العلاقات بين الأشخاص يمكن لك أن تحظى بعلاقة صداقة ، علاقة إخوة علاقة زوجية ، علاقة زمالة علاقة جيرة و هكذا و أنواع مختلفة من الأشياء تنتشر عبر روابط مختلفة على سبيل المثال ، الأمراض المنقولة جنسيا ستنتشر عبر الروابط الجنسية أو مثلاً، سلوك التدخين لدى الأشخاص ممكن أن يكون بتأثير أصدقائهم أو إيثارهم ، أو سلوك العطاء الخيري ممكن أن يكون بتأثير زملائهم أو بواسطة جيرانهم لكن ليس كل المواقع داخل الشبكة متشابهة

Now if you look at a network like this -- this is 105 people. And the lines represent -- the dots are the people, and the lines represent friendship relationships. You might see that people occupy different locations within the network. And there are different kinds of relationships between the people. You could have friendship relationships, sibling relationships, spousal relationships, co-worker relationships, neighbor relationships and the like. And different sorts of things spread across different sorts of ties. For instance, sexually transmitted diseases will spread across sexual ties. Or, for instance, people's smoking behavior might be influenced by their friends. Or their altruistic or their charitable giving behavior might be influenced by their coworkers, or by their neighbors. But not all positions in the network are the same.

لذلك إذا شاهدت هذا، يمكن لك أن تدرك فوراً أن أشخاص مختلفين لهم عدد مختلف من الروابط بعض الأشخاص لديهم رابط واحد ، و آخرين لديهم إثنين البعض لديه ستة، و آخرون لديهم عشرة روابط و هذا يدعى درجة العقدة أو عدد الروابط التي لدى العقدة لكن بالأضافة هنالك شيء آخر لذلك إذا شاهدت العقدة "A" و "B" كلاهما لديه ستة روابط لكن إذا أمكنك رؤية هذه الصورة من منظور عين الطير يمكنك أن تدرك أن هنالك شيء مختلف جداً بين العقدتين "A" و "B" لذلك دعني أسألك هذا ـ يمكن أن أزرع هذا الحدس بسؤال آيهما تفضل أن تكون إذا كان هنالك جراثيم قاتلة تنتشر خلال الشبكة. "A" أو "B" ؟ (الجمهور:B) نيكولاس كريستاكيس: بشكل واضح "B" "B" تقع على حافة الشبكة الآن من تفضل أن تكون إذا كانت هنالك إشاعة مهمة تنتشر خلال الشبكة ؟ "A" و سيكون لديك تقدير فوري "A" ستكون أكثر عرضة لإلتقاط الأشياء التي تنتشر و ستكون أسرع بحكم موقعها الهيكلي داخل الشبكة "A" في الواقع أكثر مركزية ويمكن إضفاء الطابع الرسمي هذا رياضيا لذلك إذا كنا نريد تعقب شيء كان ينتشر خلال الشبكة ما يمكن أن نفعله بشكل مثالي هو أن نضع أجهزة إستشعار في الأفراد المركزيين داخل الشبكة من ضمنهم العقدة "A" مراقبة هؤلاء الأشخاص الذي هم في منتصف الشبكة و بطريقة ما نحصل على كشف مبكر لما هو الشيء الذي ينتشر خلال الشبكة

So if you look at this, you might immediately grasp that different people have different numbers of connections. Some people have one connection, some have two, some have six, some have 10 connections. And this is called the "degree" of a node, or the number of connections that a node has. But in addition, there's something else. So, if you look at nodes A and B, they both have six connections. But if you can see this image [of the network] from a bird's eye view, you can appreciate that there's something very different about nodes A and B. So, let me ask you this -- I can cultivate this intuition by asking a question -- who would you rather be if a deadly germ was spreading through the network, A or B? (Audience: B.) Nicholas Christakis: B, it's obvious. B is located on the edge of the network. Now, who would you rather be if a juicy piece of gossip were spreading through the network? A. And you have an immediate appreciation that A is going to be more likely to get the thing that's spreading and to get it sooner by virtue of their structural location within the network. A, in fact, is more central, and this can be formalized mathematically. So, if we want to track something that was spreading through a network, what we ideally would like to do is to set up sensors on the central individuals within the network, including node A, monitor those people that are right there in the middle of the network, and somehow get an early detection of whatever it is that is spreading through the network.

لذلك إذا رأيتهم يلتقطون جراثيم أو جزء من المعلومات ستعلم ذلك، و بشكل سريع كفاية سيكون الجميع على وشك إلتقاط هذه الجراثيم أو هذا الجزء من المعلومات و هذا سيكون أفضل من مراقبة ستة أشخاص مختارين عشوائياً دون الإشارة إلى هيكلية السكان و في الواقع، إذا كنت تستطيع عمل ذلك ما ستراه هو شيء كهذا في اللوحة على اليد اليسرى، مرة أخرى ، لدينا منحنى على شكل "S" للتبني في الخط الأحمر المتقطع ، نعرض ما سيكون عليه التبني في الأشخاص العشوائيين و على اليد اليسرى الخط يتحول إلى اليسار نعرض ما سيكون عليه التبني لدى الأشخاص المركزين داخل الشبكة على محور الـ(ص) هو الحالات التراكمية للعدوى و على المحور الـ(س) هو الوقت و على الجزء الأيمن، نعرض بعض البيانات لكن هنا مع الوقائع اليومية و ما نعرضه هنا هو قليل من الأشخاص متأثرين، المزيد و المزيد و المزيد إلى هنا و هنا قمة الوباء لكن ما هو محول إلى اليسار هو ما يحدث لدى الأشخاص المركزيين و هذا هو الفرق في الوقت بين الإثنين الكشف المبكر ، الإنذار المبكر هو ما يمكننا الحصول عليه حول الوباء الوشيك في السكان

So if you saw them contract a germ or a piece of information, you would know that, soon enough, everybody was about to contract this germ or this piece of information. And this would be much better than monitoring six randomly chosen people, without reference to the structure of the population. And in fact, if you could do that, what you would see is something like this. On the left-hand panel, again, we have the S-shaped curve of adoption. In the dotted red line, we show what the adoption would be in the random people, and in the left-hand line, shifted to the left, we show what the adoption would be in the central individuals within the network. On the Y-axis is the cumulative instances of contagion, and on the X-axis is the time. And on the right-hand side, we show the same data, but here with daily incidence. And what we show here is -- like, here -- very few people are affected, more and more and more and up to here, and here's the peak of the epidemic. But shifted to the left is what's occurring in the central individuals. And this difference in time between the two is the early detection, the early warning we can get, about an impending epidemic in the human population.

المشكلة هي رسم خرائط لشبكات البشر الإجتماعية ليست دائماً ممكنة قد تكون غالية الثمن، ليست مجدية غير أخلاقية أو بصراحة، من المستحيل القيام بشيء كهذا لذلك كيف لنا معرفة من هم الأشخاص المركزيون داخل الشبكة دون الحاجة إلى رسم خريطة للشبكة ؟ ما أتينا به كان فكرة إستخدام حقيقة قديمة جداً أو حقيقة معروفة حول الشبكات الإجتماعية التي تقول هل تعلم بأن صديقك لديه أصدقاء أكثر مما لديك ؟ صديقك لديه أصدقاء أكثر مما لديك و هذا معروف بمفارقة الصداقة تخيل شخص ذو شعبية كبيرة في الشبكة الإجتماعية كمظيف حفلات لديه المئات من الأصدقاء و شخص مصاب ببغض البشرية لديه صديق واحد و أنت أخترت شخص عشوائيا من السكان هم سيكونون أكثر ميلاً لمعرفة مظيف الحفلات و إذا رشحوا مظيف الحفلات كصديق لهم و بما أن مظيف الحفلات لديه المئات من الأصدقاء لذلك فهو لديه أصدقاء أكثر مما لديهم خلاصة الموضوع ، أن هذا ما يسمى بمفارقة الصداقة الأصدقاء للأشخاص المختارين عشوائياً لديهم درجة عالية و أكثر مركزية من الأشخاص المختارين عشوائياً أنفسهم

The problem, however, is that mapping human social networks is not always possible. It can be expensive, not feasible, unethical, or, frankly, just not possible to do such a thing. So, how can we figure out who the central people are in a network without actually mapping the network? What we came up with was an idea to exploit an old fact, or a known fact, about social networks, which goes like this: Do you know that your friends have more friends than you do? Your friends have more friends than you do, and this is known as the friendship paradox. Imagine a very popular person in the social network -- like a party host who has hundreds of friends -- and a misanthrope who has just one friend, and you pick someone at random from the population; they were much more likely to know the party host. And if they nominate the party host as their friend, that party host has a hundred friends, therefore, has more friends than they do. And this, in essence, is what's known as the friendship paradox. The friends of randomly chosen people have higher degree, and are more central than the random people themselves.

و يمكنك الحصول على تقدير حدسي لهذا إذا تخيلت فقط الأشخاص في محيط الشبكة إذا أخترت هذا الشخص الصديق الوحيد الذي يمكن ترشيحه هو هذا الشخص الذي بالبنية لابد له أن يكون لديه على الأقل إثنين و المزيد من الأصدقاء عادةً و هذا يحدث في كل عقدة طرفية و في الواقع، هذا يحدث في جميع أنحاء الشبكة كلما أتجهت للداخل كل شخص تختاره، عندما يرشح عشوائيا عندما شخص عشوائي يرشح صديق لهم فأنك تتحرك أقرب إلى مركز الشبكة لذلك فكرنا أنه يمكن إستغلال هذه الفكرة من أجل دراسة إمكانية توقع ظواهر داخل الشبكة لأنه الآن ، مع هذه الفكرة يمكننا أخذ عينة عشوائية من الناس و جعلهم يرشحون أصدقائهم هؤلاء الأصدقاء سيكونون أكثر مركزية و سيكون بمقدورنا فعل هذا دون الحاجة إلى خريطة للشبكة

And you can get an intuitive appreciation for this if you imagine just the people at the perimeter of the network. If you pick this person, the only friend they have to nominate is this person, who, by construction, must have at least two and typically more friends. And that happens at every peripheral node. And in fact, it happens throughout the network as you move in, everyone you pick, when they nominate a random -- when a random person nominates a friend of theirs, you move closer to the center of the network. So, we thought we would exploit this idea in order to study whether we could predict phenomena within networks. Because now, with this idea we can take a random sample of people, have them nominate their friends, those friends would be more central, and we could do this without having to map the network.

و قد إختبرنا هذه الفكرة مع تفشي إنفلونز ( H1N1) في كلية هارفرد في فصلي الخريف و الشتاء من عام ٢٠٠٩، فقط قبل بضعة أشهر أخذنا ١٣٠٠ طالب جامعي تم إختيارهم عشوائياً و جعلناهم يرشوحون أصدقائهم و تابعنا كلا الطلاب العشوائيين و أصدقائهم بشكل يومي لمعرفة ما إذا أصيبوا أو لم يصابوا بوباء الإنفلونزا و فعلنا هذا بسلبية بمراقبة ما إذا ذهبوا أو لم يذهبوا إلى الخدمات الصحية بالجامعة و جعلناهم أيضاً يرسلون لنا بريد إلكتروني مرتين في الأسبوع و قد حدث ما توقعناه تماماً لذا المجموعة العشوائية هي الخط الأحمر الوباء في المجموعة تحرك إلى اليسار، إلى هنا و الفرق في الإثنين هو ١٦ يوم من خلال مراقبة مجموعة الأصدقاء يمكننا الحصول على إنذار مبكر بـ ١٦ يوم لتفشي وباء في هذه المجموعة من السكان

And we tested this idea with an outbreak of H1N1 flu at Harvard College in the fall and winter of 2009, just a few months ago. We took 1,300 randomly selected undergraduates, we had them nominate their friends, and we followed both the random students and their friends daily in time to see whether or not they had the flu epidemic. And we did this passively by looking at whether or not they'd gone to university health services. And also, we had them [actively] email us a couple of times a week. Exactly what we predicted happened. So the random group is in the red line. The epidemic in the friends group has shifted to the left, over here. And the difference in the two is 16 days. By monitoring the friends group, we could get 16 days advance warning of an impending epidemic in this human population.

الآن، بالإضافة إلى ذلك إذا كنت محلل تحاول دراسة وباء أو توقع تبني سلعة ، على سبيل المثال ما يمكنك فعله هو إختيار عينة عشوائية من السكان و جعلهم يرشحون أصدقائهم و أتبع الأصدقاء و أتبع كلا العشوائيين و الأصدقاء بين الأصدقاء، الدليل الأول الذي تشاهده لومضة فوق الصفر في تبني الإبتكار ، على سبيل المثال سيكون دليل على تفشي وباء أو يمكن أن ترى عندما يتباين المنحنين للمرة الأولى كما هو معروض على اليسار عندما كان العشوائيين...... عندما يقلع الأصدقاء و يتركون العشوائيين و منحناهم يبدأ في التحرك و ذلك كما يشير الخط الأبيض حدث ٤٦ يوم قبل قمة الوباء لذلك هذا قد يكون أسلوب حيث يمكن أن نحصل على أكثر من شهر و نصف من التحذير حول وباء الإنفلونزا في سكان معينين

Now, in addition to that, if you were an analyst who was trying to study an epidemic or to predict the adoption of a product, for example, what you could do is you could pick a random sample of the population, also have them nominate their friends and follow the friends and follow both the randoms and the friends. Among the friends, the first evidence you saw of a blip above zero in adoption of the innovation, for example, would be evidence of an impending epidemic. Or you could see the first time the two curves diverged, as shown on the left. When did the randoms -- when did the friends take off and leave the randoms, and [when did] their curve start shifting? And that, as indicated by the white line, occurred 46 days before the peak of the epidemic. So this would be a technique whereby we could get more than a month-and-a-half warning about a flu epidemic in a particular population.

يفترض لي أن أقول ذلك لأي مدى مبكر يمكن لإشعار حول شيء ما يعتمد على مجموعة من العوامل يمكن أن يعتمد على طبيعة مسبب المرض مسببات الأمراض المختلفة إستخدام هذا الأسلوب، يعطيك إنذارات مختلفة أو ظواهر أخرى تنتشر أو صراحة، على هيكلية الشبكة البشرية الآن، في حالتنا و على الرغم إنها لم تكن ضرورية كان بإمكاننا واقعا رسم خريطة لشبكة الطلاب

I should say that how far advanced a notice one might get about something depends on a host of factors. It could depend on the nature of the pathogen -- different pathogens, using this technique, you'd get different warning -- or other phenomena that are spreading, or frankly, on the structure of the human network. Now in our case, although it wasn't necessary, we could also actually map the network of the students.

لذا ، هذه خريطة ٧١٤ طالباً و روابط صداقاتهم خلال دقيقة، سوف أضع لكم هذه الخريطة موضع التنفيذ سنقوم بأخذ قطع يومي للشبكة لمدة ١٢٠ يوم النقط الحمراء تشير إلى حالات الإنفلونزا و النقط الصفراء تشير إلى أصدقاء الأشخاص المصابين بالإنفلونزا و حجم النقاط يشير تناسبياً إلى عدد الأصدقاء المصابين بالإنفلونزا لذا، النقاط الكبيرة تشير إلى أن كثير من أصدقائك لديهم إنفلونزا إذا نظرت إلى هذه الصورة ، نحن هنا الآن في ١٣ سبتمبر سوف ترى القليل من حالات تضاء سوف ترى نوع من الإزهار في الإنفلونزا في الوسط هنا نحن في ١٩ اوكتوبر المنحدر من منحنى الوباء يقترب الآن في نوفمبر فرقعة ، فرقعة ، فرقعة ، فرقعة ، فرقعة سوف ترى الكثير من الإزهار في المنتصف ثم سوف ترى نوع من الثبات حالات أقل و أقل بإتجاه نهاية ديسمبر و هذا النوع من التصور يمكن أن يظهر أوبئة مثل هذه تضرب بجذورها و تؤثر على الأشخاص المركزيين أولاً قبل أن تؤثر على الآخرين

So, this is a map of 714 students and their friendship ties. And in a minute now, I'm going to put this map into motion. We're going to take daily cuts through the network for 120 days. The red dots are going to be cases of the flu, and the yellow dots are going to be friends of the people with the flu. And the size of the dots is going to be proportional to how many of their friends have the flu. So bigger dots mean more of your friends have the flu. And if you look at this image -- here we are now in September the 13th -- you're going to see a few cases light up. You're going to see kind of blooming of the flu in the middle. Here we are on October the 19th. The slope of the epidemic curve is approaching now, in November. Bang, bang, bang, bang, bang -- you're going to see lots of blooming in the middle, and then you're going to see a sort of leveling off, fewer and fewer cases towards the end of December. And this type of a visualization can show that epidemics like this take root and affect central individuals first, before they affect others.

الآن ، كما أقترحت هذا المنهج ليس مقيداً بالجراثيم لكن في الواقع أي شيء يمكنه الإنتشار في السكان المعلومة تنتشر في السكان القواعد يمكن لها أن تنتشر في السكان السلوك يمكن له أن ينتشر في السكان و بالسلوك، أعني أشياء كالسلوك الإجرامي أو السلوك الإنتخابي ، أو سلوك الرعاية الصحية كالتدخين أو التلقيح أو تبني منتج ، أو أنواع أخرى من السلوكيات التي لها علاقة بالتأثير الشخصي إذا كنت ميالاً لفعل شيء يؤثر على الآخرين المحيطين بي هذا الأسلوب قد يكون تحذير مبكر ، أو كشف مبكر حول التبني داخل السكان و مفتاح هذا الشيء لكي يعمل لابد أن يكون هنالك تأثير شخصي لا يمكن لها أن تكون بسبب بعض آليات البث تأثر بجميع الأشخاص بشكل موحد

Now, as I've been suggesting, this method is not restricted to germs, but actually to anything that spreads in populations. Information spreads in populations, norms can spread in populations, behaviors can spread in populations. And by behaviors, I can mean things like criminal behavior, or voting behavior, or health care behavior, like smoking, or vaccination, or product adoption, or other kinds of behaviors that relate to interpersonal influence. If I'm likely to do something that affects others around me, this technique can get early warning or early detection about the adoption within the population. The key thing is that for it to work, there has to be interpersonal influence. It cannot be because of some broadcast mechanism affecting everyone uniformly.

الآن نفس الرؤى يمكن إستغلالها أيضاً فيما يتعلق بالشبكات يمكن إستغلالها بطرق أخرى على سبيل المثال بإستهداف أشخاص معينين للتدخل على سبيل المثال، معظم الناس قد يكون على دراية بمفهوم مناعة القطيع لذا ، إذا كان لدينا سكان من ألف شخص و نريد أن نجعل السكان لديهم مناعة من مسببات المرض ليس علينا تحصين كل شخص إذا حصنا ٩٦٠ منهم كأنه إذا قمنا بتحصين مئة منهم لأنه حتى لو واحد أو أثنين من ليست لديهم مناعة أصيب بالعدوى ليس هنالك أحد ليعديه إنهم محاطون بأشخاص لديهم مناعة لذا ٩٦٪ جيدة بقدر ١٠٠٪ حسناً ، بعض العلماء يقدرون ما سيحدث لو أخذنا ٣٠٪ عينة عشوائية من ١٣٠٠ شخص و حصناهم هل ستحصل على أي مستوى سكاني من المناعة ؟ و الجواب هو لا لكن إذا أخذنا ٣٠٪ ، هؤلاء ٣٠٠ شخص و جعلهم يرشحون أصدقائهم و أخذنا نفس العدد من اللقاحات و لقحنا أصدقاء الـ٣٠٠ أصدقاء الـ٣٠٠ تحصل على نفس المستوى من حصانة القطيع كأنك قد لقحت ٩٦ ٪ من السكان بكفاءة أكبر من ذلك بكثير، مع قيود صارمة على الميزانية

Now the same insights can also be exploited -- with respect to networks -- can also be exploited in other ways, for example, in the use of targeting specific people for interventions. So, for example, most of you are probably familiar with the notion of herd immunity. So, if we have a population of a thousand people, and we want to make the population immune to a pathogen, we don't have to immunize every single person. If we immunize 960 of them, it's as if we had immunized a hundred [percent] of them. Because even if one or two of the non-immune people gets infected, there's no one for them to infect. They are surrounded by immunized people. So 96 percent is as good as 100 percent. Well, some other scientists have estimated what would happen if you took a 30 percent random sample of these 1000 people, 300 people and immunized them. Would you get any population-level immunity? And the answer is no. But if you took this 30 percent, these 300 people and had them nominate their friends and took the same number of vaccine doses and vaccinated the friends of the 300 -- the 300 friends -- you can get the same level of herd immunity as if you had vaccinated 96 percent of the population at a much greater efficiency, with a strict budget constraint.

و أفكار مشابهة يمكن إستخدامها ، مثلاً لأستهداف أشياء مثل الناموسيات في العالم النامي إذا أمكننا فهم هيكلة الشبكة في القرى يمكن لنا أن نستهدف لمن نعطي التدخلات لنعزز هذا النوع من الإنتشار أو بصراحة ، للإعلان مع كافة أنواع السلع إذا فهمنا كيف نستهدف يمكن إن تؤثر على كفاءة ما نحاول الوصول إليه و في الواقع، يمكن لنا أن نستخدم بيانات من جميع أنواع المصادر في الوقت الحاضر.

And similar ideas can be used, for instance, to target distribution of things like bed nets in the developing world. If we could understand the structure of networks in villages, we could target to whom to give the interventions to foster these kinds of spreads. Or, frankly, for advertising with all kinds of products. If we could understand how to target, it could affect the efficiency of what we're trying to achieve. And in fact, we can use data from all kinds of sources nowadays [to do this].

هذه خريطة لـ ٨ ملايين مستخدم هاتف في بلد أوروبي كل نقطة شخص ، و كل خط يمثل حجم المكالمات بين الأشخاص و يمكن لنا أن نستخدم البيانات التي تم الحصول عليها بشكل سلبي لرسم خريطة كامل البلدان و فهم من يقع داخل الشبكة دون الحاجة للأستعلام منهم على الإطلاق يمكن لنا الحصول على هذا النوع من الرؤية الهيكلية و مصادر أخرى للمعلومات ، أنت على دراية بها بلا شك متاحة كالمعلومات من تفاعلات البريد الإلكتروني تفاعلات الإنترنت الشبكات الاجتماعية على الانترنت وهكذا دواليك. و في الواقع، نحن في حقبة يمكن أن أدعوها الجهود الضخمة السلبية لجمع البيانات هنالك العديد من الطرق يمكن لنا إستخدامها بشكل ضخم لجمع البيانات لخلق شبكات إستشعار لمتابعة السكان فهم ما يحصل داخل السكان و التدخل في السكان للأفضل لأن هذه الوسائل تخبرنا ليس فقط من يحادث من لكن أين الجميع و ماذا يفكرون به بناء على ما يرفعونه على الإنترنت و ما يشترونه بناء على مشترياتهم و يمكن سحب كل هذه البيانات الإدارية معاً و معالجتها لفهم السلوك البشري بطريقة لم تسبق من قبل

This is a map of eight million phone users in a European country. Every dot is a person, and every line represents a volume of calls between the people. And we can use such data, that's being passively obtained, to map these whole countries and understand who is located where within the network. Without actually having to query them at all, we can get this kind of a structural insight. And other sources of information, as you're no doubt aware are available about such features, from email interactions, online interactions, online social networks and so forth. And in fact, we are in the era of what I would call "massive-passive" data collection efforts. They're all kinds of ways we can use massively collected data to create sensor networks to follow the population, understand what's happening in the population, and intervene in the population for the better. Because these new technologies tell us not just who is talking to whom, but where everyone is, and what they're thinking based on what they're uploading on the Internet, and what they're buying based on their purchases. And all this administrative data can be pulled together and processed to understand human behavior in a way we never could before.

على سبيل المثال ، يمكن لنا إستخدام شراء سائقي الشاحنات للوقود لذا سائقي الشاحنات فقط يذهبون لعملهم و هم يبتاعون الوقود و نرى وميض في شراء سائقي الشاحنات للوقود و نعلم أن الكساد أوشك على الإنتهاء أو يمكن لنا مراقبة السرعة من خلال الأشخاص المتحركون مع هواتفهم في الطرق السريعة و يمكن لشركة الهاتف أن ترى عندما تنخفض السرعة أن هنالك إزدحام مروري و يمكن تزويد تلك المعلومات مرة أخرى إلى مشتركيها لكن فقط لمشتركيها على نفس الطريق السريع الواقعين خلف الإزدحام المروري أو يمكننا مراقبة سوكيات وصف الأطباء ، بشكل سلبي كمشاهدت نشر الإبتكار مع المستحضرات الصيدلانية تحدث داخل الأطباء أو مرة أخرى، يمكننا مراقبة سلوكيات الشراء لدى الناس و مشاهدة كيف لهذه الأنواع من الظواهر أن تنتشر داخل الناس

So, for example, we could use truckers' purchases of fuel. So the truckers are just going about their business, and they're buying fuel. And we see a blip up in the truckers' purchases of fuel, and we know that a recession is about to end. Or we can monitor the velocity with which people are moving with their phones on a highway, and the phone company can see, as the velocity is slowing down, that there's a traffic jam. And they can feed that information back to their subscribers, but only to their subscribers on the same highway located behind the traffic jam! Or we can monitor doctors prescribing behaviors, passively, and see how the diffusion of innovation with pharmaceuticals occurs within [networks of] doctors. Or again, we can monitor purchasing behavior in people and watch how these types of phenomena can diffuse within human populations.

و هنالك ثلاثة طرق، أعتقد ذلك و هذه البيانات الضخمة السلبية يمكن إستخدامها واحدة هي سلبية بالكامل مثل ما وصفته للتو على سبيل المثال، مثال سائقي الشاحنات في حين لم نتدخل في السكان بأي طريقة واحدة هي شبه نشطة كمثال الإنفلونزا الذي أعطيته حيث نجعل بعض الناس يرشحون أصدقائهم ثم يتم مراقبة أصدقائهم بشكل سلبي هل لديهم إنفلونزا أم لا ؟ ثم الحصول على تحذير أو مثال آخر سيكون إذا كنت شركة هاتف، يمكنك معرفة من هو مركزي في الشبكة و تسأل هؤلاء الأشخاص، " هل يمكنك أن ترسل لنا حماك يومياً؟ فقط أرسل درجة حرارتك" وجمع كميات هائلة من المعلومات حول درجة حرارة الناس ، لكن من أشخاص مركزيين الموقع و تكون قادر بشكل كبير على مراقبة تفشي الوباء مع الحد الأدنى من المدخلات من الناس أو يمكن أن يكون أكثر نشاطاً بشكل كامل مثل ما أعرف المتحدثين اللاحقين سيتحدثون عنه اليوم حيث يمكن أن يشارك الناس عالمياً في الموسوعات أو التصوير ، أو مراقبة الإنتخابات و رفع المعلومات بطريقة يمكننا جمعها لفهم العمليات الاجتماعية و الظواهر الإجتماعية

And there are three ways, I think, that these massive-passive data can be used. One is fully passive, like I just described -- as in, for instance, the trucker example, where we don't actually intervene in the population in any way. One is quasi-active, like the flu example I gave, where we get some people to nominate their friends and then passively monitor their friends -- do they have the flu, or not? -- and then get warning. Or another example would be, if you're a phone company, you figure out who's central in the network and you ask those people, "Look, will you just text us your fever every day? Just text us your temperature." And collect vast amounts of information about people's temperature, but from centrally located individuals. And be able, on a large scale, to monitor an impending epidemic with very minimal input from people. Or, finally, it can be more fully active -- as I know subsequent speakers will also talk about today -- where people might globally participate in wikis, or photographing, or monitoring elections, and upload information in a way that allows us to pool information in order to understand social processes and social phenomena.

في الواقع ، توفر هذه البيانات ، أعتقد تبشر بحقبة جديدة لما أود أنا و آخرون أن ندعوه العلوم الاجتماعية الحسابية إنه نوع عندما أخترعه "جاليليو" - أو لم يخترعه جاء لإستخدام المنظار و أمكنه مشاهدة السماء بطريقة جديدة أو "يوينهويك" أصبح على دراية بالمجهر أو في الواقع أخترعه و أصبح قادراً على مشاهدة علم الأحياء بطريقة جديدة لكن الآن لدينا القدرة على الوصل إلى هذا النوع من البيانات التي تتيح لنا فهم العمليات الاجتماعية والظواهر الاجتماعية و طريقة جديدة بالكامل لم تكن من قبل ممكنة و مع هذا العلم ، نستطيع بالظبط فهم كيف يكون الجزء الكامل أكبر من مجموع أجزائه و في الوقع، يمكن إستخدام هذه الرؤى لتحسين المجتمع وتحسين رفاه الإنسان

In fact, the availability of these data, I think, heralds a kind of new era of what I and others would like to call "computational social science." It's sort of like when Galileo invented -- or, didn't invent -- came to use a telescope and could see the heavens in a new way, or Leeuwenhoek became aware of the microscope -- or actually invented -- and could see biology in a new way. But now we have access to these kinds of data that allow us to understand social processes and social phenomena in an entirely new way that was never before possible. And with this science, we can understand how exactly the whole comes to be greater than the sum of its parts. And actually, we can use these insights to improve society and improve human well-being.

شكراً

Thank you.

شكراً

Thank you.

Nicholas Christakis: How social networks predict epidemics

Nicholas Christakis: How social networks predict epidemics

Related talks

Nicholas Christakis: The hidden influence of social networks

Dan Dennett: Dangerous memes

Laurie Garrett: Lessons from the 1918 flu

Gary Slutkin: Let's treat violence like a contagious disease

Andreas Raptopoulos: No roads? There's a drone for that

Eric Berlow and Sean Gourley: Mapping ideas worth spreading

Related talks

Nicholas Christakis: The hidden influence of social networks

Dan Dennett: Dangerous memes

Laurie Garrett: Lessons from the 1918 flu

Gary Slutkin: Let's treat violence like a contagious disease

Andreas Raptopoulos: No roads? There's a drone for that

Eric Berlow and Sean Gourley: Mapping ideas worth spreading