Jeremy Howard: The wonderful and terrifying implications of computers that can learn

It used to be that if you wanted to get a computer to do something new, you would have to program it. Now, programming, for those of you here that haven't done it yourself, requires laying out in excruciating detail every single step that you want the computer to do in order to achieve your goal. Now, if you want to do something that you don't know how to do yourself, then this is going to be a great challenge.

Altădată, dacă voiai ca un computer să facă ceva nou, trebuia să-l programezi. Pentru cei care nu aveți experiența asta, programarea presupune să stabilești în cele mai mici detalii fiecare pas pe care i-l ceri calculatorului pentru a obține ce dorești. Dacă vrei să faci ceva ce n-ai putea face tu însuți, atunci îți va fi foarte greu.

So this was the challenge faced by this man, Arthur Samuel. In 1956, he wanted to get this computer to be able to beat him at checkers. How can you write a program, lay out in excruciating detail, how to be better than you at checkers? So he came up with an idea: he had the computer play against itself thousands of times and learn how to play checkers. And indeed it worked, and in fact, by 1962, this computer had beaten the Connecticut state champion.

Foarte greu i-a fost și unui anumit Arthur Samuel. În 1956 dorea să programeze un calculator care să-l învingă la dame. Cum faci să programezi calculatorul în cele mai mici detalii încât să joace dame mai bine decât tine? I-a venit o idee: a pus calculatorul să joace cu el însuși de mii de ori ca să învețe să joace dame. Ideea a mers. În 1962 calculatorul acesta l-a învins pe campionul statului Connecticut.

So Arthur Samuel was the father of machine learning, and I have a great debt to him, because I am a machine learning practitioner. I was the president of Kaggle, a community of over 200,000 machine learning practictioners. Kaggle puts up competitions to try and get them to solve previously unsolved problems, and it's been successful hundreds of times. So from this vantage point, I was able to find out a lot about what machine learning can do in the past, can do today, and what it could do in the future. Perhaps the first big success of machine learning commercially was Google. Google showed that it is possible to find information by using a computer algorithm, and this algorithm is based on machine learning. Since that time, there have been many commercial successes of machine learning. Companies like Amazon and Netflix use machine learning to suggest products that you might like to buy, movies that you might like to watch. Sometimes, it's almost creepy. Companies like LinkedIn and Facebook sometimes will tell you about who your friends might be and you have no idea how it did it, and this is because it's using the power of machine learning. These are algorithms that have learned how to do this from data rather than being programmed by hand.

Arthur Samuel a fost părintele învățării automate. Eu îi sunt recunoscător pentru că și eu mă ocup cu învățarea automată. Am fost președinte la Kaggle, o comunitate cu peste 200 000 de practicanți ai învățării automate. Kaggle organizează competiții pentru rezolvarea problemelor încă nerezolvate. Și a avut sute de reușite. Din poziția mea avantajoasă am aflat ce putea face învățarea automată în trecut, ce poate face azi și ce ar putea face în viitor. Probabil primul mare succes al învățării automate a fost Google. Google a demonstrat că poți găsi informații folosind un algoritm computerizat bazat pe învățarea automată. De atunci s-au înregistrat numeroase succese comerciale în domeniu. Companii ca Amazon și Netflix folosesc învățarea automată pentru a vă sugera ce ați vrea să cumpărați, ce filme ați vrea să vedeți. Uneori aproape că te sperie: de exemplu LinkedIn și Facebook îți spun cine ți-ar putea fi prieten și habar n-ai cum au procedat. Se folosește puterea învățării automate. Sunt algoritmi care învață din date, fără să fie programați manual.

This is also how IBM was successful in getting Watson to beat the two world champions at "Jeopardy," answering incredibly subtle and complex questions like this one. ["The ancient 'Lion of Nimrud' went missing from this city's national museum in 2003 (along with a lot of other stuff)"] This is also why we are now able to see the first self-driving cars. If you want to be able to tell the difference between, say, a tree and a pedestrian, well, that's pretty important. We don't know how to write those programs by hand, but with machine learning, this is now possible. And in fact, this car has driven over a million miles without any accidents on regular roads.

Tot așa a avut succes și IBM cu Watson, care a învins cei doi campioni mondiali la „Jeopardy”, răspunzând la întrebări incredibil de subtile și complexe ca: [Anticul „Leu din Nimrud” a dispărut din muzeul național al acestui oraș în 2003 (împreună cu multe altele).] Tot astfel azi avem primele mașini care se conduc singure. A face diferența între un copac și un pieton, de exemplu, e important. Nu știm să scriem de mână astfel de programe, dar prin învățare automată acum e posibil. Mașina aceasta a mers peste 1 milion de km fără accidente, pe șosele obișnuite.

So we now know that computers can learn, and computers can learn to do things that we actually sometimes don't know how to do ourselves, or maybe can do them better than us. One of the most amazing examples I've seen of machine learning happened on a project that I ran at Kaggle where a team run by a guy called Geoffrey Hinton from the University of Toronto won a competition for automatic drug discovery. Now, what was extraordinary here is not just that they beat all of the algorithms developed by Merck or the international academic community, but nobody on the team had any background in chemistry or biology or life sciences, and they did it in two weeks. How did they do this? They used an extraordinary algorithm called deep learning. So important was this that in fact the success was covered in The New York Times in a front page article a few weeks later. This is Geoffrey Hinton here on the left-hand side. Deep learning is an algorithm inspired by how the human brain works, and as a result it's an algorithm which has no theoretical limitations on what it can do. The more data you give it and the more computation time you give it, the better it gets.

Acum știm: calculatoarele pot învăța, ba chiar pot învăța să facă lucruri pe care uneori nici noi nu știm să le facem sau poate le fac mai bine decât noi. Unul din cele mai uimitoare exemple de învățare automată din câte am văzut a fost într-un proiect al meu de la Kaggle, în care echipa lui Geoffrey Hinton de la Universitatea din Toronto a câștigat un concurs de descoperire automată a drogurilor. Extraordinar a fost nu că au surclasat toți algoritmii dezvoltați de Merck sau de comunitatea academică, ci faptul că nimeni din echipă nu avea pregătire în chimie sau biologie și au reușit în două săptămâni. Cum au reușit? Au folosit un algoritm numit învățare profundă. Succesul lor a fost atât de important încât a apărut pe prima pagină a New York Times după câteva săptămâni. Iată-l pe Geoffrey Hinton aici în stânga. Învățarea profundă e un algoritm inspirat din funcționarea creierului și astfel, teoretic, e un algoritm fără limitări în ceea ce poate face. Cu cât îi dai mai multe date și timp de calcul, cu atât devine mai bun.

The New York Times also showed in this article another extraordinary result of deep learning which I'm going to show you now. It shows that computers can listen and understand.

New York Times a prezentat în articol încă un rezultat al învățării profunde, pe care vi-l arăt acum. Demonstrează că computerele pot asculta și înțelege.

(Video) Richard Rashid: Now, the last step that I want to be able to take in this process is to actually speak to you in Chinese. Now the key thing there is, we've been able to take a large amount of information from many Chinese speakers and produce a text-to-speech system that takes Chinese text and converts it into Chinese language, and then we've taken an hour or so of my own voice and we've used that to modulate the standard text-to-speech system so that it would sound like me. Again, the result's not perfect. There are in fact quite a few errors. (In Chinese) (Applause) There's much work to be done in this area. (In Chinese) (Applause)

(Video) Richard Rashid: Ultimul pas pe care vreau să-l pot face e să vorbesc cu voi în chineză. Ideea de bază e că am stocat informații de la mulți vorbitori de chineză și am realizat un sistem de sinteză vocală care convertește textul chinezesc în chineza vorbită. Apoi am folosit o probă de circa o oră cu vocea mea pentru a modula sistemul standard încât să sune ca vocea mea. Repet, rezultatul nu e perfect. Există multe erori. (În chineză) (Aplauze) Mai sunt multe de făcut aici. (În chineză) (Aplauze)

Jeremy Howard: Well, that was at a machine learning conference in China. It's not often, actually, at academic conferences that you do hear spontaneous applause, although of course sometimes at TEDx conferences, feel free. Everything you saw there was happening with deep learning. (Applause) Thank you. The transcription in English was deep learning. The translation to Chinese and the text in the top right, deep learning, and the construction of the voice was deep learning as well.

Jeremy Howard: Asta a fost o conferință de învățare automată din China. La conferințele științifice rareori se aud aplauze spontane. Dar la conferințele TEDx faceți cum doriți. Tot ce ați văzut se bazează pe învățarea profundă. (Aplauze) Mulțumesc. Transcrierea în engleză era cu învățare profundă. La fel și traducerea în chineză și textul din dreapta sus. Sinteza vocii – tot învățare profundă.

So deep learning is this extraordinary thing. It's a single algorithm that can seem to do almost anything, and I discovered that a year earlier, it had also learned to see. In this obscure competition from Germany called the German Traffic Sign Recognition Benchmark, deep learning had learned to recognize traffic signs like this one. Not only could it recognize the traffic signs better than any other algorithm, the leaderboard actually showed it was better than people, about twice as good as people. So by 2011, we had the first example of computers that can see better than people. Since that time, a lot has happened. In 2012, Google announced that they had a deep learning algorithm watch YouTube videos and crunched the data on 16,000 computers for a month, and the computer independently learned about concepts such as people and cats just by watching the videos. This is much like the way that humans learn. Humans don't learn by being told what they see, but by learning for themselves what these things are. Also in 2012, Geoffrey Hinton, who we saw earlier, won the very popular ImageNet competition, looking to try to figure out from one and a half million images what they're pictures of. As of 2014, we're now down to a six percent error rate in image recognition. This is better than people, again.

Învățarea profundă e extraordinară. E un singur algoritm, care pare să poată face orice. Am aflat că înainte cu un an învățase și să vadă. Într-o competiție obscură din Germania de recunoaștere a indicatoarelor rutiere, a învățat să recunoască indicatoare rutiere precum acesta. A recunoscut indicatoarele mai bine decât orice alt algoritm și conform clasamentului chiar mai bine decât oamenii, de vreo două ori mai bine. Deci până în 2011 a apărut primul calculator care vede mai bine ca oamenii. De atunci s-au întâmplat multe. În 2012 Google a anunțat că a pus un algoritm să se uite pe YouTube și să ronțăie datele cu 16000 de calculatoare timp de o lună, învățând singur despre concepte ca oamenii și pisicile doar privind videoclipuri. Tot cam așa învață și omul. Oamenii nu învață din explicații despre ce văd, ci din propria experiență cu lucrurile. În 2012, Geoffrey Hinton, cel de adineauri a câștigat celebra competiție ImageNet, încercând să-și dea seama ce reprezintă 1,5 milioane de imagini. În 2014 eroarea de recunoaștere a imaginilor a scăzut la 6%. Din nou mai bine ca oamenii.

So machines really are doing an extraordinarily good job of this, and it is now being used in industry. For example, Google announced last year that they had mapped every single location in France in two hours, and the way they did it was that they fed street view images into a deep learning algorithm to recognize and read street numbers. Imagine how long it would have taken before: dozens of people, many years. This is also happening in China. Baidu is kind of the Chinese Google, I guess, and what you see here in the top left is an example of a picture that I uploaded to Baidu's deep learning system, and underneath you can see that the system has understood what that picture is and found similar images. The similar images actually have similar backgrounds, similar directions of the faces, even some with their tongue out. This is not clearly looking at the text of a web page. All I uploaded was an image. So we now have computers which really understand what they see and can therefore search databases of hundreds of millions of images in real time.

Mașinile se descurcă extraordinar de bine, iar acum sunt folosite în industrie. De exemplu, Google a anunțat anul trecut că a cartografiat toate locurile din Franța în două ore. Au făcut asta încărcând imagini ale străzilor într-un algoritm de învățare profundă care citește numărul străzii. Imaginați-vă cât ar fi durat altădată: zeci de oameni, mulți ani. Asta se întâmplă și în China. Baidu este ca un fel de Google, cred. Aici în stânga sus vedeți o poză încărcată de mine în acest sistem, iar jos vedeți că sistemul a înțeles ce reprezintă poza și a găsit imagini similare. Acestea au un fundal asemănător, direcții similare ale fețelor, unele chiar cu limba afară. Nu e ca și cum ai vedea textul unei pagini de internet. Am încărcat doar o imagine. Avem acum calculatoare care înțeleg ce văd și astfel pot căuta în baze de date cu sute de milioane de imagini în timp real.

So what does it mean now that computers can see? Well, it's not just that computers can see. In fact, deep learning has done more than that. Complex, nuanced sentences like this one are now understandable with deep learning algorithms. As you can see here, this Stanford-based system showing the red dot at the top has figured out that this sentence is expressing negative sentiment. Deep learning now in fact is near human performance at understanding what sentences are about and what it is saying about those things. Also, deep learning has been used to read Chinese, again at about native Chinese speaker level. This algorithm developed out of Switzerland by people, none of whom speak or understand any Chinese. As I say, using deep learning is about the best system in the world for this, even compared to native human understanding.

Ce înseamnă faptul că văd? Și nu numai că văd. Învățarea profundă a ajuns mai departe. Propoziții complexe și nuanțate ca aceasta sunt inteligibile prin algoritmi de învățare profundă. Aici vedeți cum un sistem de la Stanford arată prin bulina roșie de sus că a înțeles conotația negativă a acestei propoziții. Învățarea profundă are acum performanțe aproape umane în a înțelege despre ce e propoziția și ce comunică ea. Învățarea profundă se folosește la a citi în chineză aproape de nivelul de vorbitor nativ. Algoritmul a fost dezvoltat în Elveția de oameni care nu vorbesc și nu înțeleg deloc chineză. Învățarea profundă e cea mai bună soluție din lume pentru așa ceva, chiar și față de înțelegerea umană naturală.

This is a system that we put together at my company which shows putting all this stuff together. These are pictures which have no text attached, and as I'm typing in here sentences, in real time it's understanding these pictures and figuring out what they're about and finding pictures that are similar to the text that I'm writing. So you can see, it's actually understanding my sentences and actually understanding these pictures. I know that you've seen something like this on Google, where you can type in things and it will show you pictures, but actually what it's doing is it's searching the webpage for the text. This is very different from actually understanding the images. This is something that computers have only been able to do for the first time in the last few months.

Iată un sistem pus la punct de compania mea, care combină toate acestea. Aceste imagini nu sunt însoțite de text și pe măsură ce tastez propoziții aici înțelege în timp real imaginile, își dă seama despre ce e vorba și găsește imagini similare cu cele din textul scris de mine. După cum vedeți, înțelege atât propozițiile cât și imaginile. Știu că ați văzut ceva similar pe Google: – tastați ceva și vă arată imagini –, dar acolo de fapt caută pagini de internet după text. E cu totul altceva decât a înțelege imaginile. Calculatoarele au învățat să facă asta abia acum câteva luni.

So we can see now that computers can not only see but they can also read, and, of course, we've shown that they can understand what they hear. Perhaps not surprising now that I'm going to tell you they can write. Here is some text that I generated using a deep learning algorithm yesterday. And here is some text that an algorithm out of Stanford generated. Each of these sentences was generated by a deep learning algorithm to describe each of those pictures. This algorithm before has never seen a man in a black shirt playing a guitar. It's seen a man before, it's seen black before, it's seen a guitar before, but it has independently generated this novel description of this picture. We're still not quite at human performance here, but we're close. In tests, humans prefer the computer-generated caption one out of four times. Now this system is now only two weeks old, so probably within the next year, the computer algorithm will be well past human performance at the rate things are going. So computers can also write.

Așadar calculatoarele nu doar văd, ci pot și citi. Și desigur am demonstrat că înțeleg ce aud. Atunci nu vă va mira dacă vă spun că pot să și scrie. Iată un text pe care l-am generat ieri cu un algoritm de învățare profundă. Iată și un text generat de un algoritm de la Stanford. Fiecare propoziție a fost generată de un algoritm de învățare profundă pentru a descrie imaginile. Algoritmul nu mai văzuse un bărbat în cămașă neagră cântând la chitară. Mai văzuse un bărbat, culoarea neagră, o chitară, dar a generat singur o descriere nouă a acestei imagini. Încă n-am atins performanța umană, dar ne apropiem. La testare oamenii preferă descrierile generate de calculator cam o dată din patru. Sistemul are abia două săptămâni, deci probabil într-un an algoritmul va depăși performanța umană la cum avansează lucrurile. Deci calculatoarele pot și să scrie.

So we put all this together and it leads to very exciting opportunities. For example, in medicine, a team in Boston announced that they had discovered dozens of new clinically relevant features of tumors which help doctors make a prognosis of a cancer. Very similarly, in Stanford, a group there announced that, looking at tissues under magnification, they've developed a machine learning-based system which in fact is better than human pathologists at predicting survival rates for cancer sufferers. In both of these cases, not only were the predictions more accurate, but they generated new insightful science. In the radiology case, they were new clinical indicators that humans can understand. In this pathology case, the computer system actually discovered that the cells around the cancer are as important as the cancer cells themselves in making a diagnosis. This is the opposite of what pathologists had been taught for decades. In each of those two cases, they were systems developed by a combination of medical experts and machine learning experts, but as of last year, we're now beyond that too. This is an example of identifying cancerous areas of human tissue under a microscope. The system being shown here can identify those areas more accurately, or about as accurately, as human pathologists, but was built entirely with deep learning using no medical expertise by people who have no background in the field. Similarly, here, this neuron segmentation. We can now segment neurons about as accurately as humans can, but this system was developed with deep learning using people with no previous background in medicine.

Toate astea la un loc creează niște șanse fantastice. De exemplu, în medicină, o echipă din Boston a anunțat că a descoperit zeci de caracteristici relevante clinic ale tumorilor care ajută doctorii să prezică apariția cancerului. Similar, la Stanford un grup a anunțat că cercetând la microscop țesuturile au realizat un sistem cu învățare automată care reușește mai bine ca patologii umani să prevadă proporția de supraviețuire a pacienților cu cancer. În ambele cazuri previziunile au fost mai exacte și au ajutat și la progresul științific. În cazul radiologiei au apărut indicatori clinici înțeleși de om. În cazul patologiei sistemul computerizat a descoperit că celulele din jurul cancerului sunt la fel de importante ca cele canceroase în stabilirea diagnosticului. Asta infirmă ce spun cursurile de patologie de multe decenii încoace. În ambele cazuri, sistemele au fost realizate de experți în medicină împreună cu experți în învățare automată, dar anul trecut am depășit și acest prag. Iată un exemplu de identificare a zonelor canceroase a țesutului uman la microscop. Sistemul prezentat aici poate identifica acele zone mai exact sau cam la fel de exact ca patologii umani dar a fost realizat prin învățare profundă fără competențe medicale de oameni care nu aveau pregătire în domeniu. La fel, iată segmentarea unui neuron. Acum putem segmenta neuroni la fel de precis ca oamenii, dar sistemul a fost realizat prin învățare profundă, fără cunoștințe de medicină.

So myself, as somebody with no previous background in medicine, I seem to be entirely well qualified to start a new medical company, which I did. I was kind of terrified of doing it, but the theory seemed to suggest that it ought to be possible to do very useful medicine using just these data analytic techniques. And thankfully, the feedback has been fantastic, not just from the media but from the medical community, who have been very supportive. The theory is that we can take the middle part of the medical process and turn that into data analysis as much as possible, leaving doctors to do what they're best at. I want to give you an example. It now takes us about 15 minutes to generate a new medical diagnostic test and I'll show you that in real time now, but I've compressed it down to three minutes by cutting some pieces out. Rather than showing you creating a medical diagnostic test, I'm going to show you a diagnostic test of car images, because that's something we can all understand.

Atunci eu, deși nu am pregătire în medicină, se pare să sunt calificat să pornesc o firmă de medicină. Ceea ce am și făcut. Mi-a fost puțin frică să fac asta, dar teoria pare să sugereze ca poți să faci ceva foarte util în medicină folosind doar metode de analiză a datelor. Din fericire reacțiile au fost fantastice, atât din partea presei, dar și comunitatea medicală ne-a sprijinit foarte mult. Ideea e că putem să luăm partea mijlocie a procesului medical și s-o transformăm în analiză de date, pe cât posibil, lăsând apoi doctorii să-și facă meseria. Să vă dau un exemplu: acum ne trebuie 15 minute pentru a stabili o analiză nouă pentru diagnosticul medical. O să vă arăt acum, dar l-am redus la trei minute prin eliminarea unor părți. În loc de crearea unei analize pentru diagnosticul medical, o să vă arăt analiza imaginilor de mașini, ca să înțelegem toți.

So here we're starting with about 1.5 million car images, and I want to create something that can split them into the angle of the photo that's being taken. So these images are entirely unlabeled, so I have to start from scratch. With our deep learning algorithm, it can automatically identify areas of structure in these images. So the nice thing is that the human and the computer can now work together. So the human, as you can see here, is telling the computer about areas of interest which it wants the computer then to try and use to improve its algorithm. Now, these deep learning systems actually are in 16,000-dimensional space, so you can see here the computer rotating this through that space, trying to find new areas of structure. And when it does so successfully, the human who is driving it can then point out the areas that are interesting. So here, the computer has successfully found areas, for example, angles. So as we go through this process, we're gradually telling the computer more and more about the kinds of structures we're looking for. You can imagine in a diagnostic test this would be a pathologist identifying areas of pathosis, for example, or a radiologist indicating potentially troublesome nodules. And sometimes it can be difficult for the algorithm. In this case, it got kind of confused. The fronts and the backs of the cars are all mixed up. So here we have to be a bit more careful, manually selecting these fronts as opposed to the backs, then telling the computer that this is a type of group that we're interested in.

Începem cu circa 1,5 milioane de imagini de mașini și vreau să creez ceva care le clasifică după unghiul din care au fost pozate. Imaginile nu sunt deloc marcate, trebuie să pornesc de la zero. Algoritmul nostru de învățare profundă găsește automat structurile din imagini. Partea bună e că acum oamenii și calculatoarele pot colabora. Aici vedeți cum omul îi arată calculatorului zonele de interes pe care să se bazeze calculatorul pentru a îmbunătăți algoritmul. Sistemele de învățare profundă operează într-un spațiu cu 16 000 dimensiuni și vedeți cum se rotește calculatorul prin acest spațiu pentru a găsi noi zone de structură. Când reușește, omul care îl conduce poate indica zonele interesante. Aici calculatorul a găsit anumite zone, de exemplu, cu unghiuri. Pe măsură ce continuăm, îi dăm calculatorului mai multe date despre ce fel de structuri căutăm. Într-o analiză medicală aici ar putea fi un patolog care identifică zone patogene sau un radiolog care indică nodulii cu probleme posibile. Uneori algoritmului îi poate fi greu. Iată un caz în care s-a încurcat: a amestecat fața și spatele mașinilor. Aici trebuie să fim mai atenți, să separăm manual fața și spatele și apoi să-i spunem calculatorului că acesta e un tip de grup care ne interesează.

So we do that for a while, we skip over a little bit, and then we train the machine learning algorithm based on these couple of hundred things, and we hope that it's gotten a lot better. You can see, it's now started to fade some of these pictures out, showing us that it already is recognizing how to understand some of these itself. We can then use this concept of similar images, and using similar images, you can now see, the computer at this point is able to entirely find just the fronts of cars. So at this point, the human can tell the computer, okay, yes, you've done a good job of that.

Continuăm o vreme, sărim peste câte ceva și apoi antrenăm algoritmul de învățare automată pe baza celor câteva sute de lucruri și sperăm că a avansat. Vedeți, acum începe să estompeze anumite imagini, arătând că începe să înțeleagă singur să recunoască o parte din ele. Putem folosi acest concept de imagini similare și astfel vedeți cum calculatorul poate găsi doar mașinile pozate din față. Acum omul poate spune calculatorului: „Da, ai făcut o treabă bună.”

Sometimes, of course, even at this point it's still difficult to separate out groups. In this case, even after we let the computer try to rotate this for a while, we still find that the left sides and the right sides pictures are all mixed up together. So we can again give the computer some hints, and we say, okay, try and find a projection that separates out the left sides and the right sides as much as possible using this deep learning algorithm. And giving it that hint -- ah, okay, it's been successful. It's managed to find a way of thinking about these objects that's separated out these together.

Sigur, uneori chiar și acum separarea grupurilor e dificilă. Aici chiar după ce am lăsat o vreme calculatorul să încerce rotațiile, vedem că pozele făcute din stânga și cele din dreapta sunt amestecate. Putem da calculatorului sugestii: caută o proiecție care separă cât mai bine părțile stângă și dreaptă folosind algoritmul de învățare profundă. Apoi uite, sugestia a dus la succes. A reușit să găsească un mod de a analiza obiectele încât să le separe.

So you get the idea here. This is a case not where the human is being replaced by a computer, but where they're working together. What we're doing here is we're replacing something that used to take a team of five or six people about seven years and replacing it with something that takes 15 minutes for one person acting alone.

Ați prins ideea: e o situație în care omul nu e înlocuit de calculator, ci lucrează împreună cu el. Am înlocuit munca unei echipe de 5–6 oameni timp de vreo șapte ani, cu ceva care îi ia 15 minute unui om care lucrează singur.

So this process takes about four or five iterations. You can see we now have 62 percent of our 1.5 million images classified correctly. And at this point, we can start to quite quickly grab whole big sections, check through them to make sure that there's no mistakes. Where there are mistakes, we can let the computer know about them. And using this kind of process for each of the different groups, we are now up to an 80 percent success rate in classifying the 1.5 million images. And at this point, it's just a case of finding the small number that aren't classified correctly, and trying to understand why. And using that approach, by 15 minutes we get to 97 percent classification rates.

Procesul necesită 4–5 iterații. Am ajuns la 62% de clasificare corectă a 1,5 milioane de imagini. În acest stadiu putem începe să prelucrăm rapid secțiuni mari și să verificăm să nu rămână greșeli, iar dacă sunt greșeli îi spunem calculatorului. Folosind acest proces la fiecare din diferitele grupuri am ajuns la o proporție de succes de 80% în a clasifica cele 1,5 milioane de poze. Acum mai rămâne doar să găsim puținele imagini care n-au fost clasificate corect și să încercăm să înțelegem de ce. Procedând astfel în 15 minute ajungem la proporții de clasificare de 97%.

So this kind of technique could allow us to fix a major problem, which is that there's a lack of medical expertise in the world. The World Economic Forum says that there's between a 10x and a 20x shortage of physicians in the developing world, and it would take about 300 years to train enough people to fix that problem. So imagine if we can help enhance their efficiency using these deep learning approaches?

Această tehnică ne-ar putea ajuta să rezolvăm o problemă majoră, și anume lipsa de competență medicală din lume. Forumul Mondial Economic spune că ne trebuie de 10–20 de ori mai mulți medici în țările în curs de dezvoltare și că ar dura cam 300 de ani pentru a forma oameni și a acoperi lipsa. Ce-ar fi dacă le-am crește eficiența folosind tehnicile de învățare profundă?

So I'm very excited about the opportunities. I'm also concerned about the problems. The problem here is that every area in blue on this map is somewhere where services are over 80 percent of employment. What are services? These are services. These are also the exact things that computers have just learned how to do. So 80 percent of the world's employment in the developed world is stuff that computers have just learned how to do. What does that mean? Well, it'll be fine. They'll be replaced by other jobs. For example, there will be more jobs for data scientists. Well, not really. It doesn't take data scientists very long to build these things. For example, these four algorithms were all built by the same guy. So if you think, oh, it's all happened before, we've seen the results in the past of when new things come along and they get replaced by new jobs, what are these new jobs going to be? It's very hard for us to estimate this, because human performance grows at this gradual rate, but we now have a system, deep learning, that we know actually grows in capability exponentially. And we're here. So currently, we see the things around us and we say, "Oh, computers are still pretty dumb." Right? But in five years' time, computers will be off this chart. So we need to be starting to think about this capability right now.

Mă bucură această șansă, dar mă și îngrijorează problemele. Problema e că toate zonele albastre de pe harta aceasta reprezintă locuri unde în servicii lucrează 80% dintre angajați. Ce sunt serviciile? Acestea sunt serviciile. Exact asta au învățat recent calculatoarele să facă. Deci 80% din munca din lumea dezvoltată acum știe și calculatorul s-o facă. Ce înseamnă asta? Lasă, o să apară alte locuri de muncă, de exemplu în cercetarea datelor. De fapt nu prea. Cercetarea datelor nu cere mult timp. Acești patru algoritmi au fost construiți de un singur om. Poate vă gândiți că s-a mai întâmplat și am văzut deja rezultatele: când apare ceva nou apar și alte locuri de muncă. Dar care vor fi acestea? E greu de estimat, pentru că performanța umană crește încet, dar știm că sistemul de învățare profundă crește exponențial în capacitate. Ne aflăm aici. Acum vedem ce se întâmplă și ne gândim: calculatoarele sunt încă proaste, nu? Dar în cinci ani calculatoarele vor ieși din acest grafic. Așa că trebuie să ne gândim de pe acum la această capacitate.

We have seen this once before, of course. In the Industrial Revolution, we saw a step change in capability thanks to engines. The thing is, though, that after a while, things flattened out. There was social disruption, but once engines were used to generate power in all the situations, things really settled down. The Machine Learning Revolution is going to be very different from the Industrial Revolution, because the Machine Learning Revolution, it never settles down. The better computers get at intellectual activities, the more they can build better computers to be better at intellectual capabilities, so this is going to be a kind of change that the world has actually never experienced before, so your previous understanding of what's possible is different.

Am mai văzut asta odată, desigur: la Revoluția Industrială motoarele au produs un salt de capacitate. Dar după un timp situația s-a stabilizat. Au existat perturbări sociale, dar după ce inginerii au învățat să folosească noua putere peste tot lucrurile s-au stabilizat bine. Revoluția Învățării Automate va diferi mult de Revoluția Industrială pentru că lucrurile nu se vor stabiliza niciodată. Cu cât calculatorul va fi mai bun la activități intelectuale, cu atât vor construi calculatoare mai bune la activități intelectuale. Atunci va avea loc o schimbare nouă pe care omenirea n-o cunoaște, încât ce știam că e posibil nu mai e valabil.

This is already impacting us. In the last 25 years, as capital productivity has increased, labor productivity has been flat, in fact even a little bit down.

Deja simțim efectul. În vreme ce productivitatea capitalului a crescut în ultimii 25 de ani, productivitatea muncii a rămas constantă, ba chiar a scăzut un pic.

So I want us to start having this discussion now. I know that when I often tell people about this situation, people can be quite dismissive. Well, computers can't really think, they don't emote, they don't understand poetry, we don't really understand how they work. So what? Computers right now can do the things that humans spend most of their time being paid to do, so now's the time to start thinking about how we're going to adjust our social structures and economic structures to be aware of this new reality. Thank you. (Applause)

Aș vrea să înceapă discuția aceasta. Adesea când vorbesc despre asta lumea respinge ideea. Păi calculatoarele nu gândesc cu adevărat, nu au emoții, nu înțeleg poezia, nu prea înțelegem cum funcționează. Și ce dacă? Computerele pot face azi lucruri pentru care oamenii muncesc mult timp și sunt plătiți. E momentul să începem să ne gândim cum ne vom adapta structurile sociale și economice în fața acestei noi realități. Mulțumesc. (Applauze)

The New York Times also showed in this article another extraordinary result of deep learning which I'm going to show you now. It shows that computers can listen and understand.

New York Times a prezentat în articol încă un rezultat al învățării profunde, pe care vi-l arăt acum. Demonstrează că computerele pot asculta și înțelege.

This is already impacting us. In the last 25 years, as capital productivity has increased, labor productivity has been flat, in fact even a little bit down.

Deja simțim efectul. În vreme ce productivitatea capitalului a crescut în ultimii 25 de ani, productivitatea muncii a rămas constantă, ba chiar a scăzut un pic.