Jeremy Howard: The wonderful and terrifying implications of computers that can learn

It used to be that if you wanted to get a computer to do something new, you would have to program it. Now, programming, for those of you here that haven't done it yourself, requires laying out in excruciating detail every single step that you want the computer to do in order to achieve your goal. Now, if you want to do something that you don't know how to do yourself, then this is going to be a great challenge.

Antes, si querías que un computador hiciera algo nuevo tenías, normalmente, que programarlo. La programación, para quienes no la han practicado, requiere especificar con el más mínimo detalle cada paso que uno quiere que haga su computador para alcanzar el objetivo. Si quieren hacer algo que no saben hacer por sí mismos entonces están ante un gran reto.

So this was the challenge faced by this man, Arthur Samuel. In 1956, he wanted to get this computer to be able to beat him at checkers. How can you write a program, lay out in excruciating detail, how to be better than you at checkers? So he came up with an idea: he had the computer play against itself thousands of times and learn how to play checkers. And indeed it worked, and in fact, by 1962, this computer had beaten the Connecticut state champion.

Ese fue el reto al que se enfrentó este hombre, Arthur Samuel. En 1956, quería hacer que su computador pudiera ganarle a las damas. ¿Cómo se puede diseñar un programa, teniendo en cuenta el más mínimo detalle que sea mejor que sí mismo a las damas? Y se le ocurrió una idea: hizo jugar al computador contra sí mismo miles de veces y le hizo aprender a jugar a las damas. De hecho funcionó, ya en 1962, este computador había ganado la competición estatal de Connecticut. Arthur Samuel fue el padre del aprendizaje automático,

So Arthur Samuel was the father of machine learning, and I have a great debt to him, because I am a machine learning practitioner. I was the president of Kaggle, a community of over 200,000 machine learning practictioners. Kaggle puts up competitions to try and get them to solve previously unsolved problems, and it's been successful hundreds of times. So from this vantage point, I was able to find out a lot about what machine learning can do in the past, can do today, and what it could do in the future. Perhaps the first big success of machine learning commercially was Google. Google showed that it is possible to find information by using a computer algorithm, and this algorithm is based on machine learning. Since that time, there have been many commercial successes of machine learning. Companies like Amazon and Netflix use machine learning to suggest products that you might like to buy, movies that you might like to watch. Sometimes, it's almost creepy. Companies like LinkedIn and Facebook sometimes will tell you about who your friends might be and you have no idea how it did it, and this is because it's using the power of machine learning. These are algorithms that have learned how to do this from data rather than being programmed by hand.

y tengo una deuda con él, porque soy un profesional del aprendizaje automático. Fui presidente de Kaggle, una comunidad de unos 200 000 profesionales del aprendizaje automático. Kaggle contribuye con competiciones para tratar de resolver problemas anteriores no resueltos, y tuvo éxito cientos de veces. Así que desde esa perspectiva, pude descubrir mucho sobre lo que el aprendizaje automático hizo, puede hacer hoy y lo que podrá hacer en el futuro. Quizás el primer gran éxito del aprendizaje automático en el mercado fue Google. Google demostró que era posible encontrar información usando un algoritmo informático y ese algoritmo se basa en el aprendizaje automático. Desde entonces, ha habido muchos éxitos comerciales del aprendizaje automático. Compañías como Amazon y Netflix usan el aprendizaje automático para sugerir artículos que les puedan interesar comprar, películas que les puedan interesar ver A veces, es casi escalofriante. Compañías como LinkedIn y Facebook dicen, en ocasiones, cuáles pueden ser sus amigos y uno no tiene ni idea de cómo lo han hecho, y es porque hace uso del poder del aprendizaje automático. Estos son algoritmos que han aprendido como hacerlo a partir de los datos en lugar de ser programados a mano.

This is also how IBM was successful in getting Watson to beat the two world champions at "Jeopardy," answering incredibly subtle and complex questions like this one. ["The ancient 'Lion of Nimrud' went missing from this city's national museum in 2003 (along with a lot of other stuff)"] This is also why we are now able to see the first self-driving cars. If you want to be able to tell the difference between, say, a tree and a pedestrian, well, that's pretty important. We don't know how to write those programs by hand, but with machine learning, this is now possible. And in fact, this car has driven over a million miles without any accidents on regular roads.

Así es también como IBM tuvo éxito en conseguir que Watson ganara dos campeonatos mundiales de "Jeopardy" respondiendo preguntas increíblemente ingeniosas, como esta. [El antiguo "León de Nimrud" se perdió del Museo Nacional de esta ciudad en 2003] También es por eso, que podemos ver los primeros autos sin piloto. Si pueden diferenciar entre, por ejemplo un árbol y un peatón, es algo muy importante. No sabemos diseñar estos programas manualmente, pero con el aprendizaje automático es posible. De hecho, este auto ha conducido más de un millón y medio de kilómetros sin tener accidentes en carretera.

So we now know that computers can learn, and computers can learn to do things that we actually sometimes don't know how to do ourselves, or maybe can do them better than us. One of the most amazing examples I've seen of machine learning happened on a project that I ran at Kaggle where a team run by a guy called Geoffrey Hinton from the University of Toronto won a competition for automatic drug discovery. Now, what was extraordinary here is not just that they beat all of the algorithms developed by Merck or the international academic community, but nobody on the team had any background in chemistry or biology or life sciences, and they did it in two weeks. How did they do this? They used an extraordinary algorithm called deep learning. So important was this that in fact the success was covered in The New York Times in a front page article a few weeks later. This is Geoffrey Hinton here on the left-hand side. Deep learning is an algorithm inspired by how the human brain works, and as a result it's an algorithm which has no theoretical limitations on what it can do. The more data you give it and the more computation time you give it, the better it gets.

Así, sabemos que los computadores pueden aprender y pueden aprender a hacer cosas que de hecho nosotros, a veces, no sabemos hacer, o las pueden hacer mejor que nosotros. Uno de los ejemplos más sorprendentes que he visto en aprendizaje automático ocurrió en un proyecto que dirigía en Kaggle donde un equipo dirigido por un chico llamado Geoffrey Hinton de la Universidad de Toronto ganó un concurso para el descubrimiento automático de medicamentos. Lo extraordinario fue no solo que batiera a todos los algoritmos desarrollados por Merck o la comunidad académica internacional, sino que nadie del equipo tenía experiencia en química o biología o ciencias biológicas y lo hicieron en dos semanas. ¿Cómo lo hicieron? Usaron un algoritmo extraordinario llamado aprendizaje profundo. Fue tan exitoso que tuvo cobertura en The New York Times en un artículo de portada unas semanas después. Este es Geoffrey Hinton a la izquierda. El aprendizaje profundo es un algoritmo inspirado en el cerebro humano y como resultado no tiene limitaciones teóricas en lo que puede hacer. Cuantos más datos y tiempo de cálculo uno le dé mejor funciona.

The New York Times also showed in this article another extraordinary result of deep learning which I'm going to show you now. It shows that computers can listen and understand.

The New York Times mostró, también en su artículo otro resultado extraordinario del aprendizaje profundo que mostraré ahora. Demuestra que los computadores pueden escuchar y comprender.

(Video) Richard Rashid: Now, the last step that I want to be able to take in this process is to actually speak to you in Chinese. Now the key thing there is, we've been able to take a large amount of information from many Chinese speakers and produce a text-to-speech system that takes Chinese text and converts it into Chinese language, and then we've taken an hour or so of my own voice and we've used that to modulate the standard text-to-speech system so that it would sound like me. Again, the result's not perfect. There are in fact quite a few errors. (In Chinese) (Applause) There's much work to be done in this area. (In Chinese) (Applause)

(Vídeo) Richard Rashid: El último paso que quiero dar en este proceso es hablar en chino. La clave es, hemos recopilado una gran información de hablantes de chino y producido un sistema de conversión de texto a voz que toma el texto en chino y lo convierte en lengua oral, luego hemos grabado una hora de mi voz que usamos para modular el texto estándar de conversión de texto a voz para que suene como yo. De nuevo, el resultado no es perfecto. De hecho, hay unos cuantos errores. 結果並不完美 (los resultados no son perfectos) (Aplausos) Hay mucho que hacer en esta área. 在這方面有很多工作要做 (hay mucho trabajo que hacer en esta área) (Aplausos)

Jeremy Howard: Well, that was at a machine learning conference in China. It's not often, actually, at academic conferences that you do hear spontaneous applause, although of course sometimes at TEDx conferences, feel free. Everything you saw there was happening with deep learning. (Applause) Thank you. The transcription in English was deep learning. The translation to Chinese and the text in the top right, deep learning, and the construction of the voice was deep learning as well.

Jeremy Howard: Esto era una conferencia de aprendizaje automático en China. No es usual, en conferencias académicas oír aplausos espontáneos, aunque en las conferencias de TEDx siéntanse libres. Todo lo que han visto es gracias al aprendizaje profundo. (Aplausos) Gracias. La transcripción en inglés es aprendizaje profundo. La traducción al chino y el texto arriba a la derecha, es aprendizaje profundo, y la construcción de la voz también es aprendizaje profundo.

So deep learning is this extraordinary thing. It's a single algorithm that can seem to do almost anything, and I discovered that a year earlier, it had also learned to see. In this obscure competition from Germany called the German Traffic Sign Recognition Benchmark, deep learning had learned to recognize traffic signs like this one. Not only could it recognize the traffic signs better than any other algorithm, the leaderboard actually showed it was better than people, about twice as good as people. So by 2011, we had the first example of computers that can see better than people. Since that time, a lot has happened. In 2012, Google announced that they had a deep learning algorithm watch YouTube videos and crunched the data on 16,000 computers for a month, and the computer independently learned about concepts such as people and cats just by watching the videos. This is much like the way that humans learn. Humans don't learn by being told what they see, but by learning for themselves what these things are. Also in 2012, Geoffrey Hinton, who we saw earlier, won the very popular ImageNet competition, looking to try to figure out from one and a half million images what they're pictures of. As of 2014, we're now down to a six percent error rate in image recognition. This is better than people, again.

Eso es lo extraordinario del aprendizaje profundo. Es un solo algoritmo que parece hacer casi cualquier cosa, y descubrí que un año antes, aprendió a ver. En esta extraña competición en Alemania llamada Banco de Prueba de Reconocimiento de Señales de Tránsito el aprendizaje profundo ha aprendido a reconocer señales de tránsito como esta. No solo reconoce señales de tránsito mejor que cualquier otro algoritmo, la clasificación mostró que era mejor que las personas, dos veces más bueno que las personas. Para 2011, se da el primer ejemplo de computadores que pueden ver mejor que las personas. Desde entonces, han ocurrido muchas cosas. En 2012, Google anunció que había hecho que un algoritmo de aprendizaje profundo viera vídeos en YouTube y procesaron la información en 16 000 computadores al mes y el computador aprendió de manera independiente conceptos como personas y gatos solo viendo los vídeos. Esto se parece mucho al aprendizaje humano. Los humanos no aprendemos porque nos cuenten lo que vemos, sino que aprendemos solos qué son esas cosas. También en 2012, Geoffrey Hinton, que vimos anteriormente, ganó la famosa competición de ImageNet, tratando de averiguar, mirando un millón y medio de imágenes, sobre qué eran estas imágenes. A partir de 2014, tenemos un porcentaje de error por debajo del 6 % en reconocimiento de imágenes. De nuevo, mejor que las personas.

So machines really are doing an extraordinarily good job of this, and it is now being used in industry. For example, Google announced last year that they had mapped every single location in France in two hours, and the way they did it was that they fed street view images into a deep learning algorithm to recognize and read street numbers. Imagine how long it would have taken before: dozens of people, many years. This is also happening in China. Baidu is kind of the Chinese Google, I guess, and what you see here in the top left is an example of a picture that I uploaded to Baidu's deep learning system, and underneath you can see that the system has understood what that picture is and found similar images. The similar images actually have similar backgrounds, similar directions of the faces, even some with their tongue out. This is not clearly looking at the text of a web page. All I uploaded was an image. So we now have computers which really understand what they see and can therefore search databases of hundreds of millions of images in real time.

Las máquinas están haciendo un trabajo increíble aquí, y está siendo usadas en la industria. Por ejemplo, Google anunció el año pasado que había cartografiado cada sitio de Francia en dos horas, y lo hizo alimentando con imágenes de las calles, al algoritmo de aprendizaje profundo para reconocer y leer los números. Imaginen lo que se habría tardado antes: docenas de personas, muchos años. Esto también está pasando en China. Baidu es como el Google chino, creo, y lo que ven arriba a la izquierda es un ejemplo de una imagen que subí al sistema de aprendizaje profundo de Baidu, y debajo se puede ver que el sistema ha entendido lo que es esa imagen y encuentra imágenes similares. Las imágenes similares tienen fondos similares similares de las caras, incluso algunos con la lengua afuera. Esto no es claramente mirar el texto de una página web. Todo lo que descargué eran imágenes. Por lo que ahora tenemos computadores que entienden lo que ven y por ello pueden buscar bases de datos de cientos de millones de imágenes en tiempo real.

So what does it mean now that computers can see? Well, it's not just that computers can see. In fact, deep learning has done more than that. Complex, nuanced sentences like this one are now understandable with deep learning algorithms. As you can see here, this Stanford-based system showing the red dot at the top has figured out that this sentence is expressing negative sentiment. Deep learning now in fact is near human performance at understanding what sentences are about and what it is saying about those things. Also, deep learning has been used to read Chinese, again at about native Chinese speaker level. This algorithm developed out of Switzerland by people, none of whom speak or understand any Chinese. As I say, using deep learning is about the best system in the world for this, even compared to native human understanding.

¿Qué significado tiene que los computadores puedan ver? Bueno, no es solo que los computadores puedan ver. De hecho, el aprendizaje profundo ha hecho más que eso. Frases complejas y llenas de matices como esta son ahora comprensibles con algoritmos del aprendizaje profundo. Como pueden ver aquí, este sistema basado en el de Stanford que muestra el punto rojo en la parte superior ha comprendido que esta frase expresa sentimientos negativos. El aprendizaje profundo está cercano a la conducta humana al comprender lo que significan las frases y lo que se está diciendo sobre esas cosas. El aprendizaje profundo se ha usado también para leer chino, de nuevo, a un nivel casi de hablante nativo. Este algoritmo, desarrollado en Suiza por gente que no hablaba ni entendía chino. Como dije, usar el aprendizaje profundo es el mejor sistema del mundo para esto, hasta comparándolo con el conocimiento humano.

This is a system that we put together at my company which shows putting all this stuff together. These are pictures which have no text attached, and as I'm typing in here sentences, in real time it's understanding these pictures and figuring out what they're about and finding pictures that are similar to the text that I'm writing. So you can see, it's actually understanding my sentences and actually understanding these pictures. I know that you've seen something like this on Google, where you can type in things and it will show you pictures, but actually what it's doing is it's searching the webpage for the text. This is very different from actually understanding the images. This is something that computers have only been able to do for the first time in the last few months.

Este es un sistema que formamos en mi empresa que demuestra todas estas cosas juntas. Estas son imágenes sin texto adjunto, y cuando tecleo aquí frases, entiende, en tiempo real, estas imágenes y comprende de qué se tratan y encuentra imágenes similares al texto que estoy escribiendo. Como pueden ver, entiende mis frases y de hecho entiende estas imágenes. Se que han visto algo como esto en Google, donde puede escribir algo y te lo muestra en imágenes, pero lo que realmente está haciendo es buscar la página web a través del texto. Esto es muy diferente a comprender las imágenes. Esto es algo que los computadores solo han podido hacer por primera vez hace unos pocos meses.

So we can see now that computers can not only see but they can also read, and, of course, we've shown that they can understand what they hear. Perhaps not surprising now that I'm going to tell you they can write. Here is some text that I generated using a deep learning algorithm yesterday. And here is some text that an algorithm out of Stanford generated. Each of these sentences was generated by a deep learning algorithm to describe each of those pictures. This algorithm before has never seen a man in a black shirt playing a guitar. It's seen a man before, it's seen black before, it's seen a guitar before, but it has independently generated this novel description of this picture. We're still not quite at human performance here, but we're close. In tests, humans prefer the computer-generated caption one out of four times. Now this system is now only two weeks old, so probably within the next year, the computer algorithm will be well past human performance at the rate things are going. So computers can also write.

Así que ahora podemos ver que los computadores no solo ven sino que pueden leer, y, por supuesto, hemos demostrado que pueden entender lo que oyen. Quizá no sea sorprendente ahora lo que voy a decir, pueden escribir. Aquí hay parte de un texto que generé ayer usando el algoritmo de aprendizaje profundo. Y aquí hay parte de un texto que generó un algoritmo de Stanford. Cada una de estas frases fue generada por un algoritmo de aprendizaje profundo para describir estas imágenes. Este algoritmo nunca había visto a un hombre con camisa negra tocando la guitarra. Ha visto a un hombre antes, ha visto el negro antes ha visto una guitarra antes, pero ha generado de manera independiente esta innovadora descripción de esta imagen. Aquí no estamos ante un comportamiento humano, pero estamos cerca. En las pruebas, las personas prefieren las leyendas generadas por el computador 1 de cada 4 veces. Este sistema tiene ahora solo dos semanas de edad, por lo que posiblemente antes del año que viene, el algoritmo del computador irá más allá del comportamiento humano al paso que van las cosas.

So we put all this together and it leads to very exciting opportunities. For example, in medicine, a team in Boston announced that they had discovered dozens of new clinically relevant features of tumors which help doctors make a prognosis of a cancer. Very similarly, in Stanford, a group there announced that, looking at tissues under magnification, they've developed a machine learning-based system which in fact is better than human pathologists at predicting survival rates for cancer sufferers. In both of these cases, not only were the predictions more accurate, but they generated new insightful science. In the radiology case, they were new clinical indicators that humans can understand. In this pathology case, the computer system actually discovered that the cells around the cancer are as important as the cancer cells themselves in making a diagnosis. This is the opposite of what pathologists had been taught for decades. In each of those two cases, they were systems developed by a combination of medical experts and machine learning experts, but as of last year, we're now beyond that too. This is an example of identifying cancerous areas of human tissue under a microscope. The system being shown here can identify those areas more accurately, or about as accurately, as human pathologists, but was built entirely with deep learning using no medical expertise by people who have no background in the field. Similarly, here, this neuron segmentation. We can now segment neurons about as accurately as humans can, but this system was developed with deep learning using people with no previous background in medicine.

Así que los computadores pueden escribir. Juntamos todo esto y lleva a oportunidades apasionantes. Por ejemplo, en medicina, un equipo de Boston anunció que habían descubierto decenas de características clínicas relevantes sobre tumores que ayudan a los médicos a hacer un diagnóstico de un cáncer. Algo similar, en Stanford, un grupo anunció que, mirando un tejido con aumento, habían desarrollado una máquina basada en el sistema de aprendizaje que de hecho es mejor que los patólogos humanos prediciendo las tasas de supervivencia de los enfermos de cáncer. En ambos casos, no solo fueron las predicciones más precisas, sino que generaron una nueva ciencia reveladora. En el caso de la radiología, hubo nuevos indicadores clínicos que las personas pueden entender. En este caso de patología, el sistema informático descubrió que las células alrededor del cáncer son tan importantes como las células del cáncer mismo al hacer un diagnóstico. Esto es lo contrario de lo que los patólogos han pensado por décadas. En cada uno de estos casos, fueron sistemas desarrollados por una combinación de expertos médicos y expertos del aprendizaje profundo, pero a partir del año pasado, dimos un paso más allá. Este es un caso de identificación de áreas cancerígenas del tejido humano por microscopio. El sistema que se muestra aquí puede identificar esas áreas de formar más precisa o casi tan precisa como los patólogos humanos, construido completamente con aprendizaje profundo sin usar experiencia médica por gente que no tenía experiencia en este campo. De manera similar, esta segmentación neuronal. Ahora, podemos segmentar neuronas de forma casi tan precisa como las personas, y este sistema fue desarrollado por aprendizaje profundo usando a gente sin experiencia previa en medicina.

So myself, as somebody with no previous background in medicine, I seem to be entirely well qualified to start a new medical company, which I did. I was kind of terrified of doing it, but the theory seemed to suggest that it ought to be possible to do very useful medicine using just these data analytic techniques. And thankfully, the feedback has been fantastic, not just from the media but from the medical community, who have been very supportive. The theory is that we can take the middle part of the medical process and turn that into data analysis as much as possible, leaving doctors to do what they're best at. I want to give you an example. It now takes us about 15 minutes to generate a new medical diagnostic test and I'll show you that in real time now, but I've compressed it down to three minutes by cutting some pieces out. Rather than showing you creating a medical diagnostic test, I'm going to show you a diagnostic test of car images, because that's something we can all understand.

Como yo, alguien sin experiencia previa en medicina, parezco completamente calificado para empezar una empresa médica, y lo hice. Estaba aterrorizado de hacerlo, pero la teoría parecía sugerir que podía ser posible hacer medicina muy útil usando solo estas técnicas de información analítica. Afortunadamente, la recompensa ha sido fantástica, no solo por parte de los medios sino de la comunidad médica, que nos ha apoyado mucho. La teoría es que podemos tomar media parte del proceso médico y convertirlo todo lo posible en análisis de datos, dejando a los médicos en lo que son mejores. Quiero dar un ejemplo. Nos lleva unos 15 minutos crear una nueva prueba de diagnóstico médico y ahora lo demostraré en tiempo real, pero lo he comprimido a 3 minutos cortando algunas partes. En vez de mostrar cómo crear una prueba de diagnóstico médico, mostraré una prueba de diagnóstico de imágenes de autos,

So here we're starting with about 1.5 million car images, and I want to create something that can split them into the angle of the photo that's being taken. So these images are entirely unlabeled, so I have to start from scratch. With our deep learning algorithm, it can automatically identify areas of structure in these images. So the nice thing is that the human and the computer can now work together. So the human, as you can see here, is telling the computer about areas of interest which it wants the computer then to try and use to improve its algorithm. Now, these deep learning systems actually are in 16,000-dimensional space, so you can see here the computer rotating this through that space, trying to find new areas of structure. And when it does so successfully, the human who is driving it can then point out the areas that are interesting. So here, the computer has successfully found areas, for example, angles. So as we go through this process, we're gradually telling the computer more and more about the kinds of structures we're looking for. You can imagine in a diagnostic test this would be a pathologist identifying areas of pathosis, for example, or a radiologist indicating potentially troublesome nodules. And sometimes it can be difficult for the algorithm. In this case, it got kind of confused. The fronts and the backs of the cars are all mixed up. So here we have to be a bit more careful, manually selecting these fronts as opposed to the backs, then telling the computer that this is a type of group that we're interested in.

porque es algo que todos podemos entender. Así que, empezamos con un millón y medio de imágenes de autos, y quiero crear algo que pueda dividirlas en el ángulo en el que la foto fue tomada. Estas imágenes están sin etiquetar, así que tengo que empezar desde cero. Con nuestro algoritmo de aprendizaje profundo, se pueden identificar automáticamente áreas de la estructura en estas imágenes. Lo bueno es que la persona y el computador pueden trabajar juntos. Así que la persona, como pueden ver aquí, le está indicando al computador áreas de interés que quiere que el computador pruebe y use para mejorar su algoritmo. Estos sistemas de aprendizaje profundo están en un espacio de 16 000 dimensiones, así, pueden ver aquí cómo el computador rota esto en ese espacio, intentando encontrar nuevas áreas de estructura. Y cuando lo hace con éxito, la persona que lo maneja puede, entonces, señalar las áreas de interés. Aquí, el computador ha encontrado, con éxito, áreas, por ejemplo, ángulos. Conforme avanzamos en este proceso, vamos diciendo, gradualmente, al computador más y más sobre los tipos de estructuras que estamos buscando. Pueden imaginar en una prueba diagnóstica que esto debería ser un patólogo identificando áreas patológicas, por ejemplo, o un radiólogo indicando nódulos potencialmente problemáticos. A veces puede ser difícil para el algoritmo. En este caso, queda algo confuso. Las partes delanteras y traseras de los autos están todas mezcladas. Así que tenemos que ser un poco más cuidadosos, seleccionando manualmente las partes delanteras en contraposición a las traseras, para luego decir al computador que este es una especie de grupo

So we do that for a while, we skip over a little bit,

es en el que estamos interesados.

and then we train the machine learning algorithm based on these couple of hundred things, and we hope that it's gotten a lot better. You can see, it's now started to fade some of these pictures out, showing us that it already is recognizing how to understand some of these itself. We can then use this concept of similar images, and using similar images, you can now see, the computer at this point is able to entirely find just the fronts of cars. So at this point, the human can tell the computer, okay, yes, you've done a good job of that.

Hacemos esto por un tiempo, nos saltamos un poco, y luego probamos el algoritmo de aprendizaje automático basado en un par de cientos de cosas, y esperamos que haya mejorado mucho. Se puede ver, que han empezado a desvanecerse algunas de estas imágenes, mostrándonos que ya está reconociendo cómo entender por sí mismo algunas de ellas. Entonces podemos usar este concepto en imágenes similares, y usando imágenes similares, como pueden ver, en este punto, el computador puede encontrar solo la parte delantera de los autos. En este punto, la persona puede decir al computador, de acuerdo, sí, has hecho un buen trabajo.

Sometimes, of course, even at this point it's still difficult to separate out groups. In this case, even after we let the computer try to rotate this for a while, we still find that the left sides and the right sides pictures are all mixed up together. So we can again give the computer some hints, and we say, okay, try and find a projection that separates out the left sides and the right sides as much as possible using this deep learning algorithm. And giving it that hint -- ah, okay, it's been successful. It's managed to find a way of thinking about these objects that's separated out these together.

En ocasiones, por supuesto, incluso en este punto sigue siendo difícil separar los grupos. En este caso, incluso después de que dejamos al computador que intente girar esto por un momento, seguimos encontrando que la parte izquierda y derecha de las imágenes están mezcladas. Así que podemos dar, de nuevo, al computador algunas pistas, y decimos, bien, intenta encontrar una proyección que separe los lados izquierdos de los derechos de la manera más precisa usando este algoritmo de aprendizaje profundo. Y dándole esta pista... ah, bien, ha tenido éxito. Consiguió encontrar la manera de pensar estos objetos que está separando estos que están juntos.

So you get the idea here. This is a case not where the human is being replaced by a computer, but where they're working together. What we're doing here is we're replacing something that used to take a team of five or six people about seven years and replacing it with something that takes 15 minutes for one person acting alone.

Así se entiende la idea aquí. Este es un caso en el que la persona no es reemplazada por un computador, sino que trabajan juntos. Estamos reemplazando algo que solía necesitar de un equipo de 5 o 6 personas durante 7 años por algo que lleva 15 minutos a una sola persona.

So this process takes about four or five iterations. You can see we now have 62 percent of our 1.5 million images classified correctly. And at this point, we can start to quite quickly grab whole big sections, check through them to make sure that there's no mistakes. Where there are mistakes, we can let the computer know about them. And using this kind of process for each of the different groups, we are now up to an 80 percent success rate in classifying the 1.5 million images. And at this point, it's just a case of finding the small number that aren't classified correctly, and trying to understand why. And using that approach, by 15 minutes we get to 97 percent classification rates.

Este proceso lleva unas 4 o 5 iteraciones. Ahora pueden ver que tenemos un 62 % de nuestro millón y medio de imágenes clasificadas correctamente. En este punto, podemos empezar, con bastante rapidez, a tomar grandes secciones completas, comprobándolas para asegurarse que no hay errores. Cuando hay errores, podemos hacérselo saber al computador. Usando este tipo de proceso para cada uno de los diferentes grupos, nos colocamos en un índice del 80 % de éxito en la clasificación de un millón y medio de imágenes. En este punto, es solo cuestión de encontrar el pequeño número que no está clasificado correctamente, e intentar comprender el porqué. Usando este enfoque, en 15 minutos alcanza un índice de clasificación del 97 %.

So this kind of technique could allow us to fix a major problem, which is that there's a lack of medical expertise in the world. The World Economic Forum says that there's between a 10x and a 20x shortage of physicians in the developing world, and it would take about 300 years to train enough people to fix that problem. So imagine if we can help enhance their efficiency using these deep learning approaches?

Este tipo de técnica nos permite arreglar un problema mayor, que es que hay una falta de conocimientos médicos en el mundo. El Foro Económico Mundial dice que hay entre 10 y 20 veces de escasez de físicos en el mundo desarrollado, y llevará unos 300 años entrenar a gente suficiente para arreglar el problema. Imaginen que pudiésemos ayudar a aumentar su eficiencia usando estos métodos de aprendizaje profundo.

So I'm very excited about the opportunities. I'm also concerned about the problems. The problem here is that every area in blue on this map is somewhere where services are over 80 percent of employment. What are services? These are services. These are also the exact things that computers have just learned how to do. So 80 percent of the world's employment in the developed world is stuff that computers have just learned how to do. What does that mean? Well, it'll be fine. They'll be replaced by other jobs. For example, there will be more jobs for data scientists. Well, not really. It doesn't take data scientists very long to build these things. For example, these four algorithms were all built by the same guy. So if you think, oh, it's all happened before, we've seen the results in the past of when new things come along and they get replaced by new jobs, what are these new jobs going to be? It's very hard for us to estimate this, because human performance grows at this gradual rate, but we now have a system, deep learning, that we know actually grows in capability exponentially. And we're here. So currently, we see the things around us and we say, "Oh, computers are still pretty dumb." Right? But in five years' time, computers will be off this chart. So we need to be starting to think about this capability right now.

Estoy muy entusiasmado con las oportunidades. También estoy preocupado por los problemas. El problema aquí es que cada área azul de este mapa es algún sitio donde el empleo, de servicios es mayor del 80 %. ¿Qué son los servicios? Los servicios son estos. Estas son también las mismas cosas que los computadores acaban de aprender a hacer. Así que el 80 % del empleo mundial en el mundo desarrollado son cosas que los computadores acaban de aprender a hacer. ¿Qué significa esto? Bueno, no habrá problema, lo reemplazarán por otros trabajos. Por ejemplo, habrá más trabajos para los científicos de datos. Bueno, realmente no. A los científicos de datos no les lleva mucho tiempo construir estas cosas. Por ejemplo, estos 4 algoritmos fueron creados por el mismo chico. Así que si piensan, todo ha pasado ya antes, hemos visto los resultados en el pasado de cuando surgen cosas nuevas y son reemplazadas por nuevos trabajos. ¿Qué trabajos van a ser? Es muy difícil para nosotros hacer una estimación ya que el comportamiento humano crece a un ritmo gradual, pero ahora tenemos un sistema, aprendizaje profundo, que sabemos que crece en capacidad, exponencialmente. Y aquí estamos. Actualmente, vemos las cosas a nuestro alrededor y decimos: "Los computadores siguen siendo un poco estúpidos". ¿Verdad? Pero en 5 años, los computadores estarán fuera de esta gráfica.

We have seen this once before, of course. In the Industrial Revolution, we saw a step change in capability thanks to engines. The thing is, though, that after a while, things flattened out. There was social disruption, but once engines were used to generate power in all the situations, things really settled down. The Machine Learning Revolution is going to be very different from the Industrial Revolution, because the Machine Learning Revolution, it never settles down. The better computers get at intellectual activities, the more they can build better computers to be better at intellectual capabilities, so this is going to be a kind of change that the world has actually never experienced before, so your previous understanding of what's possible is different.

Así que necesitamos empezar a pensar sobre esta capacidad ahora mismo. Lo hemos visto anteriormente, por supuesto. En la Revolución Industrial, vimos un cambio en la capacidad gracias a los motores. El asunto es, sin embargo, que tras un tiempo, las cosas se nivelan. Hubo una alteración social, pero una vez que los motores se usaron para generar energía en todas las situaciones, las cosas realmente se establecieron. La Revolución del Aprendizaje Automático va a ser diferente a la Revolución Industrial porque la Revolución del Aprendizaje Automático, nunca se asienta. Cuanto mejores son los computadores en actividades intelectuales, mejores computadores se crearán para que mejoren su capacidad intelectual, así que esto va a ser una especie de cambio que nunca antes había experimentado el mundo, por lo que el entendimiento previo de lo posible, es diferente.

This is already impacting us. In the last 25 years, as capital productivity has increased, labor productivity has been flat, in fact even a little bit down.

Esto nos impacta. En los últimos 25 años, la productividad del capital se ha incrementado, la productividad laboral se ha mantenido, incluso ha descendido.

So I want us to start having this discussion now. I know that when I often tell people about this situation, people can be quite dismissive. Well, computers can't really think, they don't emote, they don't understand poetry, we don't really understand how they work. So what? Computers right now can do the things that humans spend most of their time being paid to do, so now's the time to start thinking about how we're going to adjust our social structures and economic structures to be aware of this new reality. Thank you. (Applause)

Por lo que quiero que empecemos a discutir esto ahora. Sé que cuando hablo sobre esta situación la gente puede ser despectiva. Bueno, los computadores no pueden realmente pensar, no tienen sentimientos, no entienden poesía, no entendemos realmente cómo funcionan. Y, ¿qué? Actualmente los computadores pueden hacer cosas en las que las personas gastan su tiempo y les pagan por ello así pues ahora tenemos que empezar a pensar sobre cómo vamos a ajustar nuestras estructuras sociales y económicas para ser conscientes de esta nueva realidad. Gracias. (Aplausos)

The New York Times also showed in this article another extraordinary result of deep learning which I'm going to show you now. It shows that computers can listen and understand.

The New York Times mostró, también en su artículo otro resultado extraordinario del aprendizaje profundo que mostraré ahora. Demuestra que los computadores pueden escuchar y comprender.

So we do that for a while, we skip over a little bit,

es en el que estamos interesados.

This is already impacting us. In the last 25 years, as capital productivity has increased, labor productivity has been flat, in fact even a little bit down.

Esto nos impacta. En los últimos 25 años, la productividad del capital se ha incrementado, la productividad laboral se ha mantenido, incluso ha descendido.