Hod Lipson: Building "self-aware" robots

So, where are the robots? We've been told for 40 years already that they're coming soon. Very soon they'll be doing everything for us. They'll be cooking, cleaning, buying things, shopping, building. But they aren't here. Meanwhile, we have illegal immigrants doing all the work, but we don't have any robots. So what can we do about that? What can we say? So I want to give a little bit of a different perspective of how we can perhaps look at these things in a little bit of a different way. And this is an x-ray picture of a real beetle, and a Swiss watch, back from '88. You look at that -- what was true then is certainly true today. We can still make the pieces. We can make the right pieces. We can make the circuitry of the right computational power, but we can't actually put them together to make something that will actually work and be as adaptive as these systems.

Entonces, ¿dónde están los robots? Nos han dicho por 40 años que ya vienen. Muy pronto estarán haciendo todo por nosotros: cocinarán, limpiarán, comprarán, construirán. Pero aún no están aquí. Mientras tanto tenemos inmigrantes ilegales haciendo todo el trabajo, pero no tenemos robots. ¿Qué podemos hacer al respecto? ¿Qué podemos decir? Quiero compartir un poco de otra perspectiva, de cómo quizás podamos ver estas cosas de forma ligeramente distinta. Y esta es una foto de rayos X de un escarabajo y un reloj suizo, del año 1988. Viendo eso, lo que fue cierto entonces es cierto hoy. Podemos hacer las piezas y hacerlas correctamente, podemos hacer los circuitos para una capacidad de cómputo adecuada, pero en la práctica no podemos unirlos para construir algo que realmente funcione y sea tan adaptable como estos sistemas.

So let's try to look at it from a different perspective. Let's summon the best designer, the mother of all designers. Let's see what evolution can do for us. So we threw in -- we created a primordial soup with lots of pieces of robots -- with bars, with motors, with neurons. Put them all together, and put all this under kind of natural selection, under mutation, and rewarded things for how well they can move forward. A very simple task, and it's interesting to see what kind of things came out of that.

Así que intentemos verlo desde una perspectiva diferente. Llamemos al mejor diseñador, la madre de todos los diseñadores: veamos que puede hacer por nosotros la evolución. Entonces mezclamos, creamos una sopa primordial con montones de piezas de robots, con barras, motores, neuronas. Los juntamos todos, pusimos todo eso bajo cierta clase de selección natural, bajo mutación, y recompensamos la capacidad de avance. Una tarea muy simple, y es interesante ver la clase de cosas que surgen de ahí.

So if you look, you can see a lot of different machines come out of this. They all move around. They all crawl in different ways, and you can see on the right, that we actually made a couple of these things, and they work in reality. These are not very fantastic robots, but they evolved to do exactly what we reward them for:

Así que si observan, pueden ver un montón de máquinas diferentes saliendo de esto. Todas se mueven, todas ellas se arrastran a su manera y, pueden ver a la derecha que efectivamente hemos fabricado un par de estas cosas, y funcionan en el mundo real. No son robots muy fantásticos pero evolucionaron para hacer exactamente aquello por lo que los recompensamos;

for moving forward. So that was all done in simulation, but we can also do that on a real machine. Here's a physical robot that we actually have a population of brains, competing, or evolving on the machine. It's like a rodeo show. They all get a ride on the machine, and they get rewarded for how fast or how far they can make the machine move forward. And you can see these robots are not ready to take over the world yet, but they gradually learn how to move forward, and they do this autonomously.

para moverse hacia adelante. Así que eso fue hecho en simulación pero también podemos hacerlo en una máquina de verdad. Aquí está un robot físico que de hecho tiene una población de cerebros, compitiendo o evolucionando, en la máquina. Es como un espectáculo de rodeo: todos se pueden subir a la máquina y son recompensados por qué tan rápido o qué tan lejos pueden hacer que la máquina avance. Y pueden ver que esos robots no están listos aún para dominar al mundo, pero aprenden gradualmente a avanzar y hacen esto de forma autónoma.

So in these two examples, we had basically machines that learned how to walk in simulation, and also machines that learned how to walk in reality. But I want to show you a different approach, and this is this robot over here, which has four legs. It has eight motors, four on the knees and four on the hip. It has also two tilt sensors that tell the machine which way it's tilting.

Así que en esos dos ejemplos, básicamente tenemos máquinas que aprendieron cómo caminar en una simulación y también máquinas que aprendieron cómo caminar en la realidad. Pero quiero mostrarles un enfoque distinto y es este robot de aquí, el cual tiene cuatro patas, tiene ocho motores, cuatro en las rodillas y cuatro en las caderas. Tiene también 2 sensores de inclinación que le dicen a la máquina por cuál lado se está inclinando.

But this machine doesn't know what it looks like. You look at it and you see it has four legs, the machine doesn't know if it's a snake, if it's a tree, it doesn't have any idea what it looks like, but it's going to try to find that out. Initially, it does some random motion, and then it tries to figure out what it might look like. And you're seeing a lot of things passing through its minds, a lot of self-models that try to explain the relationship between actuation and sensing. It then tries to do a second action that creates the most disagreement among predictions of these alternative models, like a scientist in a lab. Then it does that and tries to explain that, and prune out its self-models.

Pero esta máquina no sabe cómo es ella misma. Mirándola sabes que tiene cuatro patas pero la máquina no sabe si es una serpiente, un árbol, no tiene ni idea de cómo es pero va a tratar de descubrirlo. Al principio, hace algunos movimientos al azar y entonces intenta descubrir cómo puede ser y se ven un montón de cosas pasar por "sus mentes", un montón de modelos de sí misma que tratan de explicar la relación entre su actuación y su percepción y entonces intenta ejecutar una segunda acción que genere la mayor discrepancia entre las predicciones de esos modelos alternativos, como un científico en un laboratorio. Entonces hace eso e intenta explicarlo y depura sus modelos de sí misma.

This is the last cycle, and you can see it's pretty much figured out what its self looks like. And once it has a self-model, it can use that to derive a pattern of locomotion. So what you're seeing here are a couple of machines -- a pattern of locomotion. We were hoping that it wass going to have a kind of evil, spidery walk, but instead it created this pretty lame way of moving forward.

Este es el último ciclo y pueden ver que prácticamente ha descubierto cómo es y una vez que tiene un modelo de sí misma, puede usarlo para derivar un patrón de locomoción. Así que lo que ven aquí son un par de máquinas, un patrón de locomoción. Esperábamos que fuera a tener un tipo de andar maligno, arácnido, pero en cambio creó esta forma de avanzar bastante patética.

But when you look at that, you have to remember that this machine did not do any physical trials on how to move forward, nor did it have a model of itself. It kind of figured out what it looks like, and how to move forward, and then actually tried that out. (Applause)

Pero cuando miran eso tienen que recordar que esta máquina no hizo ningún ensayo físico sobre cómo avanzar, ni tenía un modelo de sí misma. Más o menos descubrió cómo era, y cómo podía avanzar, y entonces lo intentó en realidad. (Aplausos)

So, we'll move forward to a different idea. So that was what happened when we had a couple of -- that's what happened when you had a couple of -- OK, OK, OK -- (Laughter) -- they don't like each other. So there's a different robot. That's what happened when the robots actually are rewarded for doing something. What happens if you don't reward them for anything, you just throw them in?

Entonces, probemos una idea diferente. Eso fue lo que ocurrió cuando teníamos un par de... fue lo que ocurrió cuando tienes un par... OK, OK, OK, (Risas) no se caen bien. Entonces hay un robot diferente. Eso fue lo que ocurrió cuando los robots fueron recompensados por hacer algo. ¿Qué ocurre si no los recompensas en lo absoluto, si sólo los pones ahí?

So we have these cubes, like the diagram showed here. The cube can swivel, or flip on its side, and we just throw 1,000 of these cubes into a soup -- this is in simulation --and don't reward them for anything, we just let them flip. We pump energy into this and see what happens in a couple of mutations. So, initially nothing happens, they're just flipping around there. But after a very short while, you can see these blue things on the right there begin to take over.

Tenemos estos cubos, como el diagrama muestra aquí. El cubo puede girar, o ponerse de lado, y dejamos mil de esos cubos en una sopa, esto es en simulación, y no los recompensamos por nada, sólo los dejamos dar vueltas. Le insertamos energía a esto y vemos que ocurre en un par de mutaciones. Al principio no pasa nada, sólo están dando de vueltas por ahí. Pero muy poco después puedes ver esas cosas azules a la derecha que empiezan a prevalecer.

They begin to self-replicate. So in absence of any reward, the intrinsic reward is self-replication. And we've actually built a couple of these, and this is part of a larger robot made out of these cubes. It's an accelerated view, where you can see the robot actually carrying out some of its replication process. So you're feeding it with more material -- cubes in this case -- and more energy, and it can make another robot. So of course, this is a very crude machine, but we're working on a micro-scale version of these, and hopefully the cubes will be like a powder that you pour in.

Empiezan a autorreplicarse. Así que en ausencia de toda recompensa la recompensa intrínseca es autorreplicación. Y de hecho hemos construido un par de éstos y esto es parte de un robot más grande hecho con estos cubos, es una vista acelerada donde puedes ver al robot ejecutando parte de su proceso de replicación. Así que si estás dándole más material, cubos en este caso, y más energía y puede hacer otro robot. Por supuesto, es una máquina muy rudimentaria pero estamos trabajando en una versión a microescala, esperando que algún día los cubos sean como un polvo que puedas echar.

OK, so what can we learn? These robots are of course not very useful in themselves, but they might teach us something about how we can build better robots, and perhaps how humans, animals, create self-models and learn. And one of the things that I think is important is that we have to get away from this idea of designing the machines manually, but actually let them evolve and learn, like children, and perhaps that's the way we'll get there. Thank you. (Applause)

¿Qué podemos aprender? Estos robots, por supuesto, no son muy útiles por sí mismos pero podrían enseñarnos algo sobre cómo podemos construir mejores robots y quizás cómo los humanos y animales crean modelos de sí mismos y aprenden. Y una de las cosas que creo es importante es que tenemos que abandonar esta idea de diseñar máquinas manualmente sino más bien dejarlas evolucionar y aprender, como niños, y quizás esa es la forma en que lo consigamos. Gracias. (Aplausos)