Briana Brownell: How does artificial intelligence learn?

Today, artificial intelligence helps doctors diagnose patients, pilots fly commercial aircraft, and city planners predict traffic. But no matter what these AIs are doing, the computer scientists who designed them likely don’t know exactly how they’re doing it. This is because artificial intelligence is often self-taught, working off a simple set of instructions to create a unique array of rules and strategies. So how exactly does a machine learn?

Danas veštačka inteligencija pomaže doktorima da dijagnostikuju pacijente, pilotima da lete komercijalnim avionima i urbanistima da predvide saobraćaj. Bez obzira na to šta ova VI radila, informatičari koji su je dizajnirali verovatno ne znaju kako ona tačno radi. To je iz razloga što je veštačka inteligencija najčešće samouka i radi na osnovu jednostavne grupe komandi da bi stvorila jedinstveni niz pravila i strategija. Kako tačno mašina uči?

There are many different ways to build self-teaching programs. But they all rely on the three basic types of machine learning: unsupervised learning, supervised learning, and reinforcement learning. To see these in action, let’s imagine researchers are trying to pull information from a set of medical data containing thousands of patient profiles.

Postoje različiti načini kako se prave samouki programi. Ali su svi zasnovani na tri osnovne vrste mašinskog učenja: nenadgledano učenje, nadgledano učenje i učenje potkrepljivanjem. Da bismo ih videli na delu, zamislimo istraživače koji pokušavaju da izvuku informacije iz grupe medicinskih podataka koji sadrže hiljade profila pacijenata.

First up, unsupervised learning. This approach would be ideal for analyzing all the profiles to find general similarities and useful patterns. Maybe certain patients have similar disease presentations, or perhaps a treatment produces specific sets of side effects. This broad pattern-seeking approach can be used to identify similarities between patient profiles and find emerging patterns, all without human guidance.

Prvo imamo nenadgledano učenje. Ovaj pristup bi bio idealan za analiziranje svih profila i pronalaženje sličnosti i korisnih obrazaca. Možda pojedini pacijenti imaju slične kliničke slike ili terapija ima određeni skup neželjenih dejstava. Ovaj opšti pristup traženju obrazaca može se koristiti da se pronađu sličnosti između profila pacijenata, kao i za pronalaženje novih obrazaca, sve to bez ljudskog delovanja.

But let's imagine doctors are looking for something more specific. These physicians want to create an algorithm for diagnosing a particular condition. They begin by collecting two sets of data— medical images and test results from both healthy patients and those diagnosed with the condition. Then, they input this data into a program designed to identify features shared by the sick patients but not the healthy patients. Based on how frequently it sees certain features, the program will assign values to those features’ diagnostic significance, generating an algorithm for diagnosing future patients. However, unlike unsupervised learning, doctors and computer scientists have an active role in what happens next. Doctors will make the final diagnosis and check the accuracy of the algorithm’s prediction. Then computer scientists can use the updated datasets to adjust the program’s parameters and improve its accuracy. This hands-on approach is called supervised learning.

Recimo da lekari traže nešto specifičnije. Ti lekari žele da stvore algoritam za dijagnostikovanje određene bolesti. Počnu tako što sakupljaju dve grupe podataka - medicinske slike i rezultate testova zdravih pacijenata, ali i od obolelih. Zatim te podatke unose u program dizajniran da prepozna karakteristike prisutne kod bolesnih pacijenata, ali ne i zdravih. Na osnovu toga koliko često uoči određene karakteristike, program će odrediti vrednosti koliko je neka karakteristika važna, stvarajući algoritam za dijagnostikovanje budućih pacijenata. Ali za razliku od nenadgledanog učenja, doktori i informatičari imaju aktivnu ulogu u onome što sledi. Doktori će napraviti konačnu dijagnozu i proveriti preciznost predviđanja algoritma. Onda informatičari mogu da koriste dopunjene setove podataka da prilagode parametre programa i poboljšaju njegovu preciznost. Ovaj direktni pristup se naziva nadgledano učenje.

Now, let’s say these doctors want to design another algorithm to recommend treatment plans. Since these plans will be implemented in stages, and they may change depending on each individual's response to treatments, the doctors decide to use reinforcement learning. This program uses an iterative approach to gather feedback about which medications, dosages and treatments are most effective. Then, it compares that data against each patient’s profile to create their unique, optimal treatment plan. As the treatments progress and the program receives more feedback, it can constantly update the plan for each patient. None of these three techniques are inherently smarter than any other. While some require more or less human intervention, they all have their own strengths and weaknesses which makes them best suited for certain tasks. However, by using them together, researchers can build complex AI systems, where individual programs can supervise and teach each other. For example, when our unsupervised learning program finds groups of patients that are similar, it could send that data to a connected supervised learning program. That program could then incorporate this information into its predictions. Or perhaps dozens of reinforcement learning programs might simulate potential patient outcomes to collect feedback about different treatment plans.

Recimo da ovi doktori žele da naprave još jedan algoritam za preporučivanje planova za lečenje. S obzirom na to da će ovi planovi biti primenjeni u fazama, i možda će se menjati na osnovu reakcije pojedinaca na terapiju, doktori su odlučili da koriste učenje potkrepljivanjem. Ovaj program koristi učestan pristup za sakupljanje povratnih informacija o tome koji lekovi, doze i terapije su najefikasniji. Zatim poredi podatke sa profilom svakog pacijenta i stvara njihov jedinstveni, optimalni plan lečenja. Kako lečenje napreduje i program dobija sve više povratnih informacija, on može stalno da ažurira plan za svakog pacijenta. Nijedna od ove tri tehnike nije “pametnija” od ostalih. Nekima je potrebno više ili manje ljudske intervencije, sve imaju svoje prednosti i mane što ih čini odgovarajućim za određene zadatke. No koristeći ih zajedno, istraživači su izgradili kompleksne sisteme veštačke inteligencije gde pojedinačni programi mogu da nadgledaju i uče jedni druge. Na primer, kada naš program nenadgledanog učenja pronađe grupu pacijenata koji su slični, može da pošalje te podatke u program nadgledanog učenja sa kojim je povezan. Taj program onda može da pripoji ovu informaciju u svoja predviđanja. Ili desetine programa učenja potkrepljivanjem mogu da simuliraju moguće ishode kod pacijenata kako bi sakupili povratne informacije o različitim terapijama.

There are numerous ways to create these machine-learning systems, and perhaps the most promising models are those that mimic the relationship between neurons in the brain. These artificial neural networks can use millions of connections to tackle difficult tasks like image recognition, speech recognition, and even language translation. However, the more self-directed these models become, the harder it is for computer scientists to determine how these self-taught algorithms arrive at their solution. Researchers are already looking at ways to make machine learning more transparent. But as AI becomes more involved in our everyday lives, these enigmatic decisions have increasingly large impacts on our work, health, and safety. So as machines continue learning to investigate, negotiate and communicate, we must also consider how to teach them to teach each other to operate ethically.

Postoje mnogi načini da se naprave ovakvi sistemi mašinskog učenja, a možda najobećavajući su oni koji imitiraju vezu između neurona u mozgu. Ove veštačke neuronske mreže mogu da koriste milione veza za teške zadatke poput prepoznavanja slika, govora, pa čak i prevođenja sa drugih jezika. Ali što su ovi modeli samostalniji, teže je informatičarima da odrede kako samouki algoritmi dolaze do svojih zaključaka. Istraživači već traže načine da učine mašinsko učenje transparentnijim. Međutim, kako VI postaje sve više uključena u naš svakodnevni život, ove enigmatične odluke imaju mnogo veći uticaj na naše poslove, zdravlje i bezbednost. Kako mašine nastavljaju da uče da istražuju, pregovaraju i komuniciraju moramo takođe uzeti u obzir da ih naučimo da uče jedna drugu da rade etički.

Briana Brownell: How does artificial intelligence learn?

Briana Brownell: How does artificial intelligence learn?

Related talks

José Américano N L F de Freitas: How exactly does binary code work?

Patrick Lin: The ethical dilemma of self-driving cars

David J. Malan: What's an algorithm?

Related talks

José Américano N L F de Freitas: How exactly does binary code work?

Patrick Lin: The ethical dilemma of self-driving cars

David J. Malan: What's an algorithm?