Briana Brownell: How does artificial intelligence learn?

Today, artificial intelligence helps doctors diagnose patients, pilots fly commercial aircraft, and city planners predict traffic. But no matter what these AIs are doing, the computer scientists who designed them likely don’t know exactly how they’re doing it. This is because artificial intelligence is often self-taught, working off a simple set of instructions to create a unique array of rules and strategies. So how exactly does a machine learn?

Bugün, yapay zeka doktorların hastalık teşhis etmesine, pilotların uçakları kontrol etmesine ve şehir mimarlarının trafik tahmini yapmasına yardım ediyor. Ancak yapay zeka ne yaparsa yapsın, onları tasarlayan bilgisayar bilimciler muhtemelen nasıl çalıştıklarını tam olarak bilemiyor. Bunun sebebi, yapay zekanın genellikle kendi kendine öğrenmesi, bunun için bir dizi talimattan yola çıkarak eşsiz bir kural ve strateji bütünü yaratıyorlar.

There are many different ways to build self-teaching programs. But they all rely on the three basic types of machine learning: unsupervised learning, supervised learning, and reinforcement learning. To see these in action, let’s imagine researchers are trying to pull information from a set of medical data containing thousands of patient profiles.

Peki bir makine tam olarak nasıl öğreniyor? Self öğrenme programı inşa etmek için çok sayıda farklı yol var. Ama hepsi üç temel makine öğrenimi türüne dayanıyor: kontrolsüz öğrenme, kontrollü öğrenme ve teşvikli öğrenme. Bunları uygulamada görmek için binlerce hasta profili içeren bir tıbbi veri setinden araştırmacıların bilgi edinmeye çalıştığını düşünelim.

First up, unsupervised learning. This approach would be ideal for analyzing all the profiles to find general similarities and useful patterns. Maybe certain patients have similar disease presentations, or perhaps a treatment produces specific sets of side effects. This broad pattern-seeking approach can be used to identify similarities between patient profiles and find emerging patterns, all without human guidance.

Öncelikle kontrolsüz öğrenme. Genel benzerlikler ve faydalı örekler bulmak için tüm profilleri analiz etmede ideal bir yaklaşım. Belki de bazı hastaların benzer hastalık semptomları vardır veya belki de bir tedavi belli yan etkiler gösteriyordur. Bu örnek arayan geniş çaplı yaklaşım hasta profilleri ve ortaya çıkan modeller arasında benzerlik bulmak için kullanılabilir. Hem de bir insanın rehberliği olmadan.

But let's imagine doctors are looking for something more specific. These physicians want to create an algorithm for diagnosing a particular condition. They begin by collecting two sets of data— medical images and test results from both healthy patients and those diagnosed with the condition. Then, they input this data into a program designed to identify features shared by the sick patients but not the healthy patients. Based on how frequently it sees certain features, the program will assign values to those features’ diagnostic significance, generating an algorithm for diagnosing future patients. However, unlike unsupervised learning, doctors and computer scientists have an active role in what happens next. Doctors will make the final diagnosis and check the accuracy of the algorithm’s prediction. Then computer scientists can use the updated datasets to adjust the program’s parameters and improve its accuracy. This hands-on approach is called supervised learning.

Ama doktorların daha spesifik bir şey aradıklarını varsayalım. Bu doktorlar belli bir hastalığı teşhis etmek için bir algoritma yaratmak istiyorlar. İki veri seti toplayarak başlarlar— hem sağlıklı hastalardan hem de bu tanının konduğu hastalardan tıbbi görseller ve tahlil sonuçları. Sonra bu verileri bir programa eklerler, program, sağlıklı olanların değil tanılı hastaların ortak özelliklerini bulmak için tasarlanmıştır. Belli özellikleri ne sıklıkta gördüğüne dayalı olarak program, o özelliklere teşhis için değer tanımlayacaktır, bu da gelecekte hastaları teşhis eden bir algoritma yaratmış olur. Ancak, kontrollü öğrenmeden farklı olarak doktorlar ve bilgisayar bilimcilerin sonrasında aktif bir rolü var. Doktorlar nihai tanıyı kendi koyar ve algoritmanın tahmindeki doğruluğunu kontrol eder. Bilgisayar bilimci ise güncel veri setlerini kullanarak programın parametrelerini ayarlar ve doğruluk payını iyileştirir. Bu pratik yönteme kontrollü öğrenme denir.

Now, let’s say these doctors want to design another algorithm to recommend treatment plans. Since these plans will be implemented in stages, and they may change depending on each individual's response to treatments, the doctors decide to use reinforcement learning. This program uses an iterative approach to gather feedback about which medications, dosages and treatments are most effective. Then, it compares that data against each patient’s profile to create their unique, optimal treatment plan. As the treatments progress and the program receives more feedback, it can constantly update the plan for each patient. None of these three techniques are inherently smarter than any other. While some require more or less human intervention, they all have their own strengths and weaknesses which makes them best suited for certain tasks. However, by using them together, researchers can build complex AI systems, where individual programs can supervise and teach each other. For example, when our unsupervised learning program finds groups of patients that are similar, it could send that data to a connected supervised learning program. That program could then incorporate this information into its predictions. Or perhaps dozens of reinforcement learning programs might simulate potential patient outcomes to collect feedback about different treatment plans.

Bir de doktorların tedavi önermek için başka bir algoritma tasarladığını düşünelim. Bu planlar aşamalı halde uygulanacağı ve her bireyin tedaviye verdiği farklı yanıta göre değişebileceği için doktorlar güçlendirilmiş öğrenme kullanmaya karar veriyorlar. Bu program tekrarlı bir yaklaşım kullanıyor, hangi ilaçların, dozun ve tedavinin en etkili olduğuyla ilgili dönüt topluyor. Sonra bu verileri her bir hastanın profiliyle karşılaştırıyor, böylece eşsiz ve optimal bir tedavi planı ortaya çıkıyor. Tedavi ilerledikçe ve program daha fazla dönüt aldıkça Her bir hasta için sürekli planı güncelleyebilir. Bu üç teknikten hiçbiri bir diğerinden daha zeki değil. Bazıları daha az veya daha fazla insan müdahalesi gerektiriyor ama hepsinin güçlü ve zayıf yanları var, böylece belli amaçlar için daha uygun oluyorlar. Ancak birlikte kullanıldıklarında araştırmacılar karmaşık yapay zeka sistemleri yapabilir ve bireysel programlar birbirini kontrol edebilir. Örneğin kontrolsüz öğrenme programımız benzer bir grup hasta bulabilir, bu veriyi bağlı olduğu kontrollü bir öğrenme programına gönderebilir. Sonra bu program bu bilgiyle tahminlerde bulunabilir. Veya düzinelerce güçlendirilmiş öğrenme programı potansiyel hasta sonuçlarını simüle ederek farklı tedavi planları üzerine geri bildirim toplayabilir.

There are numerous ways to create these machine-learning systems, and perhaps the most promising models are those that mimic the relationship between neurons in the brain. These artificial neural networks can use millions of connections to tackle difficult tasks like image recognition, speech recognition, and even language translation. However, the more self-directed these models become, the harder it is for computer scientists to determine how these self-taught algorithms arrive at their solution. Researchers are already looking at ways to make machine learning more transparent. But as AI becomes more involved in our everyday lives, these enigmatic decisions have increasingly large impacts on our work, health, and safety. So as machines continue learning to investigate, negotiate and communicate, we must also consider how to teach them to teach each other to operate ethically.

Bu makine öğrenimi sistemlerini yaratmada çok sayıda yöntem var ve muhtemelen en umut verici modeller beyindeki nöronların ilişkisini taklit edenler. Bu yapay sinirsel ağlar, milyonlarca bağlantı kullanarak görsel tanıma, ses tanıma ve hatta dil çevirisi gibi zor işlerin bile üstesinden gelebilir. Ama bu modeller kendi kendini yönettikçe kendi kendine öğrenen algoritmaların çözümde ne kadar faydalı olduklarını anlamak zorlaşıyor. Araştırmacılar makine öğrenimini daha şeffaf yapmak için zaten uğraşıyorlar. Fakat yapay zeka günlük hayatımızla içe içe geçtikçe işimiz, sağlığımız ve güvenliğimiz üzerinde bu şifreli kararların etkisi giderek artıyor. Makineler araştırma, müzakere ve iletişim konusunda öğrenmeye devam ettikçe onlara birbirleriyle etik bir şekilde çalışmayı öğretmemiz gerekiyor.

Briana Brownell: How does artificial intelligence learn?

Briana Brownell: How does artificial intelligence learn?

Related talks

José Américano N L F de Freitas: How exactly does binary code work?

Patrick Lin: The ethical dilemma of self-driving cars

David J. Malan: What's an algorithm?

Related talks

José Américano N L F de Freitas: How exactly does binary code work?

Patrick Lin: The ethical dilemma of self-driving cars

David J. Malan: What's an algorithm?