Briana Brownell: How does artificial intelligence learn?

Today, artificial intelligence helps doctors diagnose patients, pilots fly commercial aircraft, and city planners predict traffic. But no matter what these AIs are doing, the computer scientists who designed them likely don’t know exactly how they’re doing it. This is because artificial intelligence is often self-taught, working off a simple set of instructions to create a unique array of rules and strategies. So how exactly does a machine learn?

ทุกวันนี้ปัญญาประดิษฐ์ สามารถช่วยหมอวินิจฉัยโรค ช่วยนักบินขับเครื่องบินพาณิชย์ และช่วยนักผังเมืองพยากรณ์สภาพจราจร แต่ไม่ว่าจะเป็นเอไอสำหรับงานอะไรก็ตาม นักวิจัยผู้ออกแบบระบบเหล่านี้ มักจะไม่รู้อย่างแน่ชัดว่า มันทำงานได้อย่างไรกันแน่ นั่นเป็นเพราะว่า ปัญญาประดิษฐ์มักจะสอนตัวมันเอง โดยทำงานจากชุดคำสั่งง่ายๆ เพื่อสร้างกลุ่มก้อนเฉพาะ ของกฎเกณฑ์และกลยุทธ์ต่างๆ แล้วเครื่องจักรเรียนรู้ได้อย่างไร

There are many different ways to build self-teaching programs. But they all rely on the three basic types of machine learning: unsupervised learning, supervised learning, and reinforcement learning. To see these in action, let’s imagine researchers are trying to pull information from a set of medical data containing thousands of patient profiles.

มีหลายวิธีที่จะสร้างโปรแกรม ที่สอนตัวเองได้ แต่ทั้งหมดก็มีพื้นฐาน จากการเรียนรู้ของเครื่องสามแบบ การเรียนรู้แบบมีผู้สอน แบบไม่มีผู้สอน และแบบให้รางวัล เพื่อให้เห็นภาพได้ชัดเจนขึ้น ลองนึกภาพนักวิจัยกำลังดึงข้อมูล จากชุดข้อมูลทางการแพทย์ ที่มีข้อมูลของผู้ป่วยนับพันราย

First up, unsupervised learning. This approach would be ideal for analyzing all the profiles to find general similarities and useful patterns. Maybe certain patients have similar disease presentations, or perhaps a treatment produces specific sets of side effects. This broad pattern-seeking approach can be used to identify similarities between patient profiles and find emerging patterns, all without human guidance.

อย่างแรก การเรียนรู้แบบมีผู้สอน แนวทางนี้จะเหมาะมากสำหรับการ วิเคราะห์ข้อมูลผู้ป่วยทั้งหมด เพื่อหาความเหมือน และรูปแบบที่เป็นประโยชน์ ผู้ป่วยบางกลุ่มอาจมีการแสดงอาการ ของโรคที่คล้ายคลึงกัน หรือการรักษาแบบใดแบบหนึ่ง อาจมีผลข้างเคียงที่เฉพาะเจาะจง วิธีการหารูปแบบอย่างกว้างๆ นี้ จะสามารถนำมาใช้เพื่อหาความเหมือนกัน ระหว่างข้อมูลผู้ป่วยคนต่างๆ และหารูปแบบสำคัญที่กำลังเกิดขึ้นได้ โดยไม่จำเป็นต้องใช้มนุษย์ช่วย

But let's imagine doctors are looking for something more specific. These physicians want to create an algorithm for diagnosing a particular condition. They begin by collecting two sets of data— medical images and test results from both healthy patients and those diagnosed with the condition. Then, they input this data into a program designed to identify features shared by the sick patients but not the healthy patients. Based on how frequently it sees certain features, the program will assign values to those features’ diagnostic significance, generating an algorithm for diagnosing future patients. However, unlike unsupervised learning, doctors and computer scientists have an active role in what happens next. Doctors will make the final diagnosis and check the accuracy of the algorithm’s prediction. Then computer scientists can use the updated datasets to adjust the program’s parameters and improve its accuracy. This hands-on approach is called supervised learning.

แต่ลองจินตนาการว่าแพทย์ กำลังมองหาอะไรที่เจาะจงกว่านั้น แพทย์เหล่านั้นต้องการสร้างอัลกอริทึม เพื่อวินิจฉัยอาการเฉพาะบางอย่าง พวกเขาเริ่มด้วยการเก็บข้อมูลสองชุด ภาพถ่ายทางการแพทย์ และผลตรวจ จากทั้งกลุ่มผู้ป่วยที่แข็งแรง และจากผู้ป่วยที่วินิจฉัยแล้วว่า มีอาการ แล้วพวกเขาก็ป้อนข้อมูลเข้าไปในโปรแกรม ที่ถูกออกแบบมาเพื่อให้ระบุ คุณลักษณะที่มีในคนไข้ที่ป่วย แต่ไม่พบในคนไข้ที่ไม่มีอาการดังกล่าว และจากความถี่ ที่พบในคุณลักษณะแต่ละข้อ โปรแกรมก็จะกำหนดค่าความสำคัญต่อ ผลการวินิจฉัยของคุณลักษณะนั้นๆ สร้างขึ้นมาเป็นอัลกอริทึม เพื่อใช้ในการวินิจฉัยผู้ป่วยในอนาคต อย่างไรก็ตาม ในการเรียนรู้แบบมีผู้สอน ทั้งแพทย์และนักคอมพิวเตอร์ มีบทบาทสำคัญต่อสิ่งที่จะเกิดขึ้นต่อไป แพทย์จะต้องทำการวินิจฉัย ขั้นสุดท้าย แล้วตรวจสอบความแม่นยำ ของผลการทำนายของอัลกอริทึม แล้วนักคอมพิวเตอร์ จะสามารถใช้ ชุดข้อมูลที่ปรับปรุงแล้วนี้ เพื่อปรับแต่งตัวแปร ของโปรแกรม ในการเพิ่มความแม่นยำ แนวทางการทำงานแบบนี้ เรียกว่า การเรียนรู้แบบมีผู้สอน

Now, let’s say these doctors want to design another algorithm to recommend treatment plans. Since these plans will be implemented in stages, and they may change depending on each individual's response to treatments, the doctors decide to use reinforcement learning. This program uses an iterative approach to gather feedback about which medications, dosages and treatments are most effective. Then, it compares that data against each patient’s profile to create their unique, optimal treatment plan. As the treatments progress and the program receives more feedback, it can constantly update the plan for each patient. None of these three techniques are inherently smarter than any other. While some require more or less human intervention, they all have their own strengths and weaknesses which makes them best suited for certain tasks. However, by using them together, researchers can build complex AI systems, where individual programs can supervise and teach each other. For example, when our unsupervised learning program finds groups of patients that are similar, it could send that data to a connected supervised learning program. That program could then incorporate this information into its predictions. Or perhaps dozens of reinforcement learning programs might simulate potential patient outcomes to collect feedback about different treatment plans.

ต่อมาสมมติว่าแพทย์ต้องการ ออกแบบอัลกอริทึมอีกอย่างหนึ่ง เพื่อจะแนะนำแผนการรักษา เนื่องจากแผนเหล่านี้ จะถูกดำเนินการเป็นขั้นๆ ไป และอาจเปลี่ยนไปได้ ตามการตอบสนอง ต่อการรักษาของผู้แต่ละคน ทีมแพทย์จึงตัดสินใจใช้การ การเรียนรู้แบบให้รางวัล โปรแกรมนี้จะเป็นการทำซ้ำๆ เพื่อเก็บผลตอบรับ ว่าการให้ยาแบบไหน ปริมาณใด และการรักษาแบบใด ให้ผลดีที่สุด แล้วจึงนำผลมาเปรียบเทียบกับ ข้อมูลผู้ป่วยแต่ละราย เพื่อสร้างแผนการรักษา เฉพาะตัวที่เหมาะสมที่สุด เมื่อการรักษาดำเนินไป และโปรแกรมได้รับข้อมูลตอบรับมากขึ้น มันจะสามารถปรับปรุงแผน การรักษาสำหรับผู้ป่วยแต่ละรายได้ เทคนิคการโปรแกรมเหล่านี้ ไม่ได้มีวิธีไหนที่ชาญฉลาดกว่ากัน ในขณะที่บางวิธี ต้องมีมนุษย์เข้ามาเกี่ยวข้องด้วย แต่ละวิธีก็มีจุดอ่อนจุดแข็งของมันเอง ทำให้มันเหมาะสมที่สุด กับงานต่างประเภทกัน อย่างไรก็ตาม เมื่อนำมาใช้ร่วมกันแล้ว นักวิจัยจะสามารถสร้างระบบ ปัญญาประดิษฐ์ที่ซับซ้อน โดยให้โปรแกรมย่อยแต่ละตัว สามารถสอนกันเองได้ ตัวอย่างเช่น ในโปรแกรมเรียนรู้แบบไม่มีผู้สอน ทำการหากลุ่มของผู้ป่วย ที่มีความคล้ายคลึงกัน มันสามารถส่งข้อมูลไปยังโปรแกรมอื่น ที่ใช้การเรียนรู้แบบมีผู้สอน โปรแกรมนั้นก็จะนำข้อมูลที่ได้ มาเป็นส่วนหนึ่งของการทำนายผล หรืออาจใช้โปรแกรม เรียนรู้แบบให้รางวัลหลายสิบตัว เพื่อจำลองผลลัพธ์การตอบสนอง ของผู้ป่วยหลายๆ แบบ และเก็บรวบรวมผลตอบรับ ต่อแผนการรักษาที่แตกต่างกัน

There are numerous ways to create these machine-learning systems, and perhaps the most promising models are those that mimic the relationship between neurons in the brain. These artificial neural networks can use millions of connections to tackle difficult tasks like image recognition, speech recognition, and even language translation. However, the more self-directed these models become, the harder it is for computer scientists to determine how these self-taught algorithms arrive at their solution. Researchers are already looking at ways to make machine learning more transparent. But as AI becomes more involved in our everyday lives, these enigmatic decisions have increasingly large impacts on our work, health, and safety. So as machines continue learning to investigate, negotiate and communicate, we must also consider how to teach them to teach each other to operate ethically.

มีหลากหลายวิธีที่จะสร้าง ระบบการเรียนรู้ด้วยเครื่องเหล่านี้ โมเดลรูปแบบหนึ่ง ที่ให้ความหวังได้ดีที่สุด คือโมเดลที่จำลอง ความเชื่อมโยงกันของเซลประสาทในสมอง โครงข่ายประสาทเทียมเหล่านี้ อาจมีจุดเชื่อมต่อกันเป็นล้านๆ จุด เพื่อรับมือกับปัญหายากๆ อย่างเช่น การจดจำภาพถ่าย การจดจำคำพูด หรือแม้แต่การแปลภาษา อย่างไรก็ตาม ยิ่งเมื่อ โมเดลเหล่านี้มีการเรียนรู้เองมากเท่าใด ก็ยิ่งเป็นเรื่องยากมากขึ้น สำหรับนักคอมพิวเตอร์ ที่จะเข้าใจได้ว่าอัลกอริทึมที่สอนตัวเอง เหล่านี้ สร้างผลลัพธ์ออกมาได้อย่างไร นักวิจัยเริ่มมองหาหนทางต่างๆ ที่จะทำให้ การเรียนรู้ของเครื่องมีความโปร่งใสมากขึ้น แต่เมื่อปัญญาประดิษฐ์ เข้ามาเกี่ยวข้องมากขึ้นในชีวิตประจำวัน ปริศนาการตัดสินใจของเครื่อง ก็ส่งผลกระทบมากขึ้นเรื่อยๆ กับงาน สุขภาพ และความปลอดภัยของเรา ดังนั้นเมื่อเครื่องจักรเรียนรู้มากขึ้น ที่จะค้นหา ต่อรอง และสื่อสาร เราจำเป็นต้องหาหนทางที่จะสอนมัน ให้สอนตัวเองเพื่อทำงานอย่างมีจริยธรรม

Briana Brownell: How does artificial intelligence learn?

Briana Brownell: How does artificial intelligence learn?

Related talks

José Américano N L F de Freitas: How exactly does binary code work?

Patrick Lin: The ethical dilemma of self-driving cars

David J. Malan: What's an algorithm?

Related talks

José Américano N L F de Freitas: How exactly does binary code work?

Patrick Lin: The ethical dilemma of self-driving cars

David J. Malan: What's an algorithm?