Briana Brownell: How does artificial intelligence learn?

Today, artificial intelligence helps doctors diagnose patients, pilots fly commercial aircraft, and city planners predict traffic. But no matter what these AIs are doing, the computer scientists who designed them likely don’t know exactly how they’re doing it. This is because artificial intelligence is often self-taught, working off a simple set of instructions to create a unique array of rules and strategies. So how exactly does a machine learn?

こんにちAI(人工知能)は医師による患者の診断やパイロットによる商用航空機の操縦や都市計画者による交通量予測を支援しますしかしAIがどんな仕事をするにしても AIを設計するコンピュータ科学者は AIが何をしているかを正確に把握している訳ではないようですこれはAIがしばしば自己学習し単純な一連の命令から一連のユニークなルールと戦略を生成するからですでは実際に機械はどのように学習するのでしょうか？

There are many different ways to build self-teaching programs. But they all rely on the three basic types of machine learning: unsupervised learning, supervised learning, and reinforcement learning. To see these in action, let’s imagine researchers are trying to pull information from a set of medical data containing thousands of patient profiles.

自己学習型のプログラムを組む方法は多くありますしかしすべての方法が機械学習の３つの基本的な型に依存しています教師なし学習、教師あり学習そして強化学習の３つですこれらが動作する様子を見てみましょう数千人もの患者のプロファイルを含む医療データから研究者が情報を引き出そうとしていたとします

First up, unsupervised learning. This approach would be ideal for analyzing all the profiles to find general similarities and useful patterns. Maybe certain patients have similar disease presentations, or perhaps a treatment produces specific sets of side effects. This broad pattern-seeking approach can be used to identify similarities between patient profiles and find emerging patterns, all without human guidance.

まずは教師なし学習ですこの手法はすべてのプロファイルを分析して一般的な類似性や有益なパターンを見つけるのに理想的なものですある患者たちは似通った病状を示しているかも知れませんしもしかするとある治療法が一連の副作用を引き起こしているかもしれませんこの大規模なパターン検索の手法は患者のプロファイルから類似性を識別し発病パターンの見出すのに利用できますがまったく人間の手を介しません

But let's imagine doctors are looking for something more specific. These physicians want to create an algorithm for diagnosing a particular condition. They begin by collecting two sets of data— medical images and test results from both healthy patients and those diagnosed with the condition. Then, they input this data into a program designed to identify features shared by the sick patients but not the healthy patients. Based on how frequently it sees certain features, the program will assign values to those features’ diagnostic significance, generating an algorithm for diagnosing future patients. However, unlike unsupervised learning, doctors and computer scientists have an active role in what happens next. Doctors will make the final diagnosis and check the accuracy of the algorithm’s prediction. Then computer scientists can use the updated datasets to adjust the program’s parameters and improve its accuracy. This hands-on approach is called supervised learning.

しかし医師が何かもっと具体的なことを求めていたとしましょうその医師たちはある特定の疾患を診断するアルゴリズムを作りたいのだとします彼らは２種類のデータを集めることから始めます医療用画像データと試験結果のデータを健康な患者と疾患があると診断された患者から集めます健康な患者にはなく患者に共通する特徴を識別するように設計されたプログラムにこれらのデータを入力しますプログラムは各特徴量をどれくらい参照するかによって診断における特徴量の重要度を数値化して将来患者を診断する際に使うアルゴリズムを生成するのですしかしながら教師なし学習とは異なり医師とコンピュータ科学者は次の段階で積極的な役割を担います医師は最終的な診断を行いアルゴリズムの下した予測に対して正確さを確認します次にコンピュータ科学者は更新されたデータを使って確度を向上させるためにプログラムのパラメータを修正しますこの実践的な手法を教師あり学習といいます

Now, let’s say these doctors want to design another algorithm to recommend treatment plans. Since these plans will be implemented in stages, and they may change depending on each individual's response to treatments, the doctors decide to use reinforcement learning. This program uses an iterative approach to gather feedback about which medications, dosages and treatments are most effective. Then, it compares that data against each patient’s profile to create their unique, optimal treatment plan. As the treatments progress and the program receives more feedback, it can constantly update the plan for each patient. None of these three techniques are inherently smarter than any other. While some require more or less human intervention, they all have their own strengths and weaknesses which makes them best suited for certain tasks. However, by using them together, researchers can build complex AI systems, where individual programs can supervise and teach each other. For example, when our unsupervised learning program finds groups of patients that are similar, it could send that data to a connected supervised learning program. That program could then incorporate this information into its predictions. Or perhaps dozens of reinforcement learning programs might simulate potential patient outcomes to collect feedback about different treatment plans.

ここで治療計画を提案するアルゴリズムを医師たちが求めていたとします治療計画は段階的に導入され治療による個々の患者の反応により治療計画を変更する必要があるため医師たちは強化学習を使うことにしますこのプログラムではどのような処方投薬量、治療が最も効果的かについてフィードバックを行う反復的な手法を使います次に個々の患者に適した治療計画を作成するためにデータを個々の患者のプロファイルと照合します治療が進められプログラムがフィードバックを得るごとに個々の患者の治療計画をプログラムは継続的に更新していきますこれら３つの手法はどれが一番優れているということはありません人間の介在をいくらか必要とする手法もありますしいずれの手法にもそれぞれ強みと弱みがあるので各作業に対して最善な手法を選ぶことになりますしかし３つの手法を組み合わせることで研究者は複雑なAIシステムを作り上げることができますこれは各プログラムが互いに教師となり訓練することで作られます例えば教師なし学習のプログラムが類似する患者の集団を見つけるとこれに連結した教師あり学習プログラムにその情報を送ることができて教師あり学習プログラムは情報を予測に取り込めるでしょうあるいは何十もの強化学習のプログラムが異なる治療計画のフィードバックを集めるために起こりうる治療結果をシミュレーションすることもあるでしょう

There are numerous ways to create these machine-learning systems, and perhaps the most promising models are those that mimic the relationship between neurons in the brain. These artificial neural networks can use millions of connections to tackle difficult tasks like image recognition, speech recognition, and even language translation. However, the more self-directed these models become, the harder it is for computer scientists to determine how these self-taught algorithms arrive at their solution. Researchers are already looking at ways to make machine learning more transparent. But as AI becomes more involved in our everyday lives, these enigmatic decisions have increasingly large impacts on our work, health, and safety. So as machines continue learning to investigate, negotiate and communicate, we must also consider how to teach them to teach each other to operate ethically.

機械学習のシステムを作る様々な方法がありますが恐らくもっとも有望なモデルは脳内のニューロン同士の関係を模倣したものでしょうそのような人工ニューラルネットワークは数百万の接続を利用して画像認識、音声認識、さらには翻訳といった難しい作業に取り組めますでもこのようなモデルが自己学習を進めるほどコンピューター科学者が自己学習型アルゴリズムによる解の導出過程を特定することは難しくなります科学者はすでに機械学習をもっと可視的にする方法を探しています AIが私たちの日々の生活に入り込んできますが不可解な判断が仕事や健康や安全に与える影響が大きくなっています機械がデータの調査、計算や通信方法を継続的に学習している中で機械同士が倫理的な作動方法を教えあう手段を人間が教え込む方法も考えなければなりません

Briana Brownell: How does artificial intelligence learn?

Briana Brownell: How does artificial intelligence learn?

Related talks

José Américano N L F de Freitas: How exactly does binary code work?

Patrick Lin: The ethical dilemma of self-driving cars

David J. Malan: What's an algorithm?

Related talks

José Américano N L F de Freitas: How exactly does binary code work?

Patrick Lin: The ethical dilemma of self-driving cars

David J. Malan: What's an algorithm?