How many of you had to fill out a web form where you've been asked to read a distorted sequence of characters like this? How many of you found it really annoying?
Koliko vas je nekada moralo da na internetu popuni neku vrstu obrasca gde ste morali da prekucate iskrivljeni niz znakova poput ovog? Koliko vas je pomislilo "Ovo je zaista iritirajuće"? Dobro. Odlično. Ja sam to izmislio.
(Laughter)
OK, outstanding. So I invented that.
(Smeh)
(Laughter)
Ili bio sam jedan od ljudi koji su to izmislili.
Or I was one of the people who did it. That thing is called a CAPTCHA. And it is there to make sure you, the entity filling out the form, are a human and not a computer program that was written to submit the form millions of times. The reason it works is because humans, at least non-visually-impaired humans, have no trouble reading these distorted characters, whereas programs can't do it as well yet. In the case of Ticketmaster, the reason you have to type these characters is to prevent scalpers from writing a program that can buy millions of tickets, two at a time.
To se zove CAPTCHA. I tu je kako bismo bili sigurni da je stvorenje koje je popunilo formular zapravo čovek, a ne kompjuterski program koji je napisan kako bi popunio milione ovakvih obrazaca. To funkcioniše jer ljudi, ili barem ljudi koji vide, nemaju problema da pročitaju ove izobličene znake, dok kompjuterski programi jednostavno još uvek to ne mogu da urade. Na primer, u slučaju Tiketmastera, razlog što morate da ukucate ove simbole je da biste sprečili tapkaroše da naprave program koji će im omogućiti da kupe milione ulaznica.
CAPTCHAs are used all over the Internet. And since they're used so often, a lot of times the sequence of random characters shown to the user is not so fortunate. So this is an example from the Yahoo registration page. The random characters that happened to be shown to the user were W, A, I, T, which, of course, spell a word. But the best part is the message that the Yahoo help desk got about 20 minutes later.
CAPTCHA se koristi širom interneta I budući da je to veoma često, mnogo puta konkretni niz slučajno odabranih simbola koji su pokazani korisniku nije naročito "srećan". Ovo je primer sa stranice za registraciju na Jahu (Yahoo). Slučajno odabrani karakteri prikazani korisniku su bili Č,E,K,A,J,T.E, što naravno daje reč. Ali najbolje od svega je što je dvadesetak minuta kasnije Jahu dobio poruku sledećeg sadržaja.
[Help! I've been waiting for over 20 minutes and nothing happens.]
"Pomozite! Čekam već 20 minuta i ništa se ne dešava."
(Laughter)
(Smeh)
This person thought they needed to wait. This, of course, is not as bad as this poor person.
Ova osoba je mislila da je trebalo da čeka. Ovo naravno i nije tako loše u poređenju sa ovom jadnom osobom.
(Laughter)
(Smeh)
CAPTCHA Project is something that we did at Carnegie Melllon over 10 years ago, and it's been used everywhere. Let me now tell you about a project that we did a few years later, which is sort of the next evolution of CAPTCHA. This is a project that we call reCAPTCHA, which is something that we started here at Carnegie Mellon, then we turned it into a start-up company. And then about a year and a half ago, Google actually acquired this company.
CAPTCHA projekat smo uradili ovde na Karnegi Melon univerzitetu pre više od deset godina, i koristi se svuda. Sada ću vam ispričati nešto o projektu koji smo uradili nekoliko godina kasnije i što je u neku ruku evolucija CAPTCHA-e. Ovaj projekat smo nazvali reCAPTCHA, započet je ovde u Karnegi Melonu ali se onda pretvorio u startap kompaniju. Zatim, pre oko godinu i po, Gugl je kupio ovu kompaniju,
Let me tell you what this project started. This project started from the following realization: It turns out that approximately 200 million CAPTCHAs are typed everyday by people around the world. When I first heard this, I was quite proud of myself. I thought, look at the impact my research has had. But then I started feeling bad. Here's the thing: each time you type a CAPTCHA, essentially, you waste 10 seconds of your time. And if you multiply that by 200 million, you get that humanity is wasting about 500,000 hours every day typing these annoying CAPTCHAs.
Reći ću vam kako je projekat počeo. Počelo je tako što smo shvatili sledeće: ispostavi se da ljudi širom sveta svakog dana ukucaju oko 200 miliona CAPTCHA kodova. Kad sam to čuo prvi put, bio sam prilično ponosan, misleći kako moje istraživanje ima ogroman uticaj. Ali onda sam počeo da se loše osećam. Jer... svaki put kad ukucate CAPTCHA vi izgubite deset sekundi svog života. I ako to pomnožite sa 200 miliona ispada da čovečanstvo troši oko 500.000 sati svakog dana ukucavajući te iritantne CAPTCHA kodove.
(Laughter)
I tako sam počeo da se osećam loše -
So then I started feeling bad.
(Smeh)
(Laughter)
A onda sam počeo da razmišljam, naravno da se ne možemo otarasiti CAPTCHA-e,
And then I started thinking, of course, we can't just get rid of CAPTCHAs, because the security of the web depends on them. But then I started thinking, can we use this effort for something that is good for humanity? So see, here's the thing. While you're typing a CAPTCHA, during those 10 seconds, your brain is doing something amazing. Your brain is doing something that computers cannot yet do. So can we get you to do useful work for those 10 seconds? Is there some humongous problem that we cannot yet get computers to solve, yet we can split into tiny 10-second chunks such that each time somebody solves a CAPTCHA, they solve a little bit of this problem? And the answer to that is "yes," and this is what we're doing now.
budući da bezbednost čitavog interneta zavisi od njih. Ali onda sam se zapitao postoji li način na koji možemo iskoristiti ovaj trud za nešto što je dobro za čovečanstvo? Zaključak je sledeći. Tokom tih desetak sekundi dok ukucavate CAPTCHA kod, vaš mozak radi nešto zaista zadivljujuće. On radi nešto što kompjuteri još uvek ne mogu da urade. Dakle, možemo li izvesti da za tih deset sekundi vi uradite nešto korisno? Drugim rečima, postoji li neki ogromni problem koji kompjuteri još uvek ne mogu da reše, a koji možemo izdeliti na malecke deonice od 10 sekundi tako da svaki put kad neko reši CAPTCHA kod reši mali deo tog problema? Odgovor na ovo je "da", i upravo to sada vi radite.
Nowadays, while you're typing a CAPTCHA, not only are you authenticating yourself as a human, but in addition you're helping us to digitize books. Let me explain how this works. There's a lot of projects trying to digitize books. Google has one. The Internet Archive has one. Amazon, with the Kindle, is trying to digitize books. Basically, the way this works is you start with an old book. You've seen those things, right? Like a book?
Ono što možda ne znate je da kada ukucate CAPTCHA, ne samo da ste dokazali da ste ljudsko biće već nam zapravo i pomažete da digitalizujemo knjige. Dozvolite mi da objasnim. Postoji mnogo projekata kojima je cilj da digitalizuju knjige. Gugl ima jedan. Internet arhiv ima jedan. Amazon, sada sa Kindlom takođe pokušava da digitalizuje knjige. To funkcioniše tako što počnete sa nekom starom knjigom. Znate te stvari, zar ne? Knjige? (Smeh)
(Laughter)
Uzmete knjigu i skenirate je.
So you start with a book and then you scan it.
Skeniranje knjige je kao
Now, scanning a book is like taking a digital photograph of every page. It gives you an image for every page. This is an image with text for every page of the book. The next step in the process is that the computer needs to be able to decipher the words in this image. That's using a technology called OCR, for optical character recognition, which takes a picture of text and tries to figure out what text is in there. Now, the problem is that OCR is not perfect. Especially for older books where the ink has faded and the pages have turned yellow, OCR cannot recognize a lot of the words. For things that were written more than 50 years ago, the computer cannot recognize about 30 percent of the words. So now we're taking all of the words that the computer cannot recognize and we're getting people to read them for us while they're typing a CAPTCHA on the Internet.
digitalno fotografisanje svake stranice knjige. Dobijete fotografiju svake stranice. I ovo je fotografija sa tekstom za svaku stranicu knjige. Sledeći korak u procesu je da kompjuter uspe da odgonetne sve reči na ovoj slici. Tu se koristi tehnologija pod imenom OCR (OPK). za optičko prepoznavanje karaktera, koja fotografiše tekst i pokušava da odgonetne šta tu piše. Problem je što ova tehnologija nije savršena. Naročito kada se radi o starijim knjigama gde je tinta izbledela i stranice požutele, pa OPK ne može da prepozna mnoge reči. Na primer, kod stvari koje su napisane pre više od 50 godina, kompjuter ne može da prepozna oko 30 odsto reči. I ono što tada radimo je da uzmemo sve reči koje kompjuter ne može da prepozna i pitamo ljude da ih pročitaju umesto nas dok ukucavaju CAPTCHA na internetu.
So the next time you type a CAPTCHA, these words that you're typing are actually words from books that are being digitized that the computer could not recognize. The reason we have two words nowadays instead of one is because one of the words is a word that the system just got out of a book, it didn't know what it was and it's going to present it to you. But since it doesn't know the answer, it cannot grade it. So we give you another word, for which the system does know the answer. We don't tell you which one's which and we say, please type both. And if you type the correct word for the one for which the system knows the answer, it assumes you are human and it also gets some confidence that you typed the other word correctly. And if we repeat this process to 10 different people and they agree on what the new word is, then we get one more word digitized accurately.
Tako da, sledeći put kad budete kucali CAPTCHA, te reči su zapravo reči koje dolaze iz knjiga koje se digitalizuju i koje kompjuter ne može da prepozna. A razlog što danas imamo dve, umesto jedne reči je zato što je jednu od reči sistem dobio iz knjige, nije je prepoznao i pokazaće vam je. Ali budući da ne zna odgovor, ne može ni vas da oceni. Zato vam dajemo još jednu reč, koju sistem prepoznaje. Ne kažemo vam koja je koja, već vas pitamo da upišete obe. I ako upišete tačno reč koju sistem već zna, on zaključuje da ste ljudsko biće i da ćete i drugu reč ukucati tačno. Ako ponovimo ovaj proces sa 10 različitih ljudi i svi oni se slože oko nove reči, dobijemo još jednu tačno digitalizovanu reč.
So this is how the system works. And since we released it about three or four years ago, a lot of websites have started switching from the old CAPTCHA, where people wasted their time, to the new CAPTCHA where people are helping to digitize books. So every time you buy tickets on Ticketmaster, you help to digitize a book. Facebook: Every time you add a friend or poke somebody, you help to digitize a book. Twitter and about 350,000 other sites are all using reCAPTCHA. And the number of sites that are using reCAPTCHA is so high that the number of words we're digitizing per day is really large. It's about 100 million a day, which is the equivalent of about two and a half million books a year. And this is all being done one word at a time by just people typing CAPTCHAs on the Internet.
Tako funkcioniše sistem. Počeo je da radi pre oko tri ili četiri godine, i mnogi sajtovi su već prešli sa stare CAPTCHA kodove gde su ljudi gubili vreme na novi CAPTCHA gde ljudi pomažu da se knjige digitalizuju. Dakle, na primer, Tiketmaster. Svaku put kada kupite karte na Tiketmasteru pomažete digitalizaciju knjiga. Fejsbuk: Svaku put kad dodate prijatelja ili nekog "pokujete", učestvujete u digitalizaciji knjiga. Tviter i oko 350.000 drugih sajtova koriste reCAPTCHA. I zapravo, broj tih sajtova je toliko veliki da je i broj reči digitalizovanih tokom jednog dana veoma veliki. Oko sto miliona dnevno, što je ekvivalent za oko dva i po miliona knjiga godišnje. I to sve reč po reč, samo uz pomoć ljudi koji ukucavaju CAPTCHA kodove na internetu. (Aplauz)
(Applause)
Naravno, budući da
Now, of course, since we're doing so many words per day, funny things can happen. This is especially true because now we're giving people two randomly chosen English words next to each other. So funny things can happen. For example, we presented this word. It's the word "Christians"; there's nothing wrong with it. But if you present it along with another randomly chosen word, bad things can happen. So we get this.
uradimo toliko reči u toku jednog dana, dešavaju se razne smešne stvari. I ovo je tako jer dajemo ljudima dve slučajno odabrane engleske reči jednu do druge. Pa se dese smešne stvari. Na primer, pojavila se ova reč - To je reč "Hrišćani" i nema tu ništa loše. Ali ako je uparite sa drugom slučajno odabranom reči, mogu se desiti loše kombinacije. Dakle dobili smo ovo. (Tekst: loši hrišćani)
[bad Christians]
Ali, najgore od svega je što se sve ovo desilo na sajtu
But it's even worse, because the website where we showed this actually happened to be called The Embassy of the Kingdom of God.
koji se zove "Ambasada kraljevstva božijeg". (Smeh)
(Laughter)
Ups.
Oops.
(Smeh)
(Laughter)
Evo još jednog lošeg primera.
Here's another really bad one. JohnEdwards.com
DžonEdvards.com (Tekst: Prokleti liberal)
[Damn liberal]
(Laughter)
(Smeh)
So we keep on insulting people left and right everyday. Of course, we're not just insulting people. Here's the thing. Since we're presenting two randomly chosen words, interesting things can happen. So this actually has given rise to a really big Internet meme that tens of thousands of people have participated in, which is called CAPTCHA art. I'm sure some of you have heard about it. Here's how it works. Imagine you're using the Internet and you see a CAPTCHA that you think is somewhat peculiar, like this CAPTCHA.
Tako da svakodnevno vređamo ljude i desne i leve orijentacije. Naravno, nije samo da ih vređamo. Budući da su reči slučajno uparene, mogu se desiti i interesantne stvari. Ovo je dovelo do velikog internet fenomena u kom su učestvovale na hiljade ljudi, a koji se zove CAPTCHA umetnost. Siguran sam da su neki od vas čuli za to. Evo kako radi. Zamislite da surfujete internetom i vidite CAPTCHA za koj mislite da je jedinstven, kao na primer ovaj CAPTCHA (Tekst: nevidljivi toster)
[invisible toaster]
Ono što bi trebalo da uradite je da sačuvate sliku sa ekrana.
What you're supposed to do is you take a screenshot of it. Then of course, you fill out the CAPTCHA because you help us digitize a book. But first you take a screenshot and then you draw something that is related to it.
A onda naravno popunite CAPTCHA jer nam tako pomažete da digitalizujemo knjige. Ali prvo sačuvate tu sliku, i onda docrtate nešto što je u vezi sa tim.
(Laughter)
(Smeh)
That's how it works.
Tako to funkcioniše.
(Laughter)
Postoje na desetine hiljada ovoga.
There are tens of thousands of these. Some of them are very cute.
Neke od njih su veoma simpatične. (Tekst: stegnuo sam ga)
[clenched it]
(Smeh)
(Laughter)
Neke su smešnije.
Some of them are funnier.
(Tekst: drogirani osnivači)
[stoned Founders]
(Laughter)
(Smeh)
And some of them, like paleontological shvisle ...
A neke od njih,
(Laughter)
kao "paleontološki švizl",
they contain Snoop Dogg.
imaju Snup Doga.
(Laughter)
(Smeh)
OK, so this is my favorite number of reCAPTCHA. So this is the favorite thing that I like about this whole project. This is the number of distinct people that have helped us digitize at least one word out of a book through reCAPTCHA: 750 million, a little over 10 percent of the world's population, has helped us digitize human knowledge. And it is numbers like these that motivate my research agenda. So the question that motivates my research is the following: If you look at humanity's large-scale achievements, these really big things that humanity has gotten together and done historically -- like, for example, building the pyramids of Egypt or the Panama Canal or putting a man on the Moon -- there is a curious fact about them, and it is that they were all done with about the same number of people. It's weird; they were all done with about 100,000 people. And the reason for that is because, before the Internet, coordinating more than 100,000 people, let alone paying them, was essentially impossible. But now with the Internet, I've just shown you a project where we've gotten 750 million people to help us digitize human knowledge. So the question that motivates my research is, if we can put a man on the Moon with 100,000, what can we do with 100 million?
Ok, ovo je moj omiljeni broj reCAPTCHA-e. Ovo je nešto što mi se najviše sviđa u vezi sa celim projektom. Ovo je broj različitih ljudi koji su nam pomogli da digitalizujemo makar jednu reč neke knjige kroz reCAPTCHA: 750 miliona što znači da nam je oko 10 odsto celokupne svetske populacije pomoglo da digitalizujemo ljudsko znanje. Ovo je jedan od brojeva koji me motivišu da istražujem dalje. Dakle pitanje koje me motiviše da nastavim je sledeće: ako posmatrate grupna dostignuća čovečanstva, one zaista velike stvari oko kojih se čovečanstvo okupilo i uradilo nešto kao, na primer, gradnju piramida u Egiptu ili Panamskog kanala ili slanje čoveka na Mesec -- zanimljiva činjenica u vezi sa njima je da su svi bili urađeni sa otprilike istim brojem ljudi. Čudno; svi oni su sprovedeni sa oko 100.000 ljudi. A razlog za to je što je pre postojanja interneta koordinisanje više od 100.000 ljudi, a kamoli plaćanje, bilo prosto nemoguće. Ali sada, uz pomoć interneta, upravo sam vam pokazao projekat gde smo okupili 750 miliona ljudi koji su nam pomogli da digitalizujemo ljudsko znanje. Dakle, ono što me motiviše prilikom istraživanja je, što ako sa 100.000 možemo da pošaljemo čoveka na Mesec šta tek možemo da uradimo sa 100 miliona?
So based on this question, we've had a lot of different projects that we've been working on. Let me tell you about one that I'm most excited about. This is something that we've been semiquietly working on for the last year and a half or so. It hasn't yet been launched. It's called Duolingo. Since it hasn't been launched, shhh!
I tako bazirano na ovom pitanju počeli smo da radimo na mnogo različitih projekata. Pričaću vam o jednom oko kog sam najviše uzbuđen. To je nešto na čemu radimo polutajno u poslednjih godinu i po dana. Još ga nismo lansirali. Zove se Duolingo. Budući da nije lansiram, psssst!
(Laughter)
(Smeh)
Yeah, I can trust you'll do that. So this is the project. Here's how it started. It started with me posing a question to my graduate student, Severin Hacker. OK, that's Severin Hacker. So I posed the question to my graduate student. By the way, you did hear me correctly; his last name is Hacker.
Da, verujem da ćete ćutati. Dakle ovo je projekat. Evo kako je počeo. Počeo je tako što sam postavio pitanje mom studentu, Severinu Hakeru. Ok, ovo je Severin Haker. Dakle, postavio sam mu pitanje. Usput, dobro ste čuli, njegovo prezime je Haker.
(Laughter)
Pitao sam ga: kako možemo
So I posed this question to him: How can we get 100 million people translating the web into every major language for free? There's a lot of things to say about this question. First of all, translating the web. Right now, the web is partitioned into multiple languages. A large fraction of it is in English. If you don't know English, you can't access it. But there's large fractions in other different languages, and if you don't know them, you can't access it. So I would like to translate all of the web, or at least most of it, into every major language. That's what I would like to do.
da postignemo da nam 100 miliona ljudi prevede internet na svaki važniji svetski jezik, besplatno? Ok, prvo moram nešto da kažem o ovom problemu. Prvo, prevod interneta. Trenutno je internet podeljen na mnogo jezika. Veliki deo je na engleskom. Ako ne znate engleski, ne možete pristupiti podacima. Ali postoje i veliki delovi na drugim jezicima, i ako ne znate te druge jezike, ne možete pristupiti. Dakle, voleo bih da prevedem ceo internet, ili makar njegov veliki deo, na svaki važan jezik. To bih voleo da uradim.
Now, some of you may say, why can't we use computers to translate? Machine translation is starting to translate some sentences here and there. Why can't we use it to translate the web? The problem with that is it's not yet good enough and it probably won't be for the next 15 to 20 years. It makes a lot of mistakes. Even when it doesn't, since it makes so many mistakes, you don't know whether to trust it or not.
Sada neki od vas mogu reći, ali zašto ne kortistite kompjutere za prevod? Zašto ne koristimo mašine za prevod? Kompjutersko prevođenje prevede tu i tamo poneku rečenicu. Zašto ih ne koristimo da prevedu ceo internet? Problem je što i dalje nisu dovoljno dobri, i verovatno neće biti u narednih 15 ili 20 godina. Prave mnogo grešaka. Čak i kada ne naprave grešku, budući da ih često prave, kako da znamo kada je prevod tačan a kada ne.
So let me show you an example of something that was translated with a machine. Actually, it was a forum post. It was somebody who was trying to ask a question about JavaScript. It was translated from Japanese into English. So I'll just let you read. This person starts apologizing for the fact that it's translated with a computer. So the next sentence is going to be the preamble to the question. So he's just explaining something. Remember, it's a question about JavaScript.
Daću vam primer nečega što je preveo kompjuter. Zapravo to je bio post na forumu. Neko ko je pokušavao da pita nešto o jeziku Java. Bilo je prevedeno sa japanskog na engleski. Daću vam da pročitate. Osoba počinje izvinjenjem zbog činjenice da se radi o kompjuterskom prevodu. Dakle sledeća rečenica je uvod u pitanje. On nešto objašnjava. Upamtite, pitanje je o Java programskom jeziku.
[At often, the goat-time install a error is vomit.]
(Tekst: Na često, koza-vreme instaliranja greška je bljuvanje)
(Laughter)
(Smeh)
Then comes the first part of the question.
Onda dolazi prvi deo pitanja.
[How many times like the wind, a pole, and the dragon?]
(Tekst: Koliko puta kao vetar, motka i zmaj?)
(Laughter)
(Smeh)
Then comes my favorite part of the question.
A onda dolazi moj omiljeni deo pitanja.
[This insult to father's stones?]
(Tekst: Ova uvreda očevom kamenju?)
(Laughter)
(Smeh)
And then comes the ending, which is my favorite part of the whole thing.
A onda dolazi kraj, koji mi je omiljeni deo cele stvari. (Tekst: Molimo vas da se izvinite za svoju glupost. Postoje mnoge hvala)
[Please apologize for your stupidity. There are a many thank you.]
(Smeh)
(Laughter)
Dobro, znači kompjuterski prevod, još uvek nedovoljno dobar.
OK, so computer translation, not yet good enough. So back to the question. So we need people to translate the whole web. So now the next question you may have is, well, why can't we just pay people to do this? We could pay professional translators to translate the whole web. We could do that. Unfortunately, it would be extremely expensive. For example, translating a tiny fraction of the whole web, Wikipedia, into one other language, Spanish. OK? Wikipedia exists in Spanish, but it's very small compared to the size of English. It's about 20 percent of the size of English. If we wanted to translate the other 80 percent into Spanish, it would cost at least 50 million dollars -- and this is even at the most exploited, outsourcing country out there. So it would be very expensive. So what we want to do is, we want to get 100 million people translating the web into every major language for free.
Nazad na pitanje. Trebaju nam ljudi da prevedemo ceo internet. Sledeće što biste mogli pitati je: zašto jednostavno ne platimo ljudima da to urade? Mogli bismo platiti profesionalnim prevodiocima da to urade. Mogli bismo. Nažalost, to bi bilo preskupo. Na primer, prevod samo malog dela interneta, Vikipedije, u neki drugi jezik, na primer španski, Vikipedija postoji na španskom, ali je veoma mala u poređenju sa onom na engleskom. Predstavlja oko 20 odsto engleske verzije. Ako bismo hteli da prevedemo na španski tih 80 odsto, koštalo bi najmanje 50 miliona dolara -- i to za najiskorišćeniju zemlju koja postoji. Dakle, bilo bi mnogo skupo. I zato želimo da u to uključimo 100 miliona ljudi koji će prevesti internet na svaki veći jezik besplatno.
If this is what you want to do, you quickly realize you're going to run into two big hurdles, two big obstacles. The first one is a lack of bilinguals. So I don't even know if there exists 100 million people out there using the web who are bilingual enough to help us translate. That's a big problem. The other problem you're going to run into is a lack of motivation. How are we going to motivate people to actually translate the web for free? Normally, you have to pay people to do this. So how are we going to motivate them to do it for free? When we were starting to think about this, we were blocked by these two things. But then we realized, there's a way to solve both these problems with the same solution. To kill two birds with one stone. And that is to transform language translation into something that millions of people want to do and that also helps with the problem of lack of bilinguals, and that is language education.
Ako je ovo ono što hoćete da uradite vrlo brzo shvatate da ćete naići na dve stvari koje će to otežati, dve velike prepreke. Prva je manjak ljudi koji govore dva jezika. Ni ne znam da li postoje 100 miliona ljudi koji koriste internet koji govore dva jezika dovoljno dobro da nam pomognu sa prevodima. To je veliki problem. Drugi problem je nedostatak motivacije. Kako ćemo motivisati ljude da zaista prevedu internet besplatno? Obično platite ljudima za to. Kako ćemo ih onda motivisati da to urade besplatno? Kada smo počeli da razmišljamo o svemu, ove dve stvari su nas kočile. Ali onda smo shvatili da postoji način i da postoji jedno rešenje za oba problema. Jednim udarcem ćemo ubiti dve muve. A to je da prevođenje pretvorimo u nešto što milioni ljudi hoće da rade i to nam takođe pomaže sa problemom manjka bilingvista a to je učenje jezika.
So it turns out that today, there are over 1.2 billion people learning a foreign language. People really want to learn a foreign language. And it's not just because they're being forced to do so in school. In the US alone, there are over five million people who have paid over $500 for software to learn a new language. So people really want to learn a new language. So what we've been working on for the last year and a half is a new website -- it's called Duolingo -- where the basic idea is people learn a new language for free while simultaneously translating the web. And so basically, they're learning by doing.
Ispostavilo se da danas imate preko 1,2 milijarde ljudi koji uče strani jezik. Ljudi zaista žele da nauče strane jezike. I to ne samo zato što ih u školama teraju da to rade. Na primer, samo u SAD postoji preko pet miliona ljudi koji su platili više od 500 dolara za softvere za učenje stranih jezika. Dakle ljudi zaista žele da nauče nove jezike. Tako smo u poslednjih godinu i po dana radili na novom sajtu zove se Duolingo -- gde je osnovna ideja da ljudi besplatno uče jezik dok istovremeno prevode internet. I zapravo oni uče dok prevode.
So the way this works is whenever you're a just a beginner, we give you very simple sentences. There's a lot of very simple sentences on the web. We give you very simple sentences along with what each word means. And as you translate them and as you see how other people translate them, you start learning the language. And as you get more advanced, we give you more complex sentences to translate. But at all times, you're learning by doing.
Način na koji ovo radi je da, dok ste tek početnik, dobijate veoma jednostavne rečenice. Naravno, na internetu ih ima mnogo. Damo vam veoma, veoma jednostavne rečenice zajedno sa značenjem svake reči. I dok ih prevodite i dok vidite kako ih drugi prevode, počinjete da učite jezik. Sa vašim napretkom dajemo vam sve kompleksnije rečenice na prevod. Ali sve vreme učite dok radite. Luda stvar u vezi sa ovim metodom
Now, the crazy thing about this method is that it actually really works. People are really learning a language. We're mostly done building it and now we're testing it. People really can learn a language with it. And they learn it about as well as the leading language learning software. So people really do learn a language. And not only do they learn it as well, but actually it's more interesting. Because with Duolingo, people are learning with real content. As opposed to learning with made-up sentences, people are learning with real content, which is inherently interesting. So people really do learn a language.
je da on zapravo funkcioniše. Prvo, ljudi zaista uče jezik. Projekat je gotovo završen i sada je u fazi testiranja. Ljudi tako zaista mogu naučiti jezik. I nauče ga jednako dobro kao uz pomoć vodećih programa za učenje jezika. Dakle, ljudi zaista nauče jezik. I ne samo da ga nauče, već je i mnogo interesantnije. Jer, uz Duolingo, ljudi uče koristeći realne sadržaje. Nasuprot učenju kroz izmišljene rečenice, uči se na stvarnim sadržajima, što je samo po sebi zanimljivije. Dakle ljudi zaista uče jezik.
But perhaps more surprisingly, the translations that we get from people using the site, even though they're just beginners, the translations that we get are as accurate as those of professional language translators, which is very surprising. So let me show you one example. This is a sentence that was translated from German into English. The top is the German. The middle is an English translation that was done by a professional translator who we paid 20 cents a word for this translation. And the bottom is a translation by users of Duolingo, none of whom knew any German before they started using the site. If you can see, it's pretty much perfect. Of course, we play a trick here to make the translations as good as professional language translators. We combine the translations of multiple beginners to get the quality of a single professional translator.
Možda su još više iznenađuje da su prevodi koje smo dobili od ljudi koji koriste sajt, iako su početnici, bili tačni koliko i oni koje su radili profesionalni prevodioci, što je veoma iznenađujuće. Daću vam još jedan primer. Ovo je rečenica prevedena sa nemačkog na engleski. Gore je nemački. U sredini je engleski prevod koji je uradio profesionalni prevodilac, koga plaćamo 20 centi po reči. A dole je prevod urađen od strane korisnika Duolinga, od kojih nijedan nije znao ni malo nemačkog pre nego što su počeli da koriste sajt. Možete i sami videti, savršeno je. Naravno, ovde koristimo trik da bi prevodi bili dobri koliko i profesionalni. Kombinujemo prevode više početnika kako bismo dobili kvalitet jednog profesionalca.
Now, even though we're combining the translations, the site actually can translate pretty fast. So let me show you, this is our estimates of how fast we could translate Wikipedia from English into Spanish. Remember, this is 50 million dollars' worth of value. So if we wanted to translate Wikipedia into Spanish, we could do it in five weeks with 100,000 active users. And we could do it in about 80 hours with a million active users. Since all the projects my group has worked on so far have gotten millions of users, we're hopeful that we'll be able to translate extremely fast.
Iako kombinujemo prevode sajt se prevodi prilično brzo. Pokazaću vam, naše procene koliko brzo možemo da prevedemo Vikipediju sa engleskog na španski. Zapamtite, to košta 50 miliona dolara. Dakle, ako bismo hteli da prevedemo Vikipediju na španski, mogli bismo to uraditi za pet nedelja sa 100.000 aktivnih korisnika. I mogli bismo to uraditi za 80 sati sa milion aktivnih korisnika. Budući da su svi projekti na kojima je moja grupa radila uspeli da uključe milione korisnika nadamo se da ćemo kroz ovaj projekat uspeti da prevodimo uspeti da prevodimo prilično brzo.
Now, the thing that I'm most excited about with Duolingo is I think this provides a fair business model for language education. So here's the thing: The current business model for language education is the student pays, and in particular, the student pays Rosetta Stone 500 dollars.
Sada, stvar oko koje sam najuzbuđeniji oko Duolinga je što mislim da obezbeđuje fer biznis model za učenje jezika. Evo o čemu se radi: trenutni model za učenje stranih jezika je takav da student plaća, i na primer, učenik plaća za Rozeta Stoun 500 dolara.
(Laughter)
(Smeh)
That's the current business model. The problem with this business model is that 95 percent of the world's population doesn't have 500 dollars. So it's extremely unfair towards the poor. This is totally biased towards the rich. Now, see, in Duolingo, because while you learn, you're actually creating value, you're translating stuff -- which, for example, we could charge somebody for translations, so this is how we could monetize this. Since people are creating value while they're learning, they don't have to pay with their money, they pay with their time. But the magical thing here is that is time that would have had to have been spent anyways learning the language. So the nice thing about Duolingo is, I think, it provides a fair business model -- one that doesn't discriminate against poor people.
To je postojeći model poslovanja. Problem sa tim je što 95 odsto svetske populacije nema 500 dolara. Tako da je potpuno nefer prema siromašnima. Totalno je usmereno na bogate. A u Duolingu dok učite zapravo stvarate vrednost, prevodite stvari -- za šta bismo mogli da nekog platimo. Ovo je način da to unovčimo. Budući da ljudi stvaraju vrednost dok uče oni ne moraju da plate svojim novcem, već plaćaju svojim vremenom. Ali magična stvar je da oni plaćaju svojim vremenom, a to je vreme koje bi bilo potrošeno svakako na učenje jezika. Dakle, dobra stvar kod Duolinga je da obezbeđuje fer biznis model -- koji ne diskriminiše siromašne. Evo i sajta. Hvala.
So here's the site. Thank you.
(Aplauz)
(Applause)
Evo ga i sajt.
We haven't yet launched, but if you go there, you can sign up to be part of our private beta, which is probably going to start in three or four weeks. We haven't yet launched it.
Nije još pušten u rad, ali ako ga posetite možete sa upisati kako biste bili deo naše "private beta" koja će sigurno početi za tri ili četiri nedelje. Nismo još lansirali ovaj Duolingo.
By the way, I'm the one talking here, but Duolingo is the work of a really awesome team,
Usput, ja ovde pričam, ali zapravo Duolingo je proizvod zaista fenomenalnog tima, a neki od njih su ovde.
some of whom are here. So thank you.
Hvala vam.
(Applause)
(Aplauz)