How many of you had to fill out a web form where you've been asked to read a distorted sequence of characters like this? How many of you found it really annoying?
Koliko vas je moralo ispuniti neku vrstu mrežnog obrasca gdje su vas tražili da čitate iskrivljeni niz znakova poput ovog? Koliko vas smatra ovo stvarno, stvarno dosadnim? U redu, izvanredno. Ja sam to izmislio.
(Laughter)
OK, outstanding. So I invented that.
(Smijeh)
(Laughter)
Ili sam bio jedan od ljudi koji su to napravili.
Or I was one of the people who did it. That thing is called a CAPTCHA. And it is there to make sure you, the entity filling out the form, are a human and not a computer program that was written to submit the form millions of times. The reason it works is because humans, at least non-visually-impaired humans, have no trouble reading these distorted characters, whereas programs can't do it as well yet. In the case of Ticketmaster, the reason you have to type these characters is to prevent scalpers from writing a program that can buy millions of tickets, two at a time.
Ta se stvar zove Captcha. I razlog zbog kojeg je tu jest da ste vi, entitet koji ispunjava obrazac, doista čovjek, a ne neka vrsta kompjuterskog programa napisana da podnosi obrazac milijune i milijune puta. Ona djeluje zato što ljudi, barem oni koji nemaju oštećen vid, nemaju problema s čitanjem tih iskrivljenih vijugavih znakova, dok to kompjutorski programi jednostavno još ne mogu tako dobro. Npr., u slučaju Ticketmastera, razlog zbog kojeg morate tipkati te iskrivljene znakove jest da se spriječe preprodavači da napišu program koji može kupiti milijune ulaznica, dvije odjednom.
CAPTCHAs are used all over the Internet. And since they're used so often, a lot of times the sequence of random characters shown to the user is not so fortunate. So this is an example from the Yahoo registration page. The random characters that happened to be shown to the user were W, A, I, T, which, of course, spell a word. But the best part is the message that the Yahoo help desk got about 20 minutes later.
Captche se koriste na cijelom Internetu. I budući da se koriste tako često, puno puta precizan niz nasumičnih znakova koje vidi korisnik nije tako sretan. Ovo je primjer Yahooove stranice za registraciju. Nasumični znakovi koji se pokazuju korisniku bili su W, A, I, T, koji, naravno, sriču riječ (čekaj). No najbolji dio je poruka koju je Yahooov odjel za pomoć dobio oko 20 minuta kasnije.
[Help! I've been waiting for over 20 minutes and nothing happens.]
Tekst: "Upomoć! Čekam već 20 minuta, i ništa se ne događa."
(Laughter)
(Smijeh)
This person thought they needed to wait. This, of course, is not as bad as this poor person.
Ovaj je čovjek mislio da mora čekati. Ovo, naravno, nije tako loše kao ovaj jadnik.
(Laughter)
(Smijeh)
CAPTCHA Project is something that we did at Carnegie Melllon over 10 years ago, and it's been used everywhere. Let me now tell you about a project that we did a few years later, which is sort of the next evolution of CAPTCHA. This is a project that we call reCAPTCHA, which is something that we started here at Carnegie Mellon, then we turned it into a start-up company. And then about a year and a half ago, Google actually acquired this company.
Projekt Captcha je nešto što smo napravili ovdje na sveučilištu Carnegie Mellon prije 10 godina, i otada se koristi posvuda. Sad ću govoriti o projektu koji smo napravili prije nekoliko godina, nešto nalik sljedećoj evoluciji Captche. Ovo je projekt koji nazivamo reCaptcha, što je nešto što smo započeli ovdje na Carnegie Mellonu, zatim smo to pretvorili u malu tvrtku. I nakon godine i pol, Google je zapravo kupio ovu tvrtku.
Let me tell you what this project started. This project started from the following realization: It turns out that approximately 200 million CAPTCHAs are typed everyday by people around the world. When I first heard this, I was quite proud of myself. I thought, look at the impact my research has had. But then I started feeling bad. Here's the thing: each time you type a CAPTCHA, essentially, you waste 10 seconds of your time. And if you multiply that by 200 million, you get that humanity is wasting about 500,000 hours every day typing these annoying CAPTCHAs.
Reći ću vam što je pokrenuo ovaj projekt. Dakle, ovaj projekt je krenuo od sljedeće realizacije: Ispada da otprilike 200 milijuna Captcha tipkaju ljudi svaki dan diljem svijeta. Kad sam prvi put čuo za to, bio sam poprilično ponosan na sebe. Mislio sam, pogledaj na utjecaj koje je imalo moje istraživanje. Ali zatim sam se počeo osjećati loše. Evo o čemu se radi, svaki put kad tipkate Captchu, u biti gubite 10 sekundi svog vremena. I ako to umnožite s 200 milijuna, dobivate da čovječanstvo u cijelom gubi oko 500.000 sati svaki dan tipkajući te dosadne Captche.
(Laughter)
I tad sam se počeo osjećati loše.
So then I started feeling bad.
(Smijeh)
(Laughter)
I tad sam počeo razmišljati, pa, naravno, ne možemo se samo riješiti Captchi,
And then I started thinking, of course, we can't just get rid of CAPTCHAs, because the security of the web depends on them. But then I started thinking, can we use this effort for something that is good for humanity? So see, here's the thing. While you're typing a CAPTCHA, during those 10 seconds, your brain is doing something amazing. Your brain is doing something that computers cannot yet do. So can we get you to do useful work for those 10 seconds? Is there some humongous problem that we cannot yet get computers to solve, yet we can split into tiny 10-second chunks such that each time somebody solves a CAPTCHA, they solve a little bit of this problem? And the answer to that is "yes," and this is what we're doing now.
zato što sigurnost Mreže na neki način ovisi o njima. Ali zatim sam počeo razmišljati, postoji li bilo koji način na koji možemo upotrijebiti ovaj trud za nešto što je dobro za čovječanstvo? Gledajte, evo o čemu se radi. Dok tipkate Captchu, tijekom tih 10 sekundi, vaš mozak radi nešto nevjerojatno. Vaš mozak radi nešto što kompjuteri još ne mogu. Pa, možete li napraviti koristan posao za tih 10 sekundi? Drugim riječima, postoji li neki grdan problem koji kompjuteri još ne mogu riješiti, ali možemo podijeliti na sićušne dijelove od 10 sekundi tako da svaki put kad netko rješava Captchu riješi i mali dio ovog problema? I odgovor na to je "da", i to upravo radimo.
Nowadays, while you're typing a CAPTCHA, not only are you authenticating yourself as a human, but in addition you're helping us to digitize books. Let me explain how this works. There's a lot of projects trying to digitize books. Google has one. The Internet Archive has one. Amazon, with the Kindle, is trying to digitize books. Basically, the way this works is you start with an old book. You've seen those things, right? Like a book?
Ono što možda ne znate je da danas, dok tipkate Captchu, ne samo da se identificirate kao čovjek, nego nam također zapravo pomažete digitalizirati knjige. Objasnit ću vam kako to funkcionira. Postoji puno projekata koji pokušavaju digitalizirati knjige. Google ima jedan. Internetski arhiv ima jedan. Amazon, sad s Kindleom, pokušava digitalizirati knjige. U osnovi to funkcionira na način da započinjete sa starom knjigom. Vidjeli ste te stvari, točno? Poput knjige? (Smijeh)
(Laughter)
Započinjete s knjigom, zatim ju skenirate.
So you start with a book and then you scan it.
Sad, skeniranje knjige je
Now, scanning a book is like taking a digital photograph of every page. It gives you an image for every page. This is an image with text for every page of the book. The next step in the process is that the computer needs to be able to decipher the words in this image. That's using a technology called OCR, for optical character recognition, which takes a picture of text and tries to figure out what text is in there. Now, the problem is that OCR is not perfect. Especially for older books where the ink has faded and the pages have turned yellow, OCR cannot recognize a lot of the words. For things that were written more than 50 years ago, the computer cannot recognize about 30 percent of the words. So now we're taking all of the words that the computer cannot recognize and we're getting people to read them for us while they're typing a CAPTCHA on the Internet.
kao da snimate digitalnu fotografiju svake stranice knjige. Daje vam sliku za svaku stranicu knjige. Ovo je slika s tekstom za svaku stranicu knjige. Slijedeći je korak u postupku taj da kompjuter mora biti kadar dešifrirati sve riječi u toj slici. Koristeći tehnologiju OCR, optičko prepoznavanje znakova, koja snima sliku teksta i pokušava odgonetnuti koji je tekst unutra. Sad, OCR nije savršen. Posebno za starije knjige gdje je tinta izblijedjela i stranice su požutjele, OCR ne može prepoznati puno riječi. Npr., stvari koje su bile napisane prije više od 50 godina, kompjuter ne može prepoznati oko 30 posto riječi. Ono što sad radimo jest uzimanje svih riječi koje kompjuter ne može prepoznati i davanje ljudima da ih čitaju za nas dok tipkaju Captchu na Internetu.
So the next time you type a CAPTCHA, these words that you're typing are actually words from books that are being digitized that the computer could not recognize. The reason we have two words nowadays instead of one is because one of the words is a word that the system just got out of a book, it didn't know what it was and it's going to present it to you. But since it doesn't know the answer, it cannot grade it. So we give you another word, for which the system does know the answer. We don't tell you which one's which and we say, please type both. And if you type the correct word for the one for which the system knows the answer, it assumes you are human and it also gets some confidence that you typed the other word correctly. And if we repeat this process to 10 different people and they agree on what the new word is, then we get one more word digitized accurately.
Sljedeći put kad tipkate Captchu, te riječi koje tipkate su zapravo riječi koje dolaze iz knjiga koje se digitaliziraju koje kompjuter nije mogao prepoznati. I sad, razlog zbog kojeg danas imamo dvije riječi umjesto jedne jest taj što je, vidite, jedna od riječi riječ koju je sustav tek dobio iz knjige, nije znao što je, i namjerava je prikazati vama. Ali kako ne zna odgovor na nju, ne može ju ocijeniti za vas. Dakle, dajemo vam drugu riječ, za koju sustav zna odgovor. Ne govorimo vam koja je koja, i kažemo da utipkate obje. I ako utipkate pravilnu riječ za onu koju sustav već zna odgovor, pretpostavlja da ste čovjek, i također dobiva neku sigurnost da ste utipkali drugu riječ pravilno. I ako ponovimo taj proces s 10 različitih ljudi i svi se slože oko toga što je nova riječ, tad mi dobivamo još jednu riječ točno digitaliziranu.
So this is how the system works. And since we released it about three or four years ago, a lot of websites have started switching from the old CAPTCHA, where people wasted their time, to the new CAPTCHA where people are helping to digitize books. So every time you buy tickets on Ticketmaster, you help to digitize a book. Facebook: Every time you add a friend or poke somebody, you help to digitize a book. Twitter and about 350,000 other sites are all using reCAPTCHA. And the number of sites that are using reCAPTCHA is so high that the number of words we're digitizing per day is really large. It's about 100 million a day, which is the equivalent of about two and a half million books a year. And this is all being done one word at a time by just people typing CAPTCHAs on the Internet.
Evo kako sustav funkcionira. I u osnovi, otkad smo ga objavili prije tri ili četiri godine, puno se mrežnih mjesta počelo prebacivati sa stare Captche gdje su ljudi gubili vrijeme na novu Captchu gdje ljudi pomažu digitalizirati knjige. Npr., Ticketmaster. Svaki put kad kupite ulaznicu na Ticketmasteru, pomažete digitalizirati knjigu. Facebook: Svaki put kad dodate prijatelja ili bocnete nekog, pomažete digitalizirati knjigu. Twitter i oko 350.000 drugih mjesta koriste reCaptchu. I u stvari, broj mjesta koji koriste reCaptchu je tako visok da je broj riječi koje digitaliziramo na dan, stvarno, stvarno velik. To je oko 100 milijuna na dan, što je ekvivalent oko dva i pol milijuna knjiga godišnje. I sve rade s jednom po jednom riječju samo ljudi koji tipkaju Captchu na Internetu. (Pljesak)
(Applause)
Naravno,
Now, of course, since we're doing so many words per day, funny things can happen. This is especially true because now we're giving people two randomly chosen English words next to each other. So funny things can happen. For example, we presented this word. It's the word "Christians"; there's nothing wrong with it. But if you present it along with another randomly chosen word, bad things can happen. So we get this.
budući da obrađujemo toliko riječi na dan, mogu se dogoditi smiješne stvari. I to je posebno istinito zato što sada dajemo ljudima dvije nasumično izabrane engleske riječi jednu pokraj druge. Pa se mogu dogoditi smiješne stvari. Npr., prikazali smo ovu riječ. To je riječ "Christians" (kršćani); sve je u redu s njom. Ali ako ju prikažete zajedno s drugom nasumično odabranom riječi, loše se stvari mogu dogoditi. Pa dobivamo ovo. (Tekst: loši kršćani)
[bad Christians]
Ali još je gore, zato što se određena mrežna mjesta gdje smo ovo pokazali
But it's even worse, because the website where we showed this actually happened to be called The Embassy of the Kingdom of God.
zapravo nazivaju Poslanstvo Kraljevstva Božjeg. (Smijeh)
(Laughter)
Joj.
Oops.
(Smijeh)
(Laughter)
Evo još jednog stvarno lošeg.
Here's another really bad one. JohnEdwards.com
JohnEdwards.com (Tekst: Damn liberal - prokleti liberal)
[Damn liberal]
(Laughter)
(Smijeh)
So we keep on insulting people left and right everyday. Of course, we're not just insulting people. Here's the thing. Since we're presenting two randomly chosen words, interesting things can happen. So this actually has given rise to a really big Internet meme that tens of thousands of people have participated in, which is called CAPTCHA art. I'm sure some of you have heard about it. Here's how it works. Imagine you're using the Internet and you see a CAPTCHA that you think is somewhat peculiar, like this CAPTCHA.
Nastavljamo vrijeđati ljude na sve strane svaki dan. Naravno, ne vrijeđamo samo ljude. Evo o čemu se radi, budući da prikazujemo dvije nasumično izabrane riječi, zanimljive se stvari mogu dogoditi. Ovo je zapravo dovelo do stvarno velikog internetskog fenomena u kojem su sudjelovali deseci tisuća ljudi, koji se naziva umjetnost Captcha. Siguran sam da su neki od vas čuli za to. Evo kako funkcionira. Zamislite da ste na Internetu i vidite Captchu za koju mislite da je pomalo neobična, kao ova Captcha. (Tekst: nevidljivi toster)
[invisible toaster]
Ono što bi trebali učiniti jest snimiti sadržaj ekrana.
What you're supposed to do is you take a screenshot of it. Then of course, you fill out the CAPTCHA because you help us digitize a book. But first you take a screenshot and then you draw something that is related to it.
Zatim, naravno, ispunite Captchu zato što nam pomažete digitalizirati knjige. Ali zatim, prvo snimite sadržaj ekrana, i zatim nacrtate nešto što ima veze s tim.
(Laughter)
(Smijeh)
That's how it works.
Tako to funkcionira.
(Laughter)
Postoje deseci tisuća takvih.
There are tens of thousands of these. Some of them are very cute.
Neke od njih su stvarno ljupke. (Tekst: stisnuti ga)
[clenched it]
(Smijeh)
(Laughter)
Neke od njih su smiješnije.
Some of them are funnier.
(Tekst: napušeni osnivači)
[stoned Founders]
(Laughter)
(Smijeh)
And some of them, like paleontological shvisle ...
A neki od njih,
(Laughter)
kao "paleontološki shvisle",
they contain Snoop Dogg.
imaju Snoop Dogga.
(Laughter)
(Smijeh)
OK, so this is my favorite number of reCAPTCHA. So this is the favorite thing that I like about this whole project. This is the number of distinct people that have helped us digitize at least one word out of a book through reCAPTCHA: 750 million, a little over 10 percent of the world's population, has helped us digitize human knowledge. And it is numbers like these that motivate my research agenda. So the question that motivates my research is the following: If you look at humanity's large-scale achievements, these really big things that humanity has gotten together and done historically -- like, for example, building the pyramids of Egypt or the Panama Canal or putting a man on the Moon -- there is a curious fact about them, and it is that they were all done with about the same number of people. It's weird; they were all done with about 100,000 people. And the reason for that is because, before the Internet, coordinating more than 100,000 people, let alone paying them, was essentially impossible. But now with the Internet, I've just shown you a project where we've gotten 750 million people to help us digitize human knowledge. So the question that motivates my research is, if we can put a man on the Moon with 100,000, what can we do with 100 million?
U redu, ovo je moj najdraži broj reCaptche. Ovo je najdraža stvar koju volim u cijelom ovom projektu. Ovo je broj određenih ljudi koji su nam pomogli digitalizirati najmanje jednu riječ iz knjige putem reCaptche. 750 milijuna, što je malo preko 10 posto svjetske populacije, pomoglo nam je digitalizirati ljudsko znanje. Upravo brojevi poput ovih motiviraju moju istraživačku agendu. Dakle, pitanje koje motivira moje istraživanje glasi: ako pogledate velika ljudska dostignuća, te doista velike stvari koje je čovječanstvo steklo zajedno i učinilo povijesno -- kao npr., gradnja egipatskih piramida ili Panamskog kanala ili slanje čovjeka na Mjesec -- zanimljiva je činjenica da su sve napravljene s otprilike istim brojem ljudi. Čudno je; sve su napravljene s oko 100.000 ljudi. I razlog je tomu taj što je, prije Interneta, koordinacija više od 100.000 ljudi, a da i ne spominjem njihovo plaćanje, bilo u osnovi nemoguće. Ali sad s Internetom, upravo sam vam pokazao projekt gdje nam je 750 milijuna ljudi pomoglo digitalizirati ljudsko znanje. Pa pitanje koje motivira moje istraživanje glasi, ako možemo poslati čovjeka na Mjesec sa 100.000, što možemo učiniti sa 100 milijuna?
So based on this question, we've had a lot of different projects that we've been working on. Let me tell you about one that I'm most excited about. This is something that we've been semiquietly working on for the last year and a half or so. It hasn't yet been launched. It's called Duolingo. Since it hasn't been launched, shhh!
Na temelju ovog pitanja, imamo puno različitih projekata na kojima radimo. Reći ću vam nešto o jednom zbog kojeg sam najviše uzbuđen. To je nešto na čemu smo napola tiho radili zadnju godinu i pol. Još nije pokrenut. Naziva se Duolingo. Budući da još nije pokrenut, pssst!
(Laughter)
(Smijeh)
Yeah, I can trust you'll do that. So this is the project. Here's how it started. It started with me posing a question to my graduate student, Severin Hacker. OK, that's Severin Hacker. So I posed the question to my graduate student. By the way, you did hear me correctly; his last name is Hacker.
Da, vjerujem vam da ćete to učiniti. Ovo je projekt. Evo kako je počeo. Počeo je s pitanjem koje sam postavio svojem diplomiranom studentu, Severinu Hackeru. Ok, ovo je Severin Hacker. Dakle, postavio sam pitanje svojem studentu. Uzgred budi rečeno, dobro ste me čuli; njegovo je prezime Hacker.
(Laughter)
Postavio sam mu pitanje:
So I posed this question to him: How can we get 100 million people translating the web into every major language for free? There's a lot of things to say about this question. First of all, translating the web. Right now, the web is partitioned into multiple languages. A large fraction of it is in English. If you don't know English, you can't access it. But there's large fractions in other different languages, and if you don't know them, you can't access it. So I would like to translate all of the web, or at least most of it, into every major language. That's what I would like to do.
Kako pridobiti 100 milijuna ljudi da prevode Mrežu na sve veće jezike besplatno? Ok, ima puno stvari koje se mogu reći o ovom pitanju. Prvo, prevoditi Mrežu. Trenutno je Mreža podijeljena na brojne jezike. Velik dio je na engleskom. Ako imalo ne znaš engleski, ne možeš joj pristupiti. Ali postoje veliki dijelovi na drugim jezicima, i ako ne znaš te jezike, ne možeš joj pristupiti. Želio bih prevesti svu Mrežu, ili barem većinu Mreže, na svaki veći jezik. Dakle, to bih želio napraviti.
Now, some of you may say, why can't we use computers to translate? Machine translation is starting to translate some sentences here and there. Why can't we use it to translate the web? The problem with that is it's not yet good enough and it probably won't be for the next 15 to 20 years. It makes a lot of mistakes. Even when it doesn't, since it makes so many mistakes, you don't know whether to trust it or not.
Sad, neki od vas bi mogli reći, zašto ne možemo upotrijebiti kompjutore da prevode? Zašto ne možemo upotrijebiti automatski prijevod? Automatski prijevod danas počinje prevoditi tu i tamo neke rečenice. Zašto ga ne možemo upotrijebiti da prevedemo cijelu Mrežu? Pa, problem je u tome da još uvijek nije dovoljno dobar i najvjerojatnije neće biti 15 do 20 godina. Radi puno grešaka. Čak i kad ne griješi, budući da radi tako puno grešaka, ne znate da li da mu vjerujete ili ne.
So let me show you an example of something that was translated with a machine. Actually, it was a forum post. It was somebody who was trying to ask a question about JavaScript. It was translated from Japanese into English. So I'll just let you read. This person starts apologizing for the fact that it's translated with a computer. So the next sentence is going to be the preamble to the question. So he's just explaining something. Remember, it's a question about JavaScript.
Pokazat ću vam jedan primjer nečega što je bilo prevedeno sa strojem. Zapravo je to bio post na forumu. Netko je pokušavao postaviti pitanje o JavaScriptu. Preveden je s japanskog na engleski. Pustit ću vas da pročitate. Ova osoba počinje se ispričavati zbog činjenice da je prevedeno kompjuterom. Pa će sljedeća rečenica biti uvod u pitanje. On samo objašnjava nešto. Zapamtite, ovo je pitanje o JavaScriptu.
[At often, the goat-time install a error is vomit.]
(Tekst: Na često, jarac-vrijeme instalira greška je izbljuvak.)
(Laughter)
(Smijeh)
Then comes the first part of the question.
Zatim dolazi prvi dio pitanja.
[How many times like the wind, a pole, and the dragon?]
(Tekst: Koliko puta kao vjetar, štap, i zmaj?)
(Laughter)
(Smijeh)
Then comes my favorite part of the question.
Zatim dolazi moj omiljeni dio pitanja.
[This insult to father's stones?]
(Tekst: Ovo uvreda očevom kamenju?)
(Laughter)
(Smijeh)
And then comes the ending, which is my favorite part of the whole thing.
Zatim dolazi kraj, moj najdraži dio cijele stvari. (Tekst: Molim ispričati se za vašu glupost. Ima mnogo hvala vam.)
[Please apologize for your stupidity. There are a many thank you.]
(Smijeh)
(Laughter)
Ok, kompjuterski prijevod, još nije dovoljno dobar.
OK, so computer translation, not yet good enough. So back to the question. So we need people to translate the whole web. So now the next question you may have is, well, why can't we just pay people to do this? We could pay professional translators to translate the whole web. We could do that. Unfortunately, it would be extremely expensive. For example, translating a tiny fraction of the whole web, Wikipedia, into one other language, Spanish. OK? Wikipedia exists in Spanish, but it's very small compared to the size of English. It's about 20 percent of the size of English. If we wanted to translate the other 80 percent into Spanish, it would cost at least 50 million dollars -- and this is even at the most exploited, outsourcing country out there. So it would be very expensive. So what we want to do is, we want to get 100 million people translating the web into every major language for free.
Nazad k pitanju. Trebamo ljude da prevode cijelu Mrežu. Dakle, sljedeće pitanje koje bi mogli imati glasi, pa zašto ne možemo samo platiti ljudima da to učine? Možemo platiti profesionalnim prevoditeljima da prevedu cijelu Mrežu. Mogli bi to učiniti. Na žalost, bilo bi ekstremno skupo. Npr., prijevod malenog, malenog dijela cijele Mreže, Wikipedie, na jedan drugi jezik, španjolski. Wikipedia postoji na španjolskom, ali je jako mala u usporedbi s veličinom na engleskom. Iznosi oko 20 posto veličine na engleskom. Da smo željeli prevesti ostalih 80 posto na španjolski, koštalo bi nas najmanje 50 milijuna dolara -- i to kod najiskorištenije, najiznajmljivačke zemlje koja postoji. Bilo bi vrlo skupo. Dakle, ono što želimo jest pridobiti 100 milijuna ljudi da prevode Mrežu na svaki veći jezik besplatno.
If this is what you want to do, you quickly realize you're going to run into two big hurdles, two big obstacles. The first one is a lack of bilinguals. So I don't even know if there exists 100 million people out there using the web who are bilingual enough to help us translate. That's a big problem. The other problem you're going to run into is a lack of motivation. How are we going to motivate people to actually translate the web for free? Normally, you have to pay people to do this. So how are we going to motivate them to do it for free? When we were starting to think about this, we were blocked by these two things. But then we realized, there's a way to solve both these problems with the same solution. To kill two birds with one stone. And that is to transform language translation into something that millions of people want to do and that also helps with the problem of lack of bilinguals, and that is language education.
Ako je ovo to što želite učiniti, vrlo brzo ćete shvatiti da ćete naletjeti na dvije prilično velike prepreke, dvije velike poteškoće. Prva je nedostatak bilingvista. Pa čak ni ne znam postoji li 100 milijuna ljudi koji koriste Mrežu koji su dovoljno bilingvalni da nam pomognu prevoditi. To je velik problem. Drugi problem na koji ćete naići jest nedostatak motivacije. Kako ćemo motivirati ljude da ustvari prevode Mrežu besplatno? Inače moraš platiti ljudima da to naprave. Kako ćemo ih motivirati da to naprave besplatno? Kad smo započinjali razmišljati o tome, zaustavile su nas ove dvije stvari. Ali tad smo shvatili da ustvari postoji način da riješimo oba problema istom solucijom. Postoji način da se ubiju dvije muhe jednim udarcem. A to je da se transformira prevođenje na nešto što milijuni ljudi žele učiniti, što također pomaže kod problema s nedostatkom bilingvista, a to je učenje jezika.
So it turns out that today, there are over 1.2 billion people learning a foreign language. People really want to learn a foreign language. And it's not just because they're being forced to do so in school. In the US alone, there are over five million people who have paid over $500 for software to learn a new language. So people really want to learn a new language. So what we've been working on for the last year and a half is a new website -- it's called Duolingo -- where the basic idea is people learn a new language for free while simultaneously translating the web. And so basically, they're learning by doing.
Ispada da danas, postoji preko 1,2 milijarde ljudi koji uče strani jezik. Ljudi doista, doista žele učiti strani jezik. I to nije samo zato što ih se tjera na to u školi. Npr., samo u Sjedinjenim Državama, ima preko pet milijuna ljudi koji su platili više od 500 dolara za softver za učenje novog jezika. Pa ljudi doista, doista žele naučiti novi jezik. Dakle, ono na čemu smo radili zadnju godinu i pol je novo mrežno mjesto -- Duolingo -- gdje je osnovna ideja da ljudi uče novi jezik besplatno dok istovremeno prevode Mrežu. I u osnovi uče dok prevode.
So the way this works is whenever you're a just a beginner, we give you very simple sentences. There's a lot of very simple sentences on the web. We give you very simple sentences along with what each word means. And as you translate them and as you see how other people translate them, you start learning the language. And as you get more advanced, we give you more complex sentences to translate. But at all times, you're learning by doing.
Način na koji ovo funkcionira jest da kad god ste samo početnik, dajemo vam vrlo, vrlo jednostavne rečenice. Postoji, naravno, puno vrlo jednostavnih rečenica na Mreži. Dajemo vam vrlo, vrlo jednostavne rečenice zajedno s onim što svaka riječ znači. I kako ih prevodite, i dok gledate kako ih drugi ljudi prevode, počinjete učiti jezik. I kako postajete sve više napredni, dajemo vam sve složenije rečenice na prijevod. Ali cijelo vrijeme, učite dok prevodite. Sad, luda stvar u svezi s ovom metodom
Now, the crazy thing about this method is that it actually really works. People are really learning a language. We're mostly done building it and now we're testing it. People really can learn a language with it. And they learn it about as well as the leading language learning software. So people really do learn a language. And not only do they learn it as well, but actually it's more interesting. Because with Duolingo, people are learning with real content. As opposed to learning with made-up sentences, people are learning with real content, which is inherently interesting. So people really do learn a language.
jest da ona doista funkcionira. Prvo, ljudi doista, doista uče jezik. Uglavnom smo završili s izradom, sada ga testiramo. Ljudi doista mogu naučiti jezik s njim. I uče ga gotovo jednako dobro kao s vodećim softverom za učenje jezika. Pa ljudi doista nauče jezik. I ne samo da ga nauče jednako dobro, zapravo je puno zanimljivije. Zato što s Duolingom ljudi zapravo uče sa stvarnim sadržajem. U suprotnosti s učenjem s izmišljenim rečenicama, ljudi uče sa stvarnim sadržajem koji je po sebi zanimljiv. Pa ljudi stvarno uče jezik.
But perhaps more surprisingly, the translations that we get from people using the site, even though they're just beginners, the translations that we get are as accurate as those of professional language translators, which is very surprising. So let me show you one example. This is a sentence that was translated from German into English. The top is the German. The middle is an English translation that was done by a professional translator who we paid 20 cents a word for this translation. And the bottom is a translation by users of Duolingo, none of whom knew any German before they started using the site. If you can see, it's pretty much perfect. Of course, we play a trick here to make the translations as good as professional language translators. We combine the translations of multiple beginners to get the quality of a single professional translator.
Ali možda što je još veće iznenađenje, prijevodi koje dobivamo od ljudi koji koriste mjesto, iako su samo početnici, prijevodi koje dobivamo su jednako točni kao oni profesionalnih prevoditelja, što vrlo iznenađuje. Pokazat će vam jedan primjer. Ovo je rečenica prevedena s njemačkog na engleski. Na vrhu je njemački. U sredini je prijevod na engleski profesionalnog prevoditelja engleskog jezika kojeg smo platili 20 centi za riječ za ovaj prijevod. Na dnu je prijevod korisnika Duolinga, od kojih ni jedan nije znao njemački prije nego što su počeli koristiti mjesto. Možete vidjeti da je prilično savršen. Naravno, izveli smo trik da napravimo prijevode jednako dobrima kao one profesionalnih prevoditelja. Kombinirali smo prijevode više početnika da dobijemo kvalitetu jednog profesionalnog prevoditelja.
Now, even though we're combining the translations, the site actually can translate pretty fast. So let me show you, this is our estimates of how fast we could translate Wikipedia from English into Spanish. Remember, this is 50 million dollars' worth of value. So if we wanted to translate Wikipedia into Spanish, we could do it in five weeks with 100,000 active users. And we could do it in about 80 hours with a million active users. Since all the projects my group has worked on so far have gotten millions of users, we're hopeful that we'll be able to translate extremely fast.
Iako kombiniramo prijevode, mjesto zapravo može prevoditi prilično brzo. Pokazat ću vam koje su naše procjene koliko brzo možemo prevesti Wikipediu s engleskog na španjolski. Zapamtite, to je vrijednost od 50 milijuna dolara. Da smo htjeli prevesti Wikipediu na španjolski, napravili bismo to u pet tjedana sa 100.000 aktivnih korisnika. I mogli smo to napraviti za otprilike 80 sati s milijun aktivnih korisnika. Budući da su svi projekti na kojima je radila moja grupa dosad pridobili milijune korisnika, nadamo se da ćemo biti u mogućnosti prevoditi ekstremno brzo s ovim projektom.
Now, the thing that I'm most excited about with Duolingo is I think this provides a fair business model for language education. So here's the thing: The current business model for language education is the student pays, and in particular, the student pays Rosetta Stone 500 dollars.
Ono što me najviše uzbuđuju kod Duolinga jest to da omogućuje pošteni poslovni model za učenje jezika. Evo o čemu se radi: Sadašnji poslovni model za učenje jezika sastoji se u tome da učenik plaća, i posebno, učenik plaća Rosetti Stone 500 dolara.
(Laughter)
(Smijeh)
That's the current business model. The problem with this business model is that 95 percent of the world's population doesn't have 500 dollars. So it's extremely unfair towards the poor. This is totally biased towards the rich. Now, see, in Duolingo, because while you learn, you're actually creating value, you're translating stuff -- which, for example, we could charge somebody for translations, so this is how we could monetize this. Since people are creating value while they're learning, they don't have to pay with their money, they pay with their time. But the magical thing here is that is time that would have had to have been spent anyways learning the language. So the nice thing about Duolingo is, I think, it provides a fair business model -- one that doesn't discriminate against poor people.
To je sadašnji poslovni model. Problem je s tim poslovnim modelom u tome da 95 posto svjetske populacije nema 500 dolara. To je krajnje nepošteno prema siromašnima. Ovo je u potpunosti podređeno bogatima. Sad, u Duolingu, zato što, dok učite vi zapravo stvarate vrijednost, prevodite stvari -- za koje bi npr.mogli platiti nekoga za prijevod. Ovo je način na koji bi to mogli unovčiti. Budući da ljudi stvaraju vrijednost dok uče, ne moraju plaćati novcem, oni plaćaju svojim vremenom. Magična je stvar u tome da plaćaju svojim vremenom, ali to je vrijeme koje bi ionako potrošili učeći jezik. Dakle, lijepa stvar u svezi s Duolingom je u tom što, mislim, omogućuje pošten poslovni model -- onaj koji ne diskriminira siromašne ljude. Ovo je stranica. Hvala vam.
So here's the site. Thank you.
(Pljesak)
(Applause)
Ovo je stranica.
We haven't yet launched, but if you go there, you can sign up to be part of our private beta, which is probably going to start in three or four weeks. We haven't yet launched it.
Još nije pokrenuto, ali ako odete tamo, možete se upisati da budete dio naše probne verzije, koja će najvjerojatnije početi za tri do četiri tjedna. Još uvijek nismo pokrenuli Duolingo.
By the way, I'm the one talking here, but Duolingo is the work of a really awesome team,
Uzgred budi rečeno, ja sam taj koji govori, ali zapravo Duolingo je rad stvarno fenomenalnog tima, od kojeg su neki ovdje.
some of whom are here. So thank you.
Hvala vam.
(Applause)
(Pljesak)