Alex Gendler: The Turing test: Can a computer pass for a human?

What is consciousness? Can an artificial machine really think? Does the mind just consist of neurons in the brain, or is there some intangible spark at its core? For many, these have been vital considerations for the future of artificial intelligence. But British computer scientist Alan Turing decided to disregard all these questions in favor of a much simpler one: can a computer talk like a human?

Što je svijest? Može li umjetni stroj zaista misliti? Čine li um samo neuroni u mozgu ili je u njegovoj srži neka neopipljiva iskra? Po mišljenju mnogih, to su ključna pitanja za budućnost umjetne inteligencije. Britanski računalni znanstvenik Alan Turing zanemario ih je i postavio jedno jednostavnije: može li računalo razgovarati kao čovjek?

This question led to an idea for measuring aritificial intelligence that would famously come to be known as the Turing test. In the 1950 paper, "Computing Machinery and Intelligence," Turing proposed the following game. A human judge has a text conversation with unseen players and evaluates their responses. To pass the test, a computer must be able to replace one of the players without substantially changing the results. In other words, a computer would be considered intelligent if its conversation couldn't be easily distinguished from a human's.

To je pitanje dovelo do zamisli o mjerenju umjetne inteligencije, koja će postati poznata kao Turingov test. U znanstvenom radu „Strojevi za računanje i inteligencija” iz 1950. Turing je predložio sljedeću igru. Sudac (čovjek) dopisuje se s igračima koje ne vidi i ocjenjuje njihove odgovore. Da bi prošlo test, računalo mora zamijeniti nekog od igrača, a da pritom ne dođe do značajne promjene u rezultatima. Drugim riječima, računalo bismo smatrali inteligentnim ako njegov govor nije lako razlikovati od ljudskog.

Turing predicted that by the year 2000, machines with 100 megabytes of memory would be able to easily pass his test. But he may have jumped the gun. Even though today's computers have far more memory than that, few have succeeded, and those that have done well focused more on finding clever ways to fool judges than using overwhelming computing power. Though it was never subjected to a real test, the first program with some claim to success was called ELIZA. With only a fairly short and simple script, it managed to mislead many people by mimicking a psychologist, encouraging them to talk more and reflecting their own questions back at them. Another early script PARRY took the opposite approach by imitating a paranoid schizophrenic who kept steering the conversation back to his own preprogrammed obsessions. Their success in fooling people highlighted one weakness of the test. Humans regularly attribute intelligence to a whole range of things that are not actually intelligent. Nonetheless, annual competitions like the Loebner Prize, have made the test more formal with judges knowing ahead of time that some of their conversation partners are machines.

Turing je predvidio da će do 2000. godine strojevi s memorijom od 100 megabajta uspjeti proći njegov test bez poteškoća. Možda se ipak malo zaletio. Iako današnja računala imaju znatno veću memoriju, malo je njih u tome uspjelo, a oni koji jesu imali dobre rezultate, više su bili usredotočeni na to kako pametno zavarati suce, nego na to kako upotrijebiti ogromnu računalnu snagu. Iako nikad nije podvrgnut stvarnom testu, prvi program koji se može pohvaliti barem djelomičnim uspjehom zvao se ELIZA. Taj prilično kratak i jednostavan program uspio je obmanuti mnoge ljude oponašajući psihologa, tako što ih je ohrabrivao da više govore i uzvraćao im njihovim vlastitim pitanjima. Drugi rani program PARRY postupio je suprotno, oponašajući paranoičnog šizofreničara koji je stalno usmjeravao razgovor na vlastite pretprogramirane opsesije. Njihovi uspjesi u obmanjivanju ljudi razotkrili su slabu točku testa. Ljudi redovito pripisuju inteligenciju čitavom nizu pojava koje s njom zapravo nemaju veze. Unatoč tome, godišnja natjecanja, kao što je Loebnerova nagrada, formalizirala su test. Suci sad unaprijed znaju da su neki od njihovih sugovornika strojevi.

But while the quality has improved, many chatbot programmers have used similar strategies to ELIZA and PARRY. 1997's winner Catherine could carry on amazingly focused and intelligent conversation, but mostly if the judge wanted to talk about Bill Clinton. And the more recent winner Eugene Goostman was given the persona of a 13-year-old Ukrainian boy, so judges interpreted its nonsequiturs and awkward grammar as language and culture barriers. Meanwhile, other programs like Cleverbot have taken a different approach by statistically analyzing huge databases of real conversations to determine the best responses. Some also store memories of previous conversations in order to improve over time. But while Cleverbot's individual responses can sound incredibly human, its lack of a consistent personality and inability to deal with brand new topics are a dead giveaway.

No iako se kvaliteta poboljšala, mnogi programeri chatbotâ koristili su se strategijama sličnima ELIZA-i i PARRY-ju. Pobjednica iz 1997. Catherine mogla je voditi zapanjujuće usredotočene i inteligentne razgovore, ali uglavnom ako bi sudac želio razgovarati o Billu Clintonu. A nedavnom pobjedniku Eugeneu Goostmanu dana je osobnost trinaestogodišnjaka iz Ukrajine, pa su suci njegove nesuvisle rečenice i nespretnu gramatiku pogrešno protumačili kao jezičnu i kulturološku barijeru. U međuvremenu su drugi programi, npr. Cleverbot, odabrali drukčiji pristup: statističkom analizom golemih baza podataka stvarnih razgovora birali su najbolje odgovore. Neki su i pohranjivali u memoriji prijašnje razgovore kako bi se s vremenom poboljšali. No iako Cleverbotovi odgovori zasebno mogu zvučati nevjerojatno ljudski, nedostatak dosljedne osobnosti i nesposobnost prilagodbe sasvim novim temama smjesta ga odaju.

Who in Turing's day could have predicted that today's computers would be able to pilot spacecraft, perform delicate surgeries, and solve massive equations, but still struggle with the most basic small talk? Human language turns out to be an amazingly complex phenomenon that can't be captured by even the largest dictionary. Chatbots can be baffled by simple pauses, like "umm..." or questions with no correct answer. And a simple conversational sentence, like, "I took the juice out of the fridge and gave it to him, but forgot to check the date," requires a wealth of underlying knowledge and intuition to parse. It turns out that simulating a human conversation takes more than just increasing memory and processing power, and as we get closer to Turing's goal, we may have to deal with all those big questions about consciousness after all.

Tko je u Turingovo vrijeme mogao predvidjeti da će današnja računala moći upravljati svemirskim letjelicama, obavljati osjetljive kirurške operacije i rješavati složene jednadžbe, no i dalje zapinjati u običnom čavrljanju? Ljudski se jezik pokazao nevjerojatno složenom pojavom koju ne može obuhvatiti čak ni najveći rječnik. Chatbote zbunjuju jednostavne pauze, npr. „hm...” ili pitanja bez točnih odgovora. A za analizu rečenice najbanalnijeg razgovora, primjerice „Izvadio sam sok iz hladnjaka i dao mu ga, ali sam zaboravio provjeriti datum.”, potrebni su ogromno prethodno znanje i intuicija. I tako se ispostavilo da je za simulaciju ljudskog razgovora potrebno više od pukog povećanja memorije i snage procesora i da ćemo se na putu prema Turingovu cilju ipak trebati pozabaviti svim tim velikim pitanjima o svijesti.

Alex Gendler: The Turing test: Can a computer pass for a human?

Alex Gendler: The Turing test: Can a computer pass for a human?

Related talks

Briana Brownell: How does artificial intelligence learn?

Matt Porter and Margaret Hamilton: NASA's first software engineer: Margaret Hamilton

Chiara Decaroli: The high-stakes race to make quantum computers work

Related talks

Briana Brownell: How does artificial intelligence learn?

Matt Porter and Margaret Hamilton: NASA's first software engineer: Margaret Hamilton

Chiara Decaroli: The high-stakes race to make quantum computers work