Rébecca Kleinberger: Why you don't like the sound of your own voice

If you ask evolutionary biologists when did humans become humans, some of them will say that, well, at some point we started standing on our feet, became biped and became the masters of our environment. Others will say that because our brain started growing much bigger, that we were able to have much more complex cognitive processes. And others might argue that it's because we developed language that allowed us to evolve as a species. Interestingly, those three phenomena are all connected. We are not sure how or in which order, but they are all linked with the change of shape of a little bone in the back of your neck that changed the angle between our head and our body. That means we were able to stand upright but also for our brain to evolve in the back and for our voice box to grow from seven centimeters for primates to 11 and up to 17 centimeters for humans.

And this is called the descent of the larynx. And the larynx is the site of your voice. When baby humans are born today, their larynx is not descended yet. That only happens at about three months old. So, metaphorically, each of us here has relived the evolution of our whole species. And talking about babies, when you were starting to develop in your mother's womb, the first sensation that you had coming from the outside world, at only three weeks old, when you were about the size of a shrimp, were through the tactile sensation coming from the vibrations of your mother's voice.

So, as we can see, the human voice is quite meaningful and important at the level of the species, at the level of the society -- this is how we communicate and create bonds, and at the personal and interpersonal levels -- with our voice, we share much more than words and data, we share basically who we are. And our voice is indistinguishable from how other people see us. It is a mask that we wear in society. But our relationship with our own voice is far from obvious. We rarely use our voice for ourselves; we use it as a gift to give to others. It is how we touch each other. It's a dialectical grooming.

But what do we think about our own voice? So please raise your hand if you don't like the sound of your voice when you hear it on a recording machine.

(Laughter)

Yeah, thank you, indeed, most people report not liking the sound of their voice recording. So what does that mean? Let's try to understand that in the next 10 minutes. I'm a researcher at the MIT Media Lab, part of the Opera of the Future group, and my research focuses on the relationship people have with their own voice and with the voices of others. I study what we can learn from listening to voices, from the various fields, from neurology to biology, cognitive sciences, linguistics. In our group we create tools and experiences to help people gain a better applied understanding of their voice in order to reduce the biases, to become better listeners, to create more healthy relationships or just to understand themselves better.

And this really has to come with a holistic approach to the voice. Because, think about all the applications and implications that the voice may have, as we discover more about it. Your voice is a very complex phenomenon. It requires a synchronization of more than 100 muscles in your body. And by listening to the voice, we can understand possible failures of what happens inside. For example: listening to very specific types of turbulences and nonlinearity of the voice can help predict very early stages of Parkinson's, just through a phone call. Listening to the breathlessness of the voice can help detect heart disease. And we also know that the changes of tempo inside individual words are a very good marker of depression.

Your voice is also very linked with your hormone levels. Third parties listening to female voices were able to very accurately place the speaker on their menstrual cycle. Just with acoustic information. And now with technology listening to us all the time, Alexa from Amazon Echo might be able to predict if you're pregnant even before you know it. So think about --

(Laughter)

Think about the ethical implications of that. Your voice is also very linked to how you create relationships. You have a different voice for every person you talk to. If I take a little snippet of your voice and I analyze it, I can know whether you're talking to your mother, to your brother, your friend or your boss. We can also use, as a predictor, the vocal posture. Meaning, how you decide to place your voice when you talk to someone. And your vocal posture, when you talk to your spouse, can help predict not only if, but also when you will divorce.

So there is a lot to learn from listening to voices. And I believe this has to start with understanding that we have more than one voice. So, I'm going to talk about three voices that most of us possess, in a model of what I call the mask. So when you look at the mask, what you see is a projection of a character. Let's call that your outward voice. This is also the most classic way to think about the voice, it's a way of projecting yourself in the world. The mechanism for this projection is well understood. Your lungs contract your diaphragm and that creates a self-sustained vibration of your vocal fold, that creates a sound. And then the way you open and close the cavities in your mouth, your vocal tract is going to transform the sound.

So everyone has the same mechanism. But voices are quite unique. It's because very subtle differences in size, physiology, in hormone levels are going to make very subtle differences in your outward voice. And your brain is very good at picking up those subtle differences from other people's outward voices. In our lab, we are working on teaching machines to understand those subtle differences. And we use deep learning to create a real-time speaker identification system to help raise awareness on the use of the shared vocal space -- so who talks and who never talks during meetings -- to increase group intelligence.

And one of the difficulties with that is that your voice is also not static. We already said that it changes with every person you talk to but it also changes generally throughout your life. At the beginning and at the end of the journey, male and female voices are very similar. It's very hard to distinguish the voice of a very young girl from the voice of a very young boy. But in between, your voice becomes a marker of your fluid identity. Generally, for male voices there's a big change at puberty. And then for female voices, there is a change at each pregnancy and a big change at menopause. So all of that is the voice other people hear when you talk. So why is it that we're so unfamiliar with it? Why is it that it's not the voice that we hear? So, let's think about it.

When you wear a mask, you actually don't see the mask. And when you try to observe it, what you will see is inside of the mask. And that's your inward voice. So to understand why it's different, let's try to understand the mechanism of perception of this inward voice. Because your body has many ways of filtering it differently from the outward voice. So to perceive this voice, it first has to travel to your ears. And your outward voice travels through the air while your inward voice travels through your bones. This is called bone conduction. Because of this, your inward voice is going to sound in a lower register and also more musically harmonical than your outward voice. Once it travels there, it has to access your inner ear. And there's this other mechanism taking place here. It's a mechanical filter, it's a little partition that comes and protects your inner ear each time you produce a sound. So it also reduces what you hear. And then there is a third filter, it's a biological filter. Your cochlea -- it's a part of your inner ear that processes the sound -- is made out of living cells. And those living cells are going to trigger differently according to how often they hear the sound. It's a habituation effect. So because of this, as your voice is the sound you hear the most in your life, you actually hear it less than other sounds.

Finally, we have a fourth filter. It's a neurological filter. Neurologists found out recently that when you open your mouth to create a sound, your own auditory cortex shuts down. So you hear your voice but your brain actually never listens to the sound of your voice. Well, evolutionarily that might make sense, because we know cognitively what we are going to sound like so maybe we don't need to spend energy analyzing the signal. And this is called a corollary discharge and it happens for every motion that your body does. The exact definition of a corollary discharge is a copy of a motor command that is sent by the brain. This copy doesn't create any motion itself but instead is sent to other regions of the brain to inform them of the impending motion. And for the voice, this corollary discharge also has a different name. It is your inner voice.

So let's recapitulate. We have the mask, the outward voice, the inside of the mask, your inward voice, and then you have your inner voice. And I like to see this one as the puppeteer that holds the strings of the whole system. Your inner voice is the one you hear when you read a text silently, when you rehearse for an important conversation. Sometimes it's hard to turn it off, it's really hard to look at the text written in your native language, without having this inner voice read it. It's also the voice that refuses to stop singing the stupid song you have in your head.

(Laughter)

And for some people it's actually impossible to control it. And that's the case of schizophrenic patients, who have auditory hallucinations. Who can't distinguish at all between voices coming from inside and outside their head. So in our lab, we are also working on small devices to help those people make those distinctions and know if a voice is internal or external.

You can also think about the inner voice as the voice that speaks in your dream. This inner voice can take many forms. And in your dreams, you actually unleash the potential of this inner voice. That's another work we are doing in our lab: trying to access this inner voice in dreams. So even if you can't always control it, the inner voice -- you can always engage with it through dialogue, through inner dialogues. And you can even see this inner voice as the missing link between thought and actions.

So I hope I've left you with a better appreciation, a new appreciation of all of your voices and the role they play inside and outside of you -- as your voice is a very critical determinant of what makes you human and of how you interact with the world.

Thank you.

(Applause)
