How do the larynx and pharynx contribute to speech?

The larynx, or voice box, contains vocal folds that vibrate when air is exhaled, creating the basic sound of the voice. The pharynx and oral cavity (mouth, tongue, lips) then shape this sound into specific consonants and vowels, forming words.

What is a brain-machine interface and how is it used for paralyzed individuals?

A brain-machine interface (BMI) translates neural activity from the brain into commands for external devices. In Dr. Chang's research, it intercepts signals from speech-related brain areas and converts them into words, allowing paralyzed individuals to communicate.

What are the challenges and ethical considerations of neural augmentation?

While augmentation aims to enhance human abilities beyond normal physiological limits, ethical concerns arise regarding invasive procedures, societal access, and defining what is desirable for human advancement. The technology is advancing, but thorough discussions on its implications are needed.

How can facial expressions and avatars improve communication for disabled individuals?

Integrating facial expressions and animated avatars can make digital communication more natural and complete. For those with paralysis, decoding nonverbal cues and visual mouth movements alongside speech signals can greatly improve intelligibility and participation in digital interactions.

What causes stuttering and how can it be treated?

Stuttering is a speech disorder affecting the fluent production of words, likely due to a breakdown in the precise coordination of vocal tract movements. While anxiety can worsen it, it's not the primary cause. Treatment typically involves speech therapy focusing on initiation techniques and auditory feedback.

Key Moments

Essentials: The Science of Learning & Speaking Languages | Dr. Eddie Chang

Andrew Huberman

Science & Technology5 min read29 min video

May 21, 2026|4,401 views|303|34

andrew huberman huberman lab podcast huberman podcast dr. andrew huberman neuroscience huberman lab andrew huberman podcast the huberman lab podcast science podcast eddie chang

Save to Pod

Want to know something specific about what's covered?

We've already dissected every moment. Ask and we will deliver (with timestamps).

Key Moments

TL;DR

Brain-computer interfaces can translate neural signals into speech for paralyzed individuals, yet ethical questions arise about using this tech for human augmentation.

Key Insights

Speech is the motor act of shaping breath into words, while language encompasses understanding meaning, grammar, and context.

The larynx, containing vocal folds, generates the initial sound (voicing) at frequencies around 100 Hz for men and 200 Hz for women; above this, the pharynx, mouth, tongue, and lips shape the sound into specific consonants and vowels.

The "locked-in syndrome" affects individuals with intact cognition but no ability to speak or move, often caused by brain stem strokes or ALS.

The Bravo trial successfully enabled a participant paralyzed for 15 years to communicate by translating his brain activity from speech-related neural areas into text using AI, achieving a vocabulary of 50 words initially.

Augmentation beyond normal human capabilities, such as enhanced memory or communication speed, using neurotechnologies raises significant ethical concerns and requires societal discussion.

Stuttering is specifically a speech production issue where the coordination of vocal tract movements breaks down, not primarily caused by anxiety, though anxiety can exacerbate it.

Distinguishing speech from language and vocalization

Speech is defined as the physical act of producing sounds through the vocal tract to communicate, involving the coordinated movement of articulators like the lips, tongue, and jaw to shape airflow into words. Language, conversely, is the broader system of communication, encompassing semantics (meaning), syntax (grammar), and pragmatics (contextual understanding). Vocalizations, such as crying or laughter, are distinct from complex speech and language and are processed by different neural pathways, even evident in non-human primates. This distinction is crucial for understanding how the brain orchestrates communication, separating the motor execution of sounds from the cognitive processing of meaning and structure. While speech is one modality for language, other forms like sign language and reading engage different neural circuits but serve the same linguistic purpose.

The mechanics of sound production in the larynx and vocal tract

The process of speaking begins with exhalation, drawing air into the lungs. At the larynx, or voice box, two structures called vocal folds (often misnamed vocal cords) come together. As air is pushed out, it passes through these approximated folds, causing them to vibrate at high frequencies—approximately 100 Hz for males and 200 Hz for females, which accounts for typical pitch differences. This vibration generates the fundamental sound of the voice, known as voicing. The sound then travels upward through the pharynx and into the oral cavity, where the tongue, lips, jaw, and palate act as articulators. These structures dynamically shape the airflow and voicing, transforming the raw sound into the distinct vowels and consonants that form words. This complex interplay of respiration, laryngeal vibration, and articulatory shaping makes speech a remarkably intricate motor feat.

Restoring communication with brain-machine interfaces

Neurosurgeons like Dr. Eddie Chang are pioneering brain-machine interfaces (BMIs) to restore communication for individuals with severe paralysis, such as those suffering from brain stem strokes or Amyotrophic Lateral Sclerosis (ALS). These conditions can lead to 'locked-in syndrome,' where a person is fully aware but unable to move or speak, leading to profound psychological isolation. The research involves implanting electrode arrays in speech-controlling areas of the cerebral cortex to detect neural activity patterns associated with intended speech. This neural data is then transmitted via a port on the skull to a computer, where sophisticated machine learning algorithms translate these subtle electrical signals into words and sentences. The Bravo trial, for instance, successfully enabled a participant paralyzed for 15 years to communicate, initially by decoding a vocabulary of 50 words. The system uses AI and features like autocorrect, inspired by smartphone texting, to interpret and refine the decoded speech, offering a vital lifeline for those trapped without a voice. The work represents a significant leap towards reconnecting individuals with their thoughts and the outside world.

The ethical landscape of neural augmentation

Beyond restoring lost function, the rapidly advancing field of BMIs, including work from companies like Neuralink, prompts discussions about 'augmentation'—enhancing human abilities beyond natural limits. This could involve super-memory, faster communication speeds, or superior physical precision. Dr. Chang acknowledges that while the science of BMIs has a long history, the current involvement of industry is accelerating commercialization and raising new questions tied to augmentation. He stresses that, unlike the primary medical goal of restoring function, enhancement technologies require deep ethical consideration. While humans have always sought augmentation through substances like caffeine or nicotine, neurotechnologies, especially invasive ones, present a different scale of intervention. Dr. Chang suggests that such enhancements may arrive incrementally and subtly, rather than in dramatic leaps, and that society has not yet fully grappled with the implications of unequal access, societal impact, and the very desirability of these super-physiological capabilities.

Integrating facial expressions and avatars

Future communication technologies aim for more holistic expression by integrating non-verbal cues like facial expressions with speech decoding. Dr. Chang's lab is exploring how to translate neural signals related not just to speech, but also to intended facial movements, into fully animated avatars. This approach recognizes that visual cues, such as seeing the speaker's mouth move, enhance auditory comprehension. For individuals with paralysis, such as Poncho, creating avatars that emote and articulate can offer a richer and more natural form of digital communication, particularly as social interactions increasingly move into virtual spaces. This research seeks to improve the quality of communication for disabled individuals and also holds potential as a feedback mechanism for those learning to use speech neuroprosthetics, making the process feel more embodied and direct.

Understanding and addressing stuttering

Stuttering is characterized as a speech disorder, not a language one, meaning individuals possess the linguistic content and grammar but struggle with fluent articulation. It involves a breakdown in the precise, coordinated movements of the vocal tract required for fluent speech production. While not primarily caused by anxiety, anxiety can trigger or worsen stuttering episodes. The exact cause remains unclear but is linked to complex brain functions controlling speech machinery. Speech therapy is the primary treatment, focusing on strategies to improve articulation, initiate speech, and manage anxiety. Emerging research also highlights the crucial role of auditory feedback—how we hear ourselves speak—suggesting that disruptions in the brain's processing of this feedback loop might contribute to stuttering and could be a target for therapeutic intervention.

Mentioned in This Episode

●Products

●Companies

●Organizations

●Studies Cited

●People Referenced

Common Questions

Speech refers to the physical act of producing sounds with the vocal tract to communicate, focusing on the auditory signal. Language is broader, encompassing the extraction of meaning (semantics), understanding context (pragmatics), and grammatical structure (syntax) from spoken or other communication forms.

Topics

Health & Longevity Neuroscience & the Brain Technology & Innovation Brain-computer Interface Speech Disorders Speech Production Language Processing Communication Technology

Mentioned in this video

Companies

Neuralink

Elon Musk's company focused on developing brain-machine interfaces, mentioned in the context of broader advancements in the field.

People

Eddie Chang

Guest on the podcast, specializing in the neurobiology of speech and language. He discusses his research on brain-machine interfaces for communication.

Andrew Huberman

Host of Huberman Lab, professor of neurobiology and ophthalmology at Stanford School of Medicine. He leads the discussion on speech and language.

Elon Musk

Founder of Neuralink, mentioned in relation to the company's work on brain-machine interfaces.

Products

iPhone

A smartphone that provides access to information, used as an example of existing technology for cognitive enhancement.

Organizations

FDA

The U.S. Food and Drug Administration, which approved the clinical trial for the brain-machine interface technology.

Studies & Research

Bravo Trial

A clinical trial focused on using brain-machine interfaces to decode neural activity into speech for paralyzed individuals.

Ask anything from this episode.

Save it, chat with it, and connect it to Claude or ChatGPT. Get cited answers from the actual content — and build your own knowledge base of every podcast and video you care about.

Get Started Free