Cochlear Implant Atlas
CI Atlas · Learning to Listen: Rehabilitation and Habilitation · Module 08

8Speech Production and Articulation

A child cannot reliably produce a sound they cannot hear. By restoring access to the speaker's own voice, the cochlear implant reopens the auditory feedback loop that governs intelligible speech, this module covers the therapy that builds clear speech on top of it: working on vowels and consonants, on the melody and rhythm that carry naturalness, on the characteristic error patterns of implanted children, and on the self-monitoring that makes gains permanent.

FThe auditory feedback loop the implant restores

Speech production is normally governed by an auditory feedback loop: a speaker hears their own output and adjusts articulation toward the intended target, a loop that collapses with profound deafness. By making the child's own voice and ambient speech audible, the implant restores this feedback, which is why speech production typically improves alongside speech perception after implantation. There is a direct link between what is heard and what is produced; sounds that are well perceived through the device tend to be produced more accurately, while acoustically vulnerable sounds remain error-prone. Production gains lag perception: a child must first detect and discriminate a contrast before being able to monitor and shape their own production of it. Therapy therefore couples listening and speaking, training perception of a target and production of the same target in the same activities rather than treating them separately.[2020][2009][2011]

The auditory feedback loop of speech

×Intended targetArticulationOwn voice heardError detectedCorrectionoutputdrifts off target

Clear speech depends on hearing your own voice: you compare what you produced against the intended target, detect the error and correct it. In profound deafness the “own voice heard” link is cut, so articulation drifts and speech deteriorates — even in those who once spoke. A cochlear implant restores that monitoring link, letting output track perception again, which is why early implantation protects developing speech. Schematic.

TSegmentals: vowels and consonants

Segmental therapy targets the individual phonemes, vowels and consonants, that distinguish words; accurate vowels depend on perceiving formant (resonance) patterns and consonants on perceiving brief, often high-frequency cues. Vowels generally stabilize earlier than consonants because their formant structure is conveyed comparatively well by the implant. High-frequency, low-intensity consonants (especially /s/, /f/ and other fricatives) are the most error-prone, mirroring exactly the parts of the speech spectrum that are hardest to resolve through an implant. Co-articulation, the way neighbouring sounds blend, is trained so speech sounds connected and natural rather than a string of isolated phonemes. Therapists target a sound in listening first, then in syllables, words, and connected speech, generalizing accuracy from the clinic into spontaneous talk.[2020][2011][2009]

Speech banana, shaded by implant error-proneness

103050702505001k2k4k8kuaomnshfsthfrequency (Hz) →soft ↑ dB
Phoneme/s/Frequency6.5 kHzError risk92%

Plotting speech sounds by frequency and intensity gives the familiar “banana.” Through an implant, low-frequency vowels ride high-energy formants and stay robust (green), while the high-frequency, low-intensity fricatives /s/, /f/ and /th/ (~4–8 kHz) are the most error-prone (red) because their faint, brief energy is hardest to encode and resolve. This is why plurals, possessives and verb endings are common targets in therapy. Schematic.

TSuprasegmentals and voice quality

Suprasegmentals, intonation, stress, rhythm and timing, ride on pitch, loudness and duration cues that the implant conveys relatively well, and they strongly shape how natural and intelligible speech sounds. Therapy addresses appropriate pitch and intonation contours, contrastive stress (emphasizing the right word), and speaking rate and rhythm. Voice quality is targeted because deaf and late-implanted speakers can show abnormal pitch, breathiness, hypernasality or unstable loudness control from years without auditory feedback. Because suprasegmental cues are robust through the device, they are often an early and high-yield therapy focus, improving listener-judged naturalness even before every consonant is mastered. Prosody also serves language: stress and intonation mark questions, emphasis and grammatical boundaries, so suprasegmental work supports comprehension as well as clarity.[2020][2009][2011]

Stress shifts meaning: “I didn’t say she stole.”

high pitch ↑Ididn'tsayshestole…it was someone else, not her.

Pitch, stress and timing — the suprasegmentals — ride above individual phonemes and can flip a sentence’s meaning without changing a single word. Stressing a different word in the same five-word sentence shifts the implied contrast entirely, as the contour and gloss show. Because cochlear implants convey pitch and prosody less faithfully than a normal cochlea, training a child to hear and produce contrastive stress is a core therapy goal. Schematic.

CError patterns and self-monitoring

Common error patterns in implanted children include omission or substitution of fricatives and final consonants, vowel neutralization, and prosodic abnormalities, patterns that track the acoustic limitations of electric hearing. Standardized articulation and phonology assessment is used to profile a child's errors and direct therapy to the highest-impact targets. Self-monitoring, teaching the child to listen to their own output and compare it to the target, is the bridge from clinic accuracy to durable, generalized intelligible speech. Intelligibility is the functional endpoint, how much of the child's speech an ordinary listener understands, and it improves with earlier implantation, better perception, and consistent device use. When scoring spoken responses on listening tests, clinicians must separate misarticulation from mishearing, taking the child's known production errors into account so a perception score is not unfairly penalized.[2020][2011][2016]

Case 19.8 · Speech Production and Articulation
A 5-year-old implanted at 18 months has good open-set word recognition and clear vowels, but listeners outside the family often misunderstand her. Assessment shows she omits final /s/ and /t/, substitutes stops for fricatives, and speaks in a relatively flat, monotone voice.

Which therapy focus is most likely to improve her overall intelligibility soonest?

Self-assessment — Module 82 questions
Question 1

Why does speech production typically improve after cochlear implantation in a child who is profoundly deaf?

Question 2

Which speech sounds are most often produced in error by implanted children, and why?

Tracked locally in your browser — see /progress for the dashboard.