4The Hardest Part: Pitch and Melody
Of all of music's dimensions, pitch is the one the implant degrades most. Place-pitch is coarse and distorted, temporal-pitch saturates within a few hundred hertz, and the casualty is melody: strip away rhythm and many recipients recognise familiar tunes barely above chance.
FTwo ways to hear pitch, both broken
The healthy cochlea encodes pitch in two complementary ways: PLACE (which point along the tonotopic basilar membrane vibrates) and TIMING (the rate at which auditory-nerve fibres phase-lock to the waveform). Music exploits both. An implant inherits a degraded version of each. Place-pitch is delivered by a handful of electrodes (typically 12-22) spanning a cochlea that natively resolves thousands of frequency channels, so the place axis is grossly undersampled. Temporal-pitch is delivered by the stimulation rate or the rate of envelope modulation, but the percept stops rising with rate above roughly 300 Hz - the electrical pitch saturates well below the range music lives in. Because both channels are compromised, pitch is the dimension where the gap between implant hearing and normal hearing is widest.[2014][2004]
TWhy place-pitch is ambiguous
Tonotopy is only loosely preserved after implantation: the array sits in the scala tympani, often does not reach the apical low-frequency region, and the map between an electrode and the cochlear place it stimulates is distorted and shifted relative to a normal ear. Current spread is the deeper problem - each electrode injects current that excites a broad swath of neurons, so neighbouring electrodes overlap heavily and the effective number of independent pitch channels is far smaller than the electrode count. The result is that two notes a semitone apart may stimulate overlapping neural populations and sound the same. Pitch-ranking studies show many CI users perform near chance for 1-semitone differences, the very resolution Western melody depends on. Reusing low-frequency apical sites is limited by both electrode insertion depth and surviving spiral-ganglion neurons, so the place axis cannot simply be 'turned up' in resolution.[2014][2008]
TWhy temporal-pitch saturates
When place cues are weak, the implant must lean on temporal cues - the periodicity of the envelope or the stimulation rate - to convey pitch. Electrically evoked pitch does rise with rate at low rates, but the percept plateaus near 300 Hz; faster rates no longer sound higher. Normal pitch tracking extends to several kHz. This ceiling sits right inside the musical range: middle C is about 262 Hz and the notes immediately above it cross the saturation point, so much of a melody's pitch movement falls where electrical pitch can no longer change. Just-noticeable frequency differences for CI users average roughly 10% (and range from ~1% to 30%), an order of magnitude worse than the well-under-1% of normal hearing - far too coarse for reliable melody.[2014][2004]
CThe consequence: melody collapses, contour barely survives
Melodic contour identification - simply judging whether a five-note pattern rises, falls or stays flat - is highly variable across CI users (14-91% correct in one classic study) and improves only as the note intervals are widened. Familiar-melody recognition with rhythm and lyrics removed is the harshest test: normal-hearing adults score far higher, and many implant users fall near chance, identifying tunes only by guessing. Performance jumps when rhythm cues are restored, because recipients identify the song by its rhythmic signature rather than its pitch contour - a compensation, not a recovery of melody. Clinically: counsel recipients that pitch and melody are the slowest, least complete part of music to return, and that contour-identification training can yield measurable, generalising gains.[2007][2002][2005][2004]
What best explains the dissociation between her preserved song recognition and her failure on the equal-duration melody test?
Why does electrically evoked temporal pitch fail to convey most musical melodies?
What is the main reason place-pitch is ambiguous in cochlear implant users?
Compared with normal hearing, the average just-noticeable frequency difference for CI users is approximately: