Cochlear Implant Atlas
CI Atlas · Speech-Coding Strategies: The Complete Lineage · Module 06

6Adding F1: The F0/F1/F2 Processor

The next Nucleus feature-extraction processor added the first formant F1 to recover low-frequency vowel information, splitting the electrode array into apical (F1) and basal (F2) stimulation sets. It improved speech recognition but exposed the limits of formant tracking for consonants.

TThe F0/F1/F2 principle

F0/F1/F2 improves on F0/F2 by adding the first formant F1, the low-frequency formant that carries vowel information Two electrode sets are stimulated: apical electrodes carry F1 and basal electrodes carry F2 Pulse amplitudes are made proportional to the formant amplitudes A1 (for F1) and A2 (for F2) F0 still governs the stimulation rate as in the earlier strategy.[1987][2000]

+33Percentage-point sentence gain when F1 was added (31%->64%) [1990]
+20Percentage-point NU-6 word gain (8%->28%) [1990]

CThe three-band chain

F1 is extracted from a 280-1000 Hz band-pass filter via a zero-crossing detector plus an envelope detector for A1, driving the apical electrodes F0 is extracted from a 270 Hz low-pass filter and controls the pulse rate F2 is extracted from a 1000-4000 Hz band-pass filter via a zero-crossing detector plus an envelope detector for A2, driving the basal electrodes Pulses are about 200 microseconds wide with 800 microseconds of separation to avoid channel interaction.[1987][1999]

F1 → apical electrodes, F2 → basal electrodes

F1 region (apex)F2 region (base)A1A2
Vowel/a/F1730 HzF21100 HzF0 → rate132 pps

Adding the first formant F1 to F0/F2 gave the vowel its second anchor. The processor lights one apical electrode for F1 and one basal electrode for F2, scaling their pulse heights by the formant amplitudes A1 and A2 and pulsing at the F0 rate. It improved vowel recognition — but it still represents the spectrum with only two moving points, and consonant cues above F2 are missing, which is what drove Multipeak. Schematic.

CVoiced versus unvoiced timing

Stimulation rate equals F0 pulses per second for voiced segments Unvoiced segments are stimulated at an average of about 100 pulses per second The 200 microsecond pulse with 800 microsecond separation enforces non-simultaneity between the formant electrodes Pulse amplitude proportional to formant amplitude preserves relative formant strength.[1987][1999]

TWhy F0/F1/F2 was superseded

F0/F1/F2 emphasises low-frequency vowel information but did not significantly improve consonant recognition Most consonants carry high-frequency information that two-formant-plus-F1 tracking still missed This limitation motivated adding dedicated high-frequency bands, leading to the Multipeak (MPEAK) strategy The improvement over F0/F2 was real but consonant gains remained limited.[1987][2008]

How much of the spectrum survives?

low freqhigh freqonly 2–3 tracked features are sent

The feature-extraction bet was that a few perceptually chosen numbers — F0, F1, F2 — carry most of speech, so the rest of the spectrum can be discarded. CIS made the opposite bet: don't decide what matters, just send every band's envelope and let the brain choose. When the feature estimates are correct the formant approach is efficient; when they are wrong, the discarded spectrum cannot be recovered — and that robustness is why the field swung to filter-bank methods. Schematic.

TBy the numbers

Adding the First Formant: F0F1F2 Improvement Over F0F2

020406080Percent / words correctNU-6 wordsSentences (no context)Spondees in noise
Speech testSpondees in noiseF0F2 (before adding F1)37%F0F1F2 (F1 added)75%

Encoding the first formant alongside F0 and F2 enriched the transmission of voicing, duration, and envelope cues. Open-set sentences without context jumped from 31% to 64% (+33 pp) and spondees-in-noise from 37% to 75% (+38 pp). This step established that more of the spectral envelope, not just a single peak, was needed. Verified means from Tye-Murray 1990 (n=5).

FHear it

Synthesise a vowel — the two numbers a formant tracker chases

← F2 (Hz)F1↓/a//i//u//e//o/
Nearest vowel/a/F1730 HzF21090 Hz

Two resonances do most of the work of a vowel: F1 (lower when the mouth is closed, as in /i/ and /u/) and F2 (higher when the tongue is front, as in /i/ and /e/). Play the vowel and slide F1 and F2 — the percept walks around the vowel space. This is precisely what the F0/F2 and F0/F1/F2 strategies tried to track and transmit; when their estimate of these two numbers was wrong, the wrong vowel was sent. Synthesised in your browser.

Case 14.6 · Adding F1
An engineer upgrades a recipient from an F0/F2 to an F0/F1/F2 processor. Vowel identification improves clearly, but consonant scores barely move.

What does this pattern reveal about the F0/F1/F2 strategy?

Self-assessment — Module 62 questions
Question 1

In F0/F1/F2, how is the electrode array divided between the two formants?

Question 2

Why did F0/F1/F2 still struggle with consonant recognition?

Tracked locally in your browser — see /progress for the dashboard.