6Adding F1: The F0/F1/F2 Processor
The next Nucleus feature-extraction processor added the first formant F1 to recover low-frequency vowel information, splitting the electrode array into apical (F1) and basal (F2) stimulation sets. It improved speech recognition but exposed the limits of formant tracking for consonants.
TThe F0/F1/F2 principle
F0/F1/F2 improves on F0/F2 by adding the first formant F1, the low-frequency formant that carries vowel information Two electrode sets are stimulated: apical electrodes carry F1 and basal electrodes carry F2 Pulse amplitudes are made proportional to the formant amplitudes A1 (for F1) and A2 (for F2) F0 still governs the stimulation rate as in the earlier strategy.[1987][2000]
CThe three-band chain
F1 is extracted from a 280-1000 Hz band-pass filter via a zero-crossing detector plus an envelope detector for A1, driving the apical electrodes F0 is extracted from a 270 Hz low-pass filter and controls the pulse rate F2 is extracted from a 1000-4000 Hz band-pass filter via a zero-crossing detector plus an envelope detector for A2, driving the basal electrodes Pulses are about 200 microseconds wide with 800 microseconds of separation to avoid channel interaction.[1987][1999]
CVoiced versus unvoiced timing
Stimulation rate equals F0 pulses per second for voiced segments Unvoiced segments are stimulated at an average of about 100 pulses per second The 200 microsecond pulse with 800 microsecond separation enforces non-simultaneity between the formant electrodes Pulse amplitude proportional to formant amplitude preserves relative formant strength.[1987][1999]
TWhy F0/F1/F2 was superseded
F0/F1/F2 emphasises low-frequency vowel information but did not significantly improve consonant recognition Most consonants carry high-frequency information that two-formant-plus-F1 tracking still missed This limitation motivated adding dedicated high-frequency bands, leading to the Multipeak (MPEAK) strategy The improvement over F0/F2 was real but consonant gains remained limited.[1987][2008]
TBy the numbers
FHear it
What does this pattern reveal about the F0/F1/F2 strategy?
In F0/F1/F2, how is the electrode array divided between the two formants?
Why did F0/F1/F2 still struggle with consonant recognition?