13Speech-in-noise & ecologically valid testing
The hardest listening in real life happens in noise, and a test in quiet can completely miss it — a person may ace single words in a silent booth and still be lost at every dinner table. Speech-in-noise testing measures the thing that actually matters: the signal-to-noise ratio at which understanding holds, found by letting the noise close in adaptively until the listener is right half the time. The details turn out to be everything. Whether the masker is steady or a babble of voices changes the score, because a normal-hearing listener exploits the brief gaps in babble while a cochlear-implant user largely cannot; where the loudspeakers sit changes it too. And a growing movement toward ecological validity argues that our tests should look more like life — fluctuating, spatial, effortful — than like the quiet booth. This module covers method; the candidacy cutoffs are in the next chapter.
TWhy test in noise
Real-world difficulty is in noise, not quiet. The speech-in-noise SRT is the signal-to-noise ratio (dB SNR) for 50% correct, found adaptively — letting a cochlear-implant user be compared with a normal-hearing listener who tolerates a far more negative SNR.[1994]
CThe adaptive tests
The named tests: HINT (BKB-derived sentences, SNR adapted to 50%), QuickSIN (IEEE sentences in babble, scored as the signal-to-babble ratio), BKB-SIN (difficulty-matched list pairs), and AzBio (multi-talker conversational sentences chosen to avoid the ceiling effects that retired HINT from candidacy).[2004][2008]
CMasker type & geometry
Masker type is a deliberate variable: cochlear-implant users gain little from the dips in fluctuating babble (unlike normal-hearing listeners), so babble depresses their scores far more than steady noise — and babble better mimics restaurants and classrooms. Geometry and standards matter too: a sound-treated room meeting ANSI S3.1, the loudspeaker 1 m at 0° azimuth, head centred, and speech and noise from the same 0° speaker so directional microphones do not inflate the score.[1999]
CEcological validity & effort
Ecological validity (Keidser et al.) reframes test design: conventional quiet/steady-noise tests predict daily function only partly, which motivates spatially-separated, fluctuating, multi-source scenarios. Datalogging shows most everyday speech reaches the ear below 60 dB SPL, motivating tests that sample soft and distant speech. And listening effort and fatigue — the FUEL framework — are outcome dimensions recognition scores miss, increasingly paired with speech-in-noise testing.[2020][2016]
What testing approach captures the problem?
What does a speech-in-noise test report?
Why does multi-talker babble depress CI users' scores more than steady noise?