6.2 The auditory brainstem response (ABR)

The ABR is the averaged scalp potential evoked by a brief acoustic stimulus, recorded over the first 10 ms after stimulus onset, dominated by five consistent peaks named in Roman numerals I through V. Each peak corresponds — with surprising one-to-one fidelity — to a specific stage of the brainstem auditory pathway:

WaveLatency at 80 dB nHLAnatomical generator
I1.5 msDistal auditory nerve (just outside the modiolus)
II2.7 msProximal auditory nerve (entering the brainstem)
III3.9 msCochlear nucleus
IV5.0 msSuperior olivary complex
V5.7 msLateral lemniscus / inferior colliculus

The peaks are vertex-positive (recording electrode at Cz, reference at the ipsilateral mastoid, ground at the forehead). Wave V is the most robust peak in adults and across patients; clinical interpretation centres on its latency, amplitude, and presence at low stimulus levels.

0246810IIIIIIIVVtime after click (ms)scalp potential (µV)wave V5.70 ms450 nV amplitudeI–V interval4.20 ms(normal: ≤ 4.5 ms)generatorsI: distal CN VIIIII: proximal CN VIIIIII: cochlear nucleusIV: superior oliveV: lateral lemniscus / IC
scenario:

Normal ABR: five distinct peaks, wave V largest. Used for threshold estimation and neurodiagnostic interpretation. The ABR is recorded by signal-averaging hundreds of trials of a click-evoked scalp potential. Wave V is the most robust peak across subjects and is used clinically for two purposes: (1) threshold estimation, by lowering the stimulus level in 10-dB steps until wave V disappears — the ABR threshold typically sits within 10–15 dB of the behavioural threshold; (2) neurodiagnostic interpretation, where the I–V interpeak interval, side-to-side asymmetry, and absolute latencies localise pathology along the brainstem pathway. The 0.03 ms/dB latency-intensity slope reflects the cochlear amplifier's compressive nonlinearity at the response generator site.

What the ABR is used for

The ABR has two distinct clinical roles that share the same recording but ask different questions of it.

Threshold ABR

The audiologist starts at a high stimulus level (80–90 dB nHL) where wave V is clear, then steps down in 10–20 dB decrements, recording 1500 trials at each level, until wave V can no longer be reliably identified above the noise floor. The ABR threshold is the lowest level at which wave V is present. In normal-hearing adults this is typically 0–20 dB nHL; the ABR threshold correlates with the behavioural threshold for the stimulus’s effective frequency to within 10–15 dB.

The “effective frequency” caveat matters: a click stimulus has a flat broadband spectrum but the cochlea’s response is dominated by the high-frequency basal places (where traveling-wave speed is fastest and synchronisation is tightest). Click-ABR thresholds therefore reflect hearing at roughly 2–4 kHz, not low-frequency hearing. To get frequency-specific thresholds the click is replaced with a tone-burst — a brief (5 ms Blackman-windowed) sinusoidal pulse at a specific frequency. Tone-burst ABR works at 500, 1000, 2000, 4000 Hz; the synchronisation is weaker than for clicks (especially at 500 Hz) and the responses are smaller and need more averaging, so a full tone-burst ABR battery takes considerably longer than a click ABR.

Threshold ABR is the gold-standard objective test in three populations:

Neurodiagnostic ABR

The same recording, interpreted differently, becomes a neurological localisation test. The key metrics:

The retrocochlear ABR pattern

Before MRI became universal, ABR was the screening test for vestibular schwannoma (acoustic neuroma) — a benign tumour of CN VIII that, even at small size, compresses the auditory nerve and slows its conduction. The classical signs:

Modern MRI has displaced ABR as the primary screen — a small vestibular schwannoma may be MRI-positive but ABR-normal, especially with intracanalicular tumours of < 1 cm. But MRI is expensive and not always accessible; ABR remains useful as a low-cost screen, an intra-operative monitor during posterior-fossa surgery, and a confirmatory test in patients with MRI contraindications.

Recording setup

A clinical ABR system records three channels at minimum:

Electrodes are silver-silver-chloride disposable adhesive pads, scrubbed with abrasive prep gel to lower skin impedance below 5 kΩ per electrode. The signal is differentially amplified with gain of about 100,000 (the response is ~0.3 µV; the input voltage to the digitiser needs to be in the tens of millivolts), bandpass filtered 100–3000 Hz, digitised at 25 kHz or higher, and time-locked averaged in software with respect to the stimulus trigger.

Stimulus delivery uses an insert earphone — a tube-coupled transducer inserted into the ear canal with a soft eartip. The insert earphone has two advantages over supra-aural headphones: it provides about 70 dB of interaural attenuation (so cross-hearing is not an issue, eliminating the need for masking in most cases) and it shifts the stimulus arrival by about 0.9 ms (the tube transit time), separating the electrical stimulus artefact from the response — a critical refinement for high-stimulus-level recordings.

Pediatric ABR vs adult ABR

Newborns and infants have ABRs whose absolute latencies are longer (wave V at term: ~7.0 ms at 80 dB nHL, vs ~5.7 ms in adults) and whose interpeak intervals are still maturing. The brainstem myelination process continues for the first 18 months of life; wave V latency reaches adult values around 18 months. Pediatric ABR norms are age-specific: a 3-month-old with a wave V latency of 6.5 ms is normal; the same latency in an 18-month-old is abnormal.

The other practical difference is sedation. An adult can lie still for 60 minutes of ABR recording. A 6-month-old cannot. Pediatric ABR for diagnostic purposes is performed under natural sleep (for infants under 6 months who will sleep cooperatively after a feed) or under sedation (chloral hydrate historically, propofol increasingly). Sedation requires anesthesia involvement and dramatically increases the cost and risk of the test — the major reason ASSR (next lesson) has been pursued so aggressively as a faster, multi-frequency alternative.

The history — Jewett, Williston, and the discovery of the ABR

The ABR was discovered by Donald Jewett and John Williston at UCSF, published in their 1971 Brain paper Auditory-evoked far fields averaged from the scalp of humans. Earlier work by Sohmer and Feinmesser (1967) had shown that brief sound-evoked potentials could be recorded from the human scalp with averaging, but Jewett and Williston’s contribution was to systematically characterise the waveform — naming the five peaks I through V in the order of their appearance — and to relate them to brainstem anatomy by analogy to cat physiology that Buchwald and colleagues had developed in parallel.

The five-wave naming convention is universal and has the unusual property of being correct: each peak does correspond to a discrete brainstem generator, with reasonable specificity (wave V being the most reliable, generated by the lateral lemniscus rather than the inferior colliculus per se, but close enough that the textbook attribution holds). Wave V latency is the single most useful number in the ABR battery and has remained so for 55 years.

Clinical ABR moved quickly into audiology. Hyde and Riko (1994) and others established the ABR threshold-behavioural threshold correlations that underpin its diagnostic use. Stapells (2000) developed the tone-burst ABR protocol that is now standard for frequency-specific objective threshold estimation in pediatrics. Selters and Brackmann (1977) and Eggermont, Don, and Brackmann (1980) established the latency criteria for vestibular schwannoma screening that dominated retrocochlear diagnostic practice from 1980 until MRI scaling in the 1990s.

The ABR has remained essentially unchanged in its clinical implementation for forty years — the same five peaks, the same threshold criterion, the same Cz-mastoid montage. The technique has been so stable because the underlying neurophysiology hasn’t changed: it is, like Carhart’s audiogram, a settled clinical instrument.

The next lesson covers the two younger evoked-potential techniques that complement the ABR: the auditory steady-state response, which provides parallel frequency-specific threshold measurement, and the cortical auditory evoked potential, which probes higher-level auditory processing.