4.4 The place map
The traveling-wave model in 4.3 told us that a tone of frequency peaks at a particular location on the basilar membrane — the one whose local natural frequency matches the input. What it did not tell us is which location, in millimeters from the stapes. For that we need an empirical fit to data, because the cochlea’s stiffness profile is not given to us a priori; it has to be measured.
The classical fit is the Greenwood function. Donald Greenwood proposed it in 1961 and refined it in 1990 once direct cochlear measurements had become available. For the human cochlea, it is
where is measured from the base (the stapes end), mm is the total length of the unrolled membrane, and the constants are
▶ Derivation: the log-frequency form from uniform critical-band spacing
The Greenwood function is empirical — its constants are fit to measured cochlear data. But its functional form has a clean derivation from a single perceptual assumption.
Assumption: equal lengths along the basilar membrane correspond to equal log-frequency intervals (octave-uniform tonotopy). This is observed experimentally and is consistent with what we know about pitch perception — pitches an octave apart sound equally far apart regardless of absolute frequency.
Mathematically, this says
for some positive constant (the negative sign because decreases as increases from base to apex). Integrating:
so , or equivalently . Writing to make the exponent dimensionless,
With corresponding to the basal end at the high-frequency limit, we can rewrite this as
Absorb into a single constant , and we have
This is the leading behavior of the Greenwood function — the simple log law. The correction term is added empirically to account for deviations near the apex of the cochlea, where the simple log law no longer fits the data; specifically, the cochlear apex has nontrivial hydrodynamics (the helicotrema affects boundary conditions, and the membrane is thicker and more loosely supported). The empirical pushes the apex’s predicted frequency down toward 20 Hz rather than the few-Hz floor the pure log would give.
Putting it all together:
The constants Hz, , are Greenwood’s best fits for human cochleas.
This is the Greenwood function. It is the bridge between cochlear anatomy (a position ) and cochlear physiology (a characteristic frequency ).
Plug in the endpoints. At (base, near the stapes), Hz — close to the high-frequency limit of human hearing. At (apex, helicotrema), Hz — the low end. The Greenwood function smoothly interpolates between these limits over the 35 mm in between.
For most of the membrane, the term is a small correction, and the function reduces to its leading behavior . This is exponential in — which means is linear in . Equal distances along the basilar membrane correspond to equal log-frequency intervals. Every millimeter is worth the same fraction of a decade — decades per mm, or about 0.2 octaves per mm. The whole 35 mm spans roughly decades, somewhat more than 10 octaves.
The interactive below renders the place map. Drag the frequency slider, or drag the place slider — they are wired together. Press play tone to hear a pure sine at the current frequency, so you can map what you hear to a literal place along your basilar membrane.
- octave above 20 Hz
- 5.6
- distance from apex
- 14.0 mm
Music and speech on the cochlea
Musical pitches, with their octave-based intervals, are mapped uniformly along the cochlea. An octave is an octave anywhere on the membrane. The 88 keys of a piano span about seven and a half octaves, and they would lay out from roughly 30 mm down the cochlea (the lowest A0, 27.5 Hz) to roughly 5 mm from the stapes (the highest C8, 4186 Hz) — almost the full length of the basilar membrane.
Speech, by contrast, is not log-uniform. The phonemes of “Hey Dr. Miles!” cluster into a relatively narrow band. The vowel formants (F1 around 500–1000 Hz, F2 around 1500–2500 Hz) live in the middle, the broadband /h/ aspiration and the /m/ nasal resonance straddle them, and the fricative energy of /s/, /z/ extends into the basal third above 4 kHz. Most of speech is decoded in the basal half of the cochlea. High-frequency hair cells are the first to die under noise exposure and with age, and so the fricative consonants and the unvoiced parts of speech are the first to fade in mild hearing loss. The sentence “Hey Dr. Miles!” becomes audible but unintelligible — every vowel is heard, every consonant is smeared.
The classical audiology picture of this is the speech banana: the phonemes of conversational speech plotted in the frequency-intensity plane. The banana names the shape; the phonemes inside it are what we hear when someone talks at conversational level. Switch the listener to mild or moderate and watch which phonemes fall off the chart first: the vowels persist into severe loss while the fricatives go early.
We now have what the cochlea sends downstream, at least at the level of where. The Greenwood function plus the traveling wave tells us, for any sound: which places on the basilar membrane will move, when, and how much. We know the map.
What we do not yet have is an explanation for how sharp that map is. In a dead cochlea, the peaks of the traveling wave are broad and unimpressive, with effective values around 3 to 5. In a living cochlea, at low stimulus levels, the peaks are dramatically sharper: values of 30, 50, sometimes past 100. The cochlea is between 10 and 100 times sharper than the passive physics we have written down so far. Something is adding energy to the traveling wave on every cycle, compensating for the damping and amplifying the resonance.
That something is the outer hair cells. The story of how they do it — and the way it reshapes everything we have built up so far — is the next section.