4.2 The stiffness gradient
The basilar membrane is, dimensionally, a strip of tissue that runs the length of the cochlea — about 35 mm uncoiled. Its width and stiffness change along that length, and the change is not subtle.
At the base, near the stapes, the membrane is narrow and stiff — roughly 100 μm across. At the apex, near the helicotrema, the membrane is wide and floppy — roughly 500 μm across. The width grows monotonically by about a factor of five. The stiffness changes monotonically too, but by orders of magnitude: roughly a factor of one hundred between base and apex.
- position
- 12.3 mm from base
- local width
- 240 μm
- local stiffness
- 20.0 (relative)
- characteristic frequency
- 1.8 kHz
This is the single most important fact about cochlear mechanics, and it is a fact that does not exist in your everyday intuition for membranes. A drumhead is everywhere the same: stretch a single material to a single tension, and the whole drumhead has one resonance. The basilar membrane is the opposite. Every place along its length is mechanically different from every other place. You can think of it, very roughly, as a continuum of tiny drumheads stacked side by side, each with its own preferred frequency.
The local oscillator
Consider a small element of the membrane at coordinate , where runs from at the base to at the apex. Let be the transverse displacement of the membrane at that point — perpendicular to its surface. To a first approximation, the cross-sectional motion of that element behaves like a damped, driven simple harmonic oscillator with local effective mass per unit length , local effective stiffness per unit length , and local damping :
where is the local pressure difference across the membrane — the force per unit area that drives it. (We will write down where comes from in 4.3; for now, treat it as the input.)
▶ Derivation: where the damped, driven oscillator equation comes from
We start from Newton’s second law applied to a small element of basilar membrane at position .
The element has mass per unit area (units: kg/m²). At the instant in question, the element’s transverse displacement is , its velocity is , and its acceleration is .
Three forces per unit area act on the element:
-
Restoring force from the membrane’s own elasticity. For small displacements, this is linear in , opposing the displacement: . The constant is the local stiffness per unit area (units: N/m³).
-
Damping force from internal viscoelastic losses and from the surrounding fluid’s viscosity. For small motions, this is linear in velocity, opposing motion: . The constant is the local damping coefficient per unit area (units: N·s/m³).
-
Driving pressure from the fluid above and below the membrane: (units: N/m² = Pa).
Newton’s second law per unit area says: .
Rearranging:
This is the equation in the main text. It is the standard linear damped driven oscillator, written per unit area of the membrane.
The local natural angular frequency of this element, in the absence of damping and driving, is
▶ Derivation: the natural frequency of a spring-mass system
Consider the equation with damping and driving turned off:
or equivalently . This says: acceleration is proportional to displacement, with the proportionality constant negative. Functions whose second derivative equals minus a constant times themselves are sines and cosines.
Try the ansatz . Then . Substituting into the equation: , so , so
The undriven, undamped oscillator oscillates at this frequency forever (in the absence of energy loss). This is the natural frequency .
The mass per unit length varies modestly along . The stiffness per unit length varies by about two orders of magnitude. The result is that drops by about three orders of magnitude between base and apex, sweeping from roughly at the basal end to at the apex. In one elegant move, the cochlea has produced a one-dimensional frequency axis, embedded in its geometry, spanning the entire range of human hearing.
I want to flag two things about the model I just wrote down, because I will sin briefly against both and then earn them back.
First, this oscillator equation treats each place as independent. It pretends the membrane is a row of disconnected springs, each tuned to its own frequency. It is not: the perilymph mechanically couples adjacent places, and a disturbance at one drives motion at nearby . That coupling is exactly what produces a traveling wave rather than a set of local resonances, and it is the subject of 4.3.
Second, I have just put a damping term in the equation without saying anything about how big it should be. The size of controls the sharpness of each local resonance — the Q factor — and it turns out that the passive damping of the basilar membrane is large enough to produce only modestly sharp tuning. The exquisite frequency selectivity that human hearing actually has comes from somewhere else: from an active element, the outer hair cell, which injects energy back into the membrane on each cycle and effectively reduces the damping. We will spend 4.5 on this, and it will reframe everything in this section.
The driven oscillator at steady state
That oscillator equation deserves more than a passing acquaintance. It is the central equation of damped, driven harmonic motion, and its steady-state behavior is a story worth telling slowly. Drop the position dependence for a moment and ask what one place along the cochlea is doing when a sinusoidal pressure drives it. The equation has a closed-form steady-state solution: a sinusoid at the same frequency, scaled in amplitude and shifted in phase relative to the input. The amplitude and phase are
where , and .
▶ Derivation: amplitude and phase of the driven oscillator
We want the steady-state solution of
The trick is to work in complex numbers. Write the driving force as the real part of , and look for a complex solution , then take its real part at the end.
Substituting into the equation:
Dividing through by (since ):
This is a complex number. Its magnitude gives the amplitude scaling, and its argument gives the phase shift.
For the magnitude, use on the denominator:
So
Factor out an from inside the square root, using and :
This is the magnitude in the main text.
For the phase, use :
So the real-part solution is . ∎
Normalize the amplitude to its zero-frequency value , and define the quality factor . The ratio takes the cleanest possible form:
At this ratio is exactly . That is the entire content of resonance: at its natural frequency, an oscillator responds times more strongly than it does at zero frequency. Off resonance, the response falls off, with a bandwidth (full width at half-power) of approximately .
▶ Derivation: peak height and bandwidth in terms of Q
At , the denominator becomes . The amplitude ratio is then .
For the bandwidth, find the frequencies where (the half-power points). Setting the squared ratio to :
Cross-multiplying:
For high (), the resonance is narrow and we can work near . Write where . Then , and . Substituting:
so , giving . The full width between half-power points is . ∎
The interactive below is one of these oscillators, with fixed at 1 kHz. The top plot is on log-log axes — the response curve of one place along the cochlea. The spring-mass-damper diagram and the phasor diagram show the physical and complex-plane views of the same dynamics; the time-domain trace shows the input and steady-state response over four cycles. Drag the sliders and watch what happens at and around resonance.
- f / f₀
- 0.700
- |H(f)| / |H(0)|
- 1.94
- phase lag
- -7.8°
A few things worth noticing as you move the sliders.
Well below , the response amplitude is roughly constant and in phase with the input. The spring dominates: the oscillator is too stiff to care about a slow push. Well above , the amplitude is even smaller, and the response is 180° out of phase with the input — the mass dominates; the oscillator is too sluggish to keep up. The transition between these two regimes happens at , where the phase passes through exactly and the amplitude peaks at . The Bode-style curves you are looking at are the same curves that appear in every electrical-engineering textbook on filters, because a passive electrical filter is mathematically the same object as a damped, driven mechanical oscillator.
controls only the sharpness and height of the peak. The low- and high-frequency asymptotes are unaffected. At to — the kind of a single, isolated, passive basilar-membrane segment would show — the resonance is broad and unimpressive. The remarkable selectivity of the real cochlea, where effective values can reach , , even past at low signal levels, is the surprise that 4.5 will explain.
This is also where the place-coding seed becomes concrete. Every location along the basilar membrane is one of these oscillators. The local is its natural frequency; the local damping sets its . A pure tone of frequency drives a response that peaks at the location whose , and decays away from that place. Every frequency in “Hey Dr. Miles!” — the /h/ aspiration, the /m/ nasal, the formants of /aɪ/ in Miles — will end up driving membrane motion at a particular location, and the brain will read which locations moved much the same way it reads which key was pressed on a piano. The exact map between frequency and place — the curve that gives for the human cochlea — has a name (the Greenwood function) and we will derive it in 4.4. But first we have to honor the other confession from a few paragraphs ago: the basilar membrane is not actually a set of independent oscillators. The fluid couples them, and the result of that coupling is the wave we are about to meet.