3.1 The impedance problem

Sound travels well in air because the air molecules can move easily. Sound travels well in water for the same reason at the molecular level, but for a different reason at the macroscopic level: water is dense and incompressible, so any compression propagates very rapidly. The relevant property is acoustic impedance, defined as the ratio of pressure to particle velocity in a propagating wave: Z=p/vZ = p/v.

For air at room temperature, Zair415Pas/mZ_\text{air} \approx 415\,\text{Pa}\cdot\text{s/m}. For water — and for the perilymph of the cochlea, which is essentially salt water — Zwater1.5×106Pas/mZ_\text{water} \approx 1.5 \times 10^6\,\text{Pa}\cdot\text{s/m}. The ratio is about 3500. This is enormous.

Consider what happens when a sound wave in air meets a water surface. The pressure has to be continuous across the interface, and so does the particle velocity. But Zwater/Zair=3500Z_\text{water}/Z_\text{air} = 3500 means that for a given pressure, the water moves about 3500 times less than the air on the other side. The acoustic wave cannot push the water hard enough to keep itself going. Most of its energy reflects back.

You can compute exactly how much. The reflection coefficient at a boundary between two media is

R=Z2Z1Z2+Z1.R = \frac{Z_2 - Z_1}{Z_2 + Z_1}.
Derivation: reflection at a boundary between two media

Consider an acoustic plane wave pi=p0ei(ωtk1x)p_i = p_0\, e^{i(\omega t - k_1 x)} traveling rightward in medium 1 (impedance Z1Z_1) toward a boundary at x=0x = 0. At the boundary, some of the wave reflects (prp_r) and some transmits (ptp_t) into medium 2 (impedance Z2Z_2):

pr=Rp0ei(ωt+k1x),pt=Tp0ei(ωtk2x).p_r = R\, p_0\, e^{i(\omega t + k_1 x)}, \quad p_t = T\, p_0\, e^{i(\omega t - k_2 x)}.

Two boundary conditions hold at x=0x = 0:

  1. Pressure is continuous: pi+pr=ptp_i + p_r = p_t at x=0x = 0, giving 1+R=T1 + R = T.
  2. Particle velocity is continuous: vi+vr=vtv_i + v_r = v_t. Using v=p/Zv = p/Z for a forward wave and v=p/Zv = -p/Z for a backward wave: (pipr)/Z1=pt/Z2(p_i - p_r)/Z_1 = p_t/Z_2, so (1R)/Z1=T/Z2(1 - R)/Z_1 = T/Z_2.

From (1), T=1+RT = 1 + R. Substituting into (2):

1RZ1=1+RZ2.\frac{1 - R}{Z_1} = \frac{1 + R}{Z_2}.

Cross-multiplying: Z2(1R)=Z1(1+R)Z_2 (1 - R) = Z_1 (1 + R), so Z2Z1=R(Z1+Z2)Z_2 - Z_1 = R(Z_1 + Z_2), giving

R=Z2Z1Z1+Z2.R = \frac{Z_2 - Z_1}{Z_1 + Z_2}.

Plug in Z1=ZairZ_1 = Z_\text{air} and Z2=ZwaterZ_2 = Z_\text{water}: R0.999R \approx 0.999. The intensity reflection coefficient is R20.9989|R|^2 \approx 0.9989, which means the transmitted power fraction is

T=1R20.0011,T = 1 - |R|^2 \approx 0.0011,

or in decibels, about 29.6-29.6 dB. More than 99.9% of incoming sound power would reflect off the cochlear fluid if it were simply exposed to air. A sound at 60 dB SPL in air would arrive at the cochlea at the equivalent of 30 dB SPL — most of speech would fall below the threshold of audibility.

This is what the middle ear has to fix.