11.3 Random walks and Brownian motion

A random walk is the simplest stochastic process: a particle takes a step in a random direction at each tick of a clock. The construction is trivial; the consequences are deep. The continuum limit of a random walk is Brownian motion, the canonical model of diffusion. The same equation appears in heat conduction, particle physics, finance, and the Brownian-motion lesson of Sound 1.3. All of it descends from “many tiny independent steps.”

The 1-D random walk

Consider a particle on the integer line, starting at X0=0X_0 = 0. At each time step n=1,2,3,n = 1, 2, 3, \ldots, the particle takes a step of size 1 in a random direction:

Xn  =  Xn1+ξn,ξn=±1 with equal probability.X_n \;=\; X_{n-1} + \xi_n, \qquad \xi_n = \pm 1 \text{ with equal probability}.

The steps ξn\xi_n are independent and identically distributed (i.i.d.) Bernoulli-like random variables with E[ξn]=0\mathbb{E}[\xi_n] = 0 and Var[ξn]=1\mathrm{Var}[\xi_n] = 1.

The position after NN steps is

XN  =  n=1Nξn.X_N \;=\; \sum_{n=1}^N \xi_n.

By linearity of expectation:

E[XN]  =  0.\mathbb{E}[X_N] \;=\; 0.

The walker has no preferred direction. But the spread about zero is non-trivial:

Mean-square displacement: ⟨X_N²⟩ = N

We want E[XN2]\mathbb{E}[X_N^2]. Expand the square:

XN2  =  (n=1Nξn)2  =  n=1Nξn2  +  2m<nξmξn.X_N^2 \;=\; \left( \sum_{n=1}^N \xi_n \right)^2 \;=\; \sum_{n=1}^N \xi_n^2 \;+\; 2 \sum_{m < n} \xi_m \xi_n.

Take expectations:

E[XN2]  =  n=1NE[ξn2]  +  2m<nE[ξmξn].\mathbb{E}[X_N^2] \;=\; \sum_{n=1}^N \mathbb{E}[\xi_n^2] \;+\; 2 \sum_{m < n} \mathbb{E}[\xi_m \xi_n].

Each ξn2=(±1)2=1\xi_n^2 = (\pm 1)^2 = 1 always, so E[ξn2]=1\mathbb{E}[\xi_n^2] = 1. For the cross terms, independence of ξm\xi_m and ξn\xi_n (for mnm \neq n) implies E[ξmξn]=E[ξm]E[ξn]=00=0\mathbb{E}[\xi_m \xi_n] = \mathbb{E}[\xi_m]\, \mathbb{E}[\xi_n] = 0 \cdot 0 = 0. So

E[XN2]  =  n=1N1  =  N.\mathbb{E}[X_N^2] \;=\; \sum_{n=1}^N 1 \;=\; N.

The root-mean-square (RMS) displacement is therefore

E[XN2]  =  N.\sqrt{\mathbb{E}[X_N^2]} \;=\; \sqrt{N}.

After NN steps, the typical walker has wandered a distance N\sim \sqrt{N} from the start — not NN. This is the universal signature of diffusion: distance grows as the square root of time (when each step happens in one tick).

The variance grows linearly in NN, but the distance grows as N\sqrt{N}. A million-step walker has wandered 1000\sim 1000 steps from the start, not a million. Random motion is dramatically slower than directed motion — that’s the whole point of diffusion.

The same calculation in higher dimensions: each component is an independent random walk; the squared distance is the sum of squared components; E[XN2]=N\mathbb{E}[|\mathbf{X}_N|^2] = N for any dimension. The walker spreads as N\sqrt{N} regardless of how many directions it can move in.

5 1-D walks (step vs. position)±√Nfinal-position distribution (5000 walks)-3030RMS displacement: 9.97 ≈ √N = 10.00
mode:

Each step is independent and equally likely to go left/right (1-D) or in one of the four cardinal directions (2-D). The expected position is zero — the walks have no preferred direction — but the expected *squared* displacement grows linearly with the number of steps: ⟨X_N²⟩ = N. So the typical walker after N steps is a distance ≈ √N from the origin, marked by the red dashed envelope or circle. The histogram on the right shows the final-position distribution over 5000 independent walks; it is approximately Gaussian by the CLT.

The interactive shows several walkers simultaneously, plus a histogram of where 5000 ensemble walkers end up after NN steps. Three things to absorb:

The continuum limit: Brownian motion

Take the random walk and let the step size shrink and step rate grow together: Δx0\Delta x \to 0, Δt0\Delta t \to 0, with Δx2/Δt=2D\Delta x^2 / \Delta t = 2 D held fixed (the diffusion coefficient DD). The discrete random walk becomes Brownian motion B(t)B(t), a continuous-time stochastic process with three defining properties:

The variance grows linearly in time: E[B(t)2]=2Dt\mathbb{E}[B(t)^2] = 2 D t. The RMS displacement after time tt is 2Dt\sqrt{2 D t}. The probability density at time tt is the heat-equation Green’s function:

p(x,t)  =  14πDtexp ⁣(x24Dt),p(x, t) \;=\; \frac{1}{\sqrt{4 \pi D t}}\, \exp\!\left( -\frac{x^2}{4 D t} \right),

a Gaussian spreading at rate 2Dt\sqrt{2 D t}. This is one and the same as the heat equation solution we built earlier — Brownian motion is the stochastic dual of the heat equation, and the heat equation governs the probability density of a Brownian particle.

This duality is one of the most beautiful results in mathematical physics. The same equation tp=Dx2p\partial_t p = D \partial_x^2 p describes:

All have the same generator — the second-derivative Laplacian — and all are continuum limits of i.i.d. random walks.

The Einstein relation

Einstein’s 1905 derivation of the diffusion equation from kinetic theory established a remarkable connection. If a Brownian particle of mass mm is suspended in a fluid at temperature TT, and the fluid exerts a viscous drag force ζv-\zeta v on the particle (with ζ\zeta a friction coefficient), then the diffusion coefficient is

  D  =  kBTζ.  \boxed{\;D \;=\; \frac{k_B T}{\zeta}.\;}

This is the Einstein relation (also called the Stokes–Einstein relation when ζ=6πηa\zeta = 6 \pi \eta a for a spherical particle of radius aa in a fluid of viscosity η\eta). It says: the random motion of the particle (parametrised by DD) and the deterministic dissipation of its velocity (parametrised by ζ\zeta) are linked by temperature. Fluctuation and dissipation are two faces of the same underlying microscopic motion.

The Einstein relation was the experimental key to Avogadro’s number. Jean Perrin’s 1908 measurements of Brownian motion in pollen grains let him compute kBk_B, and hence NA=R/kBN_A = R/k_B, settling the long-standing question of whether atoms actually existed. (Modern value: NA6.022×1023N_A \approx 6.022 \times 10^{23}.) Einstein’s 1905 paper on Brownian motion was one of his “annus mirabilis” four, alongside special relativity, the photoelectric effect, and mass-energy equivalence.

Brownian motion has unusual properties

Brownian motion is continuous (you can plot a path without lifting the pen) but nowhere differentiable — the velocity is undefined at every instant. The classical derivative of a Brownian path diverges; only stochastic notions of “derivative” (Itô calculus, Stratonovich calculus) make sense.

The total path length of Brownian motion over any time interval is infinite: a walker covers infinite distance in finite time. This sounds paradoxical until you remember that “distance” includes the back-and-forth wiggles — the displacement is finite (and grows as t\sqrt{t}), but the arc length of the wiggle is unbounded.

These pathologies are why classical calculus does not apply to Brownian motion, and why stochastic calculus is its own subject. Most physical applications get away with averaged statistics — mean, variance, autocorrelation — without ever invoking pathwise calculus, and the bookshelf follows the same shortcut.

What we use this for

Random walks and Brownian motion underwrite a lot of the bookshelf:

The next lesson, 11.4 — Poisson processes, turns to counting processes — random events arriving at a fixed rate — and the Poisson distribution that governs their statistics.