This is the lesson the rest of the bookshelf has been quietly assuming. The “eigenvalues of A” you have already met in the ODE phase plane (Foundations 5.4), in modal analysis of PDEs (Foundations 6.5), in Helmholtz cavity modes (Foundations 6.7), and in the time-independent Schrödinger equation (Foundations 6.8) — all of those are instances of the same idea: a matrix or operator has special directions it merely stretches or contracts rather than rotates. Find those directions, and the linear map’s behaviour becomes transparent.
The defining equation
A non-zero vector v is an eigenvector of the matrix A if it satisfies
Av=λv
for some scalar λ. The number λ is the corresponding eigenvalue. The equation says: when A acts on v, the result is v itself scaled by λ — no rotation, no twisting, just a stretch by factor λ (or a contraction if ∣λ∣<1, or a flip if λ<0).
Most vectors are not eigenvectors: most directions get rotated when A acts. The eigenvectors are the special directions left invariant up to scaling. Their existence and structure are what make many matrix problems tractable.
Why eigenvectors matter
If A is an n×n matrix with n linearly independent eigenvectors v1,…,vn with eigenvalues λ1,…,λn, then any vector x can be written as a linear combination
x=c1v1+c2v2+⋯+cnvn.
Applying A to both sides and using linearity plus the eigenvalue equation:
Ax=c1λ1v1+c2λ2v2+⋯+cnλnvn.
The action of A on x has become componentwise multiplication in the eigenvector basis — each component ck just gets multiplied by its eigenvalue λk. This is the diagonalisation trick: in the right basis, A looks like a diagonal matrix, and diagonal matrices are trivial to power, exponentiate, or invert.
This is the deep mathematical reason mode expansions work in PDEs (Foundations 6.5). The modes are the eigenfunctions of the spatial differential operator; the operator acts on a sum-over-modes by multiplying each mode by its eigenvalue. The same algebraic move that makes a finite-dimensional matrix easy to handle makes an infinite-dimensional linear differential operator solvable.
Finding eigenvalues
To find the eigenvalues of a given matrix A, rearrange the defining equation:
Av=λv⟺(A−λI)v=0.
We want this to have a non-zero solution v (eigenvectors aren’t allowed to be the zero vector — that would be trivial). A square linear system has a non-zero solution exactly when its matrix is singular, i.e. has determinant zero. So the eigenvalues are the roots of
det(A−λI)=0,
which is a polynomial equation in λ of degree n — the characteristic polynomial of A. An n×n matrix therefore has at most n eigenvalues (counted with multiplicity). Once each eigenvalue is known, the corresponding eigenvector is found by solving the homogeneous linear system (A−λI)v=0 — Gaussian elimination from 4.3, with a singular matrix on purpose.
For a 2×2 matrix the characteristic polynomial is a quadratic:
det(A−λI)=λ2−(trA)λ+detA,
where trA=a11+a22 is the trace (sum of diagonal entries). The quadratic formula gives the two eigenvalues directly:
λ±=2trA±(2trA)2−detA.
This is exactly the formula that appears in the ODE phase-plane analysis with A as the linearised flow matrix; the sign of the discriminant determines whether the eigenvalues are real distinct (overdamped), real repeated (critical), or complex conjugate (underdamped, spirals).
Worked example: a 2x2 matrix by hand
▶Worked example: every step, no shortcuts
The problem. Find the eigenvalues and eigenvectors of
A=(3113).
Step 1 — Set up the characteristic equation.A−λI is
(3−λ113−λ),
and the determinant is
det(A−λI)=(3−λ)2−1⋅1=(3−λ)2−1.
Step 2 — Solve. Expand (3−λ)2−1=λ2−6λ+9−1=λ2−6λ+8=0. Quadratic formula or factoring:
λ2−6λ+8=(λ−4)(λ−2)=0.
So λ1=4 and λ2=2. Two distinct real eigenvalues.
(Cross-check: trace = 3+3=6=λ1+λ2. Determinant = 9−1=8=λ1λ2. ✓ The trace always equals the sum of eigenvalues; the determinant always equals their product.)
Step 3 — Find the eigenvector for λ1=4. Solve (A−4I)v=0:
A−4I=(−111−1).
The two rows are negatives of each other (as expected — the matrix is singular). The single non-trivial equation is −v1+v2=0, i.e. v2=v1. Any non-zero v=(1,1) works. Conventionally we normalise:
v1=21(11).
Step 4 — Find the eigenvector for λ2=2. Solve (A−2I)v=0:
A−2I=(1111).
The non-trivial equation is v1+v2=0, i.e. v2=−v1. Take v=(1,−1) and normalise:
Step 6 — Notice. The two eigenvectors are orthogonal: v1⋅v2=21(1⋅1+1⋅(−1))=0. This is not a coincidence — the matrix A here happens to be symmetric (AT=A), and symmetric matrices always have orthogonal eigenvectors. That fact is the spectral theorem for symmetric matrices, which we will meet in full generality in 4.6. It is what makes self-adjoint operators in PDEs so well-behaved.
When eigenvalues are complex
Not every matrix has real eigenvalues. The matrix
R=(01−10)
is a 90° rotation. Its characteristic polynomial is λ2+1=0, with roots λ=±i. No real direction is preserved under a 90° rotation — every arrow turns ninety degrees. The eigenvalues are complex, and the corresponding eigenvectors live in C2 rather than R2. The complex eigenvectors are useful: they let you express the rotation as componentwise complex multiplication in the right basis, recovering the connection to complex exponentials from Foundations 3.
In acoustics and elsewhere, complex eigenvalues typically signal oscillation — they appear in the underdamped regime of the damped oscillator, in the spiral fixed points of Foundations 5.4, and in any time-evolution problem with rotational character.
Power iteration: finding eigenvectors numerically
For large matrices the characteristic polynomial is impractical to solve directly. The standard numerical method instead is power iteration: pick an arbitrary unit vector v0 and repeatedly apply A, renormalising at each step:
vk+1=∥Avk∥Avk.
Generically, vk converges to the eigenvector associated with the eigenvalue of largest absolute value (the dominant eigenvalue). The intuition: writing v0=∑civi in the eigenvector basis, applying A many times gives Akv0=∑ciλikvi. Whichever λi is largest in absolute value, its term grows fastest, and after enough iterations the sum is dominated by that one mode.
Once the dominant eigenvector is known, the dominant eigenvalue follows from the Rayleigh quotientvTAv/vTv, which approaches the dominant eigenvalue as vk approaches its eigenvector.
preset:
Each click of iterate applies A to the current unit vector and renormalises. Generically the result converges to the dominant eigenvector — drawn as the dashed gold line for reference when the eigenvalues are real. The Rayleigh quotient vᵀA v / vᵀv approaches the dominant eigenvalue at the same time. The complex eigenvalues preset shows what happens with no real eigenvector to attract toward: the vector spins around indefinitely. The shear (degenerate) preset has a single repeated eigenvalue and one eigenvector — convergence still happens but is slower.
Pick a matrix preset, set the starting direction θ0, and step the iteration. Watch the vector swing toward the dominant eigenvector (the dashed gold line). The Rayleigh quotient on the side converges to the dominant eigenvalue in parallel. The complex eigenvalues preset shows what happens when no real eigenvector exists — the iteration rotates rather than converging.
Power iteration is the simplest of a family of iterative eigenvalue algorithms that scale to large matrices. The PageRank algorithm Google used to launch in 1998 is power iteration on a billion-by-billion link matrix; the largest eigenvalue’s eigenvector ranks web pages by importance. Modal analysis in audio, image-compression’s principal component analysis (PCA), and many machine-learning algorithms all reduce, at the bottom, to “find the dominant eigenvectors of this matrix.”
⏳The history— From Cayley to Hilbert: a century building the spectral theorem
Matrix algebra as we know it was assembled by Arthur Cayley and James Joseph Sylvester in the 1850s in England. Cayley’s 1858 Memoir on the Theory of Matrices defined matrix addition, multiplication, and the characteristic polynomial — the equation det(A−λI)=0 from this lesson. Sylvester coined the word “matrix” in 1850 and introduced “discriminant” and “minor” along with much of the modern vocabulary. The two were friends and lifelong correspondents; the era is sometimes called the Cayley–Sylvester period of algebra.
The eigenvalue–eigenvector machinery was fully understood for finite matrices by the 1880s. The leap to infinite dimensions — operators on function spaces, the natural home of PDEs and quantum mechanics — was made by David Hilbert in the early 1900s, in his work on integral equations. Hilbert’s six papers from 1904–1910 established what we now call Hilbert space, and the proof that self-adjoint operators on a Hilbert space have a complete orthonormal eigenbasis is the spectral theorem, the deepest result in the chain. The full machinery was reformulated and extended by Hilbert’s student John von Neumann in the 1930s, providing the mathematical foundation that Werner Heisenberg’s matrix mechanics and Erwin Schrödinger’s wave mechanics needed to be the same theory. Eigenvalues, in other words, ran the central arc of mathematical physics from 1850 to 1930.
What we use this for
A short tour of where eigenvalues appear in the bookshelf:
ODE phase-plane stability (Foundations 5.4) — the matrix A that linearises a system near a fixed point has eigenvalues whose real parts determine stability and whose imaginary parts determine oscillation.
Mode catalogue of a bounded domain (Foundations 6.5) — the spatial differential operator (with boundary conditions) has eigenfunctions Xn and eigenvalues −kn2; the time evolution decomposes mode by mode.
Helmholtz cavity modes (Foundations 6.7) — exactly an eigenvalue problem: ∇2ϕ=−k2ϕ with boundary conditions, eigenvalues kmn2 giving cavity frequencies ωmn=ckmn.
Time-independent Schrödinger equation (Foundations 6.8) — H^ψ=Eψ is the eigenvalue problem; energy eigenstates are the eigenvectors of the Hamiltonian operator.
The principal modes of vibration of a coupled-oscillator system, like a chain of masses (treated in Sound 3.1) — solving the matrix system gives normal modes as eigenvectors and frequencies as ∣λ∣.
Statistical room acoustics (Sound 7.8) — counting the density of cavity eigenvalues per unit frequency is what gives the Schroeder frequency.
Whenever you see “in the right basis the problem becomes diagonal,” that is eigenvalue analysis happening. The next lesson, 4.5, develops the inner product structure that makes the “right basis” of orthogonal eigenvectors usable for projection — the formal underpinning of Fourier expansion and modal decomposition.