8.7 The free-energy principle, honestly
Karl Friston’s free-energy principle (FEP) is the most ambitious framework in this space. It claims that any self-organizing system that maintains its existence over time must, mathematically, minimize a quantity called variational free energy, which approximates the negative log-evidence of its sensory observations under its generative model. Under the FEP, perception, action, learning, and even biological homeostasis are all special cases of free-energy minimization.
The variational free energy can be written, for a system with hidden states and observations and a generative model and a recognition distribution , as
▶ Derivation: variational free energy as a bound on negative log-evidence
Suppose we want to compute the log-evidence for our observations under a generative model . This is the quantity we’d like to maximize — it tells us how well our model explains the data.
Computing exactly requires integrating over all hidden states , which is intractable.
Introduce an approximate posterior , our best current guess at . Multiply and divide inside the integral by :
By Jensen’s inequality (log is concave), :
This lower bound is the evidence lower bound (ELBO). The variational free energy is its negative:
Then is an upper bound on negative log-evidence: , with equality when .
Equivalently, can be rewritten as
making it the Kullback-Leibler divergence between the approximate and true posteriors, minus log-evidence. Minimizing does two things: (i) brings closer to the true posterior, (ii) maximizes log-evidence (model fit).
In the FEP, the brain is hypothesized to perform inference and learning by minimizing variational free energy. Perception minimizes over (refining the posterior); action minimizes over observations (changing the world to match predictions). ∎
The first term is the Kullback-Leibler divergence between the recognition distribution and the true posterior — a measure of how good the brain’s approximation is. The second term is the negative log-evidence. Minimizing is mathematically equivalent to (a) bringing the recognition distribution closer to the true posterior, and (b) maximizing the evidence the model has for its observations.
This is, with the right interpretation, a perfectly reasonable framework. It contains predictive coding as a special case (a Gaussian approximation under particular assumptions). It is also the subject of substantial criticism, ranging from claims of unfalsifiability (Bowers & Davis 2012, in Psychological Bulletin) to concerns about how the principle is operationalized in different contexts. I will not adjudicate this controversy here. The principle is productive — it has generated useful predictions and useful theory — and it is also not yet a settled brain theory in the way that, say, the wave equation is a settled equation. Treat it as a research program, not a result.