Univariate

Preliminaries

Consider the case of observing $N$ independent samples from a $Normal (μ, σ^{2})$ distribution. Suppose we know the true value of $σ^{2}$ , and we are interested in determining the posterior distribution of $μ$ . It is conventional to place a conjugate normal prior on $μ$ . Our model is:

$\begin{array}{rcl} y_{i} & \overset{\text{iid}}{\sim} & Normal (μ, σ^{2}) \\ μ & \sim & Normal (μ_{0}, τ^{2}) \end{array}$

In Bayesian statistics, the posterior is proportional to the likelihood times the prior. Because all observations are independent, our likelihood is the product of $N$ Normal density functions: one for each $y_{i}$ . The prior then provides another Normal density function term. After simplifying and dropping terms that are not functions of $μ$ , we end up with a posterior distribution for $μ$ proportional to

$\exp \left\{ - \frac{1}{2 σ^{2}} \sum_{i = 1}^{N} (y_{i} - μ)^{2} \right\} \exp \left\{ - \frac{1}{2 τ^{2}} (μ - μ_{0})^{2} \right\}$

The first term is the likelihood, and the second term is the prior. Because we desire a posterior that is a simple function of $μ$ , we need to gather all the terms that include $μ$ together; as the posterior is written above, $μ$ is scattered across $N + 1$ terms. Completing the square is the trick that will allow us to gather all the $μ$ terms into one.
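To make the unnormalized posterior concrete, here is a minimal Python sketch of its logarithm, computed directly as the likelihood exponent plus the prior exponent. The function name `log_unnormalized_posterior`, the argument names, and the example data are illustrative assumptions, not part of the derivation.

```python
def log_unnormalized_posterior(mu, y, sigma2, mu0, tau2):
    """Log of likelihood times prior, up to an additive constant in mu."""
    # Likelihood exponent: -(1 / (2 sigma^2)) * sum_i (y_i - mu)^2
    log_likelihood = -sum((yi - mu) ** 2 for yi in y) / (2.0 * sigma2)
    # Prior exponent: -(1 / (2 tau^2)) * (mu - mu0)^2
    log_prior = -((mu - mu0) ** 2) / (2.0 * tau2)
    return log_likelihood + log_prior


# Hypothetical data and hyperparameters, chosen only for illustration.
y = [1.0, 2.0, 3.0]
value = log_unnormalized_posterior(2.0, y, sigma2=1.0, mu0=0.0, tau2=1.0)
print(value)  # -3.0
```

Working on the log scale mirrors the derivation, which soon focuses only on what is inside the exponent.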

The first step is to expand the squares containing $μ$ . This yields

$\exp \left\{ - \frac{1}{2 σ^{2}} \sum_{i = 1}^{N} (y_{i}^{2} - 2 μ y_{i} + μ^{2}) \right\} \exp \left\{ - \frac{1}{2 τ^{2}} (μ^{2} - 2 μ μ_{0} + μ_{0}^{2}) \right\}$

We now distribute the sum, yielding

$\exp \left\{ - \frac{1}{2 σ^{2}} \left( \sum_{i = 1}^{N} y_{i}^{2} - 2 μ \sum_{i = 1}^{N} y_{i} + N μ^{2} \right) \right\} \exp \left\{ - \frac{1}{2 τ^{2}} (μ^{2} - 2 μ μ_{0} + μ_{0}^{2}) \right\}$
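As a quick sanity check on the expansion, the short Python sketch below compares the sum of squared deviations with its expanded form, $\sum y_{i}^{2} - 2 μ \sum y_{i} + N μ^{2}$. The data and the trial value of `mu` are made up for illustration.

```python
# Hypothetical data and a trial value of mu, chosen only for illustration.
y = [1.0, 2.0, 3.0]
mu = 0.5
n = len(y)

# Left side: the sum of squared deviations as it appears in the likelihood.
lhs = sum((yi - mu) ** 2 for yi in y)

# Right side: the same quantity after expanding and distributing the sum.
rhs = sum(yi**2 for yi in y) - 2.0 * mu * sum(y) + n * mu**2

print(lhs, rhs)  # 8.75 8.75
```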

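Several terms in the expanded expression do not involve $μ$ : the $\sum_{i = 1}^{N} y_{i}^{2}$ term in the likelihood and the $μ_{0}^{2}$ term in the prior. They contribute only a constant factor to the posterior, so they can be dropped. When we drop those terms, combine all remaining terms within one exponent, and then focus only on what is in the exponent, what remains is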

$- \frac{1}{2} (\frac{N}{σ^{2}} μ^{2} + \frac{1}{τ^{2}} μ^{2} - 2 μ \frac{\sum_{i = 1}^{N} y_{i}}{σ^{2}} - 2 μ \frac{1}{τ^{2}} μ_{0})$

We can combine the $μ^{2}$ terms together, and the $μ$ terms together:

$- \frac{1}{2} ((\frac{N}{σ^{2}} + \frac{1}{τ^{2}}) μ^{2} - 2 (\frac{\sum_{i = 1}^{N} y_{i}}{σ^{2}} + \frac{1}{τ^{2}} μ_{0}) μ)$

Finally, for ease of notation, we can use the fact that $\sum_{i = 1}^{N} y_{i} = N \bar{y}$ :

$- \frac{1}{2} ((\frac{N}{σ^{2}} + \frac{1}{τ^{2}}) μ^{2} - 2 (\frac{N}{σ^{2}} \bar{y} + \frac{1}{τ^{2}} μ_{0}) μ)$

After all of this algebraic simplification, the expression inside the parentheses has the form $a x^{2} - 2 b x$ , with $μ$ playing the role of $x$ . We can now “complete the square” to obtain something of the form $(x - c)^{2}$ .

Completing the square

Our first step is to make the notation easier to follow. Let $a = \frac{N}{σ^{2}} + \frac{1}{τ^{2}}$ and $b = \frac{N}{σ^{2}} \bar{y} + \frac{1}{τ^{2}} μ_{0}$ . Using the new, simplified notation, we have

$- \frac{1}{2} (a μ^{2} - 2 b μ)$
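The two coefficients are easy to compute from data. The following is a small Python sketch, assuming the observations are held in a plain list; the helper name `collect_coefficients` and the example values are hypothetical.

```python
def collect_coefficients(y, sigma2, mu0, tau2):
    """Return (a, b) such that the exponent is -0.5 * (a * mu**2 - 2 * b * mu)."""
    n = len(y)
    y_bar = sum(y) / n
    # a collects the coefficients of mu^2 from the likelihood and the prior.
    a = n / sigma2 + 1.0 / tau2
    # b collects the coefficients of mu from the likelihood and the prior.
    b = n * y_bar / sigma2 + mu0 / tau2
    return a, b


# Hypothetical data and hyperparameters, chosen only for illustration.
y = [1.0, 2.0, 3.0]
a, b = collect_coefficients(y, sigma2=2.0, mu0=1.0, tau2=4.0)
print(a, b)  # 1.75 3.25
```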

We can factor the coefficient $a$ of $μ^{2}$ out of the parentheses:

$- \frac{a}{2} (μ^{2} - 2 \frac{b}{a} μ)$

We now add and subtract the same value inside the parentheses. This doesn’t change the value at all, since the terms sum to 0:

$- \frac{a}{2} (μ^{2} - 2 \frac{b}{a} μ + \frac{b^{2}}{a^{2}} - \frac{b^{2}}{a^{2}})$

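The subtracted term, $- \frac{b^{2}}{a^{2}}$ , is not a function of $μ$ . Multiplied by $- \frac{a}{2}$ , it contributes only the constant $\frac{b^{2}}{2 a}$ to the exponent, and hence only a constant factor to the posterior. Because we are working up to proportionality, we can simply drop it: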

$- \frac{a}{2} (μ^{2} - 2 \frac{b}{a} μ + \frac{b^{2}}{a^{2}})$

The terms within the parentheses are of the form $x^{2} - 2 x c + c^{2}$ , which, from the rules learned in algebra, can be simplified to $(x - c)^{2}$ . Applying this to our terms, we obtain:

$- \frac{a}{2} {(μ - \frac{b}{a})}^{2}$
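One way to convince yourself that dropping the constant is harmless is to evaluate both forms of the exponent at several values of $μ$ and check that they differ by the same amount, $\frac{b^{2}}{2 a}$ , every time. The sketch below does this in Python; the function names and the values of `a` and `b` are illustrative assumptions.

```python
def quadratic_form(mu, a, b):
    """Exponent before completing the square: -0.5 * (a * mu^2 - 2 * b * mu)."""
    return -0.5 * (a * mu**2 - 2.0 * b * mu)


def completed_square(mu, a, b):
    """Exponent after completing the square: -(a / 2) * (mu - b / a)^2."""
    return -0.5 * a * (mu - b / a) ** 2


# Hypothetical coefficients, chosen only for illustration.
a, b = 4.0, 6.0

# The difference should equal b^2 / (2a) = 4.5 regardless of mu.
offsets = [
    quadratic_form(mu, a, b) - completed_square(mu, a, b)
    for mu in (-2.0, 0.0, 1.5, 3.0)
]
print(offsets)  # [4.5, 4.5, 4.5, 4.5]
```

Because the offset does not depend on $μ$ , the two exponents describe the same distribution once normalized.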

We have thus completed the square. We are not done, however: this was only the portion of the posterior distribution that was in the exponent. Replacing the terms in the exponent yields:

$\exp \left\{ - \frac{a}{2} {(μ - \frac{b}{a})}^{2} \right\}$

By completing the square, we have revealed that the posterior distribution of $μ$ has the form of a normal distribution with a mean of $b / a$ and a variance of $1 / a$ , or

$μ ∣ y \sim Normal (μ_{n}, σ_{n}^{2})$ where

$\begin{array}{rcl} σ_{n}^{2} = \frac{1}{a} & = & {(\frac{N}{σ^{2}} + \frac{1}{τ^{2}})}^{- 1}, \\ μ_{n} = \frac{b}{a} & = & σ_{n}^{2} (\frac{N}{σ^{2}} \bar{y} + \frac{1}{τ^{2}} μ_{0}) . \end{array}$
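The result translates directly into code. Below is a minimal Python sketch that computes $μ_{n}$ and $σ_{n}^{2}$ from the formulas above and, as a rough cross-check, compares the closed-form mean with one obtained by normalizing the likelihood-times-prior kernel on a grid. The function names, the grid bounds, and the example data are assumptions made for illustration only.

```python
import math


def normal_posterior(y, sigma2, mu0, tau2):
    """Closed-form posterior mean and variance of mu (sigma2 known)."""
    n = len(y)
    y_bar = sum(y) / n
    a = n / sigma2 + 1.0 / tau2            # posterior precision
    b = n * y_bar / sigma2 + mu0 / tau2
    return b / a, 1.0 / a                  # (mu_n, sigma_n^2)


def grid_posterior_mean(y, sigma2, mu0, tau2, lo=-10.0, hi=10.0, steps=20001):
    """Posterior mean from a brute-force grid over mu (cross-check only)."""
    width = (hi - lo) / (steps - 1)
    grid = [lo + i * width for i in range(steps)]
    # Unnormalized posterior: likelihood kernel times prior kernel.
    weights = [
        math.exp(
            -sum((yi - mu) ** 2 for yi in y) / (2.0 * sigma2)
            - (mu - mu0) ** 2 / (2.0 * tau2)
        )
        for mu in grid
    ]
    total = sum(weights)
    return sum(mu * w for mu, w in zip(grid, weights)) / total


# Hypothetical data and hyperparameters, chosen only for illustration.
y = [1.0, 2.0, 3.0]
post_mean, post_var = normal_posterior(y, sigma2=1.0, mu0=0.0, tau2=1.0)
grid_mean = grid_posterior_mean(y, sigma2=1.0, mu0=0.0, tau2=1.0)
print(post_mean, post_var)  # 1.5 0.25
```

In this example the posterior mean of 1.5 lies between the prior mean of 0 and the sample mean of 2, weighted by the precisions $1 / τ^{2} = 1$ and $N / σ^{2} = 3$ .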