
Estimator Variance

As mentioned in §6.12, the pwelch function in Matlab and Octave offers ``confidence intervals'' for an estimated power spectral density (PSD). A confidence interval encloses the true value with probability $ P$ (the confidence level). For example, if $ P=0.99$ , then the confidence level is $ 99\%$ .

This section gives a first discussion of ``estimator variance,'' particularly the variance of sample means and sample variances for stationary stochastic processes.

Sample-Mean Variance

The simplest case to study first is the sample mean:

$\displaystyle \hat{\mu}_x(n) \isdef \frac{1}{M}\sum_{m=0}^{M-1}x(n-m)$ (C.29)

Here we have defined the sample mean at time $ n$ as the average of the $ M$ successive samples up to time $ n$ , i.e., a ``running average''. The true mean is assumed to be the average over an infinite number of samples, such as

$\displaystyle \mu_x = \lim_{M\to\infty}\hat{\mu}_x(n)$ (C.30)

or

$\displaystyle \mu_x = \lim_{K\to\infty}\frac{1}{2K+1}\sum_{m=-K}^{K}x(n+m) \isdefs {\cal E}\left\{x(n)\right\}.$ (C.31)

Now assume $ \mu_x=0$ , and let $ \sigma_x^2$ denote the variance of the process $ x(\cdot)$ , i.e.,

Var$\displaystyle \left\{x(n)\right\} \isdefs {\cal E}\left\{[x(n)-\mu_x]^2\right\} \eqsp {\cal E}\left\{x^2(n)\right\} \eqsp \sigma_x^2$ (C.32)

Then the variance of our sample-mean estimator $ \hat{\mu}_x(n)$ can be calculated as follows:

\begin{eqnarray*}
\mbox{Var}\left\{\hat{\mu}_x(n)\right\} &\isdef & {\cal E}\left\{\left[\hat{\mu}_x(n)-\mu_x \right]^2\right\}
\eqsp {\cal E}\left\{\hat{\mu}_x^2(n)\right\}\\
&=&{\cal E}\left\{\frac{1}{M}\sum_{m_1=0}^{M-1} x(n-m_1)\,
\frac{1}{M}\sum_{m_2=0}^{M-1} x(n-m_2)\right\}\\
&=&\frac{1}{M^2}\sum_{m_1=0}^{M-1}\sum_{m_2=0}^{M-1}
{\cal E}\left\{x(n-m_1) x(n-m_2)\right\}\\
&=&\frac{1}{M^2}\sum_{m_1=0}^{M-1}\sum_{m_2=0}^{M-1}
r_x(\vert m_1-m_2\vert)
\end{eqnarray*}

where we used the fact that the time-averaging operator $ {\cal E}\left\{\cdot\right\}$ is linear, and $ r_x(l)$ denotes the unbiased autocorrelation of $ x(n)$ . If $ x(n)$ is white noise, then $ r_x(\vert m_1-m_2\vert) = \sigma_x^2\delta(m_1-m_2)$ , and we obtain

\begin{eqnarray*}\mbox{Var}\left\{\hat{\mu}_x(n)\right\}
&=&\frac{1}{M^2}\sum_{m_1=0}^{M-1}\sum_{m_2=0}^{M-1}
\sigma_x^2\delta(m_1-m_2)\\
&=&\zbox {\frac{\sigma_x^2}{M}}\\
\end{eqnarray*}

We have derived that the variance of the $ M$ -sample running average of a white-noise sequence $ x(n)$ is given by $ \sigma_x^2/M$ , where $ \sigma_x^2$ denotes the variance of $ x(n)$ . That is, the variance is inversely proportional to the number of samples used to form the estimate. This is how averaging reduces variance in general: when averaging $ M$ independent (or merely uncorrelated) random variables having the same variance, the variance of the average equals the variance of each individual random variable divided by $ M$ .
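As a quick numerical check, the following Octave/Matlab sketch estimates the variance of the $ M$ -sample mean of white noise over many independent trials and compares it to $ \sigma_x^2/M$ . The particular values of M, ntrials, and sigma are arbitrary choices for illustration.

% Monte Carlo sketch: variance of the M-sample mean of white noise.
% The parameter values below are arbitrary choices for illustration.
M = 32;                        % samples per running average
ntrials = 10000;               % number of independent trials
sigma = 2;                     % white-noise standard deviation
x = sigma * randn(ntrials, M); % each row is one M-sample realization
mu_hat = mean(x, 2);           % sample mean of each realization
var_meas = var(mu_hat);        % measured variance of the sample means
var_pred = sigma^2 / M;        % predicted variance sigma_x^2 / M
disp([var_meas, var_pred]);    % the two values should agree closely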


Sample-Variance Variance

Consider now the sample variance estimator

$\displaystyle \hat{\sigma}_x^2(n) \isdefs \frac{1}{M}\sum_{m=0}^{M-1}x^2(n-m) \isdefs \hat{r}_{x(n)}(0)$ (C.33)

where the mean is assumed to be $ \mu_x ={\cal E}\left\{x(n)\right\}=0$ , and $ \hat{r}_{x(n)}(l)$ denotes the unbiased sample autocorrelation of $ x$ based on the $ M$ samples leading up to and including time $ n$ . Since $ \hat{r}_{x(n)}(0)$ is unbiased, $ {\cal E}\left\{\hat{\sigma}_x^2(n)\right\} = {\cal E}\left\{\hat{r}_{x(n)}(0)\right\} = \sigma_x^2$ . The variance of this estimator is then given by

\begin{eqnarray*}
\mbox{Var}\left\{\hat{\sigma}_x^2(n)\right\} &\isdef & {\cal E}\left\{\left[\hat{\sigma}_x^2(n)-\sigma_x^2\right]^2\right\}\\
&=& {\cal E}\left\{[\hat{\sigma}_x^2(n)]^2\right\} - 2\sigma_x^2\,{\cal E}\left\{\hat{\sigma}_x^2(n)\right\} + \sigma_x^4\\
&=& {\cal E}\left\{[\hat{\sigma}_x^2(n)]^2\right\} - \sigma_x^4
\end{eqnarray*}

where

\begin{eqnarray*}
{\cal E}\left\{[\hat{\sigma}_x^2(n)]^2\right\} &=&
\frac{1}{M^2}\sum_{m_1=0}^{M-1}\sum_{m_2=0}^{M-1}{\cal E}\left\{x^2(n-m_1)x^2(n-m_2)\right\}\\
&=& \frac{1}{M^2}\sum_{m_1=0}^{M-1}\sum_{m_2=0}^{M-1}r_{x^2}(\vert m_1-m_2\vert)
\end{eqnarray*}

The autocorrelation of $ x^2(n)$ need not be simply related to that of $ x(n)$ . However, when $ x$ is assumed to be Gaussian white noise, simple relations do exist. For example, when $ m_1\ne m_2$ ,

$\displaystyle {\cal E}\left\{x^2(n-m_1)x^2(n-m_2)\right\} = {\cal E}\left\{x^2(n-m_1)\right\}{\cal E}\left\{x^2(n-m_2)\right\}=\sigma_x^2\sigma_x^2= \sigma_x^4$ (C.34)

by the independence of $ x(n-m_1)$ and $ x(n-m_2)$ , and when $ m_1=m_2$ , the fourth moment is given by $ {\cal E}\left\{x^4(n)\right\} = 3\sigma_x^4$ . More generally, we can simply label the $ k$ th moment of $ x(n)$ as $ \mu_k = {\cal E}\left\{x^k(n)\right\}$ , where $ k=1$ corresponds to the mean, $ k=2$ corresponds to the variance (when the mean is zero), etc.
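The Gaussian fourth-moment relation $ {\cal E}\left\{x^4(n)\right\} = 3\sigma_x^4$ is easy to confirm empirically; the following Octave/Matlab sketch does so, with arbitrary choices for the sample size and standard deviation.

% Empirical check of E{x^4} = 3*sigma^4 for zero-mean Gaussian noise.
% N and sigma are arbitrary illustration choices.
N = 1e6;                  % number of samples
sigma = 1.5;              % standard deviation
x = sigma * randn(N, 1);  % Gaussian white-noise samples
m4_meas = mean(x.^4);     % measured fourth moment
m4_pred = 3 * sigma^4;    % predicted fourth moment for a Gaussian
disp([m4_meas, m4_pred]); % should match to within sampling error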

When $ x(n)$ is assumed to be Gaussian white noise, we have

$\displaystyle {\cal E}\left\{x^2(n-m_1)x^2(n-m_2)\right\} = \left\{\begin{array}{ll} \sigma_x^4, & m_1\ne m_2 \\ [5pt] 3\sigma_x^4, & m_1=m_2 \\ \end{array} \right.$ (C.35)

so that the variance of our estimator for the variance of Gaussian white noise is

Var$\displaystyle \left\{\hat{\sigma}_x^2(n)\right\} = \frac{3M\sigma_x^4 + (M^2-M)\sigma_x^4}{M^2} - \sigma_x^4 = \zbox {\frac{2}{M}\sigma_x^4}$ (C.36)

Again we see that the variance of the estimator declines as $ 1/M$ .
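The $ 2\sigma_x^4/M$ result can likewise be checked numerically. The following Octave/Matlab sketch (again with arbitrary parameter choices) forms many independent $ M$ -sample variance estimates for Gaussian white noise and measures their spread.

% Monte Carlo sketch: variance of the M-sample variance estimate
% (zero-mean case, so each estimate is mean(x.^2)).  Parameter values
% are arbitrary illustration choices.
M = 64;                        % samples per variance estimate
ntrials = 20000;               % number of independent trials
sigma = 1;                     % white-noise standard deviation
x = sigma * randn(ntrials, M); % each row is one M-sample realization
var_hat = mean(x.^2, 2);       % zero-mean variance estimate per row
v_meas = var(var_hat);         % measured variance of the estimates
v_pred = 2 * sigma^4 / M;      % predicted variance 2*sigma_x^4/M
disp([v_meas, v_pred]);        % the two values should agree closely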

The same basic analysis as above can be used to estimate the variance of the sample autocorrelation estimates for each lag, and/or the variance of the power spectral density estimate at each frequency.

As mentioned above, to obtain a grounding in statistical signal processing, see references such as [201,121,95].

