- This is part of probstat
Consider a certain distribution. The mean
of the distribution is the expected value of a random variable
sample from the distribution. I.e.,
.
Also recall that the variance of the distribution is
.
And finally, the standard deviation is
.
Sample Statistics
Suppose that you take
samples
independently from this distribution. (Note that
are random variables.)
Sample means
The statistic
is called a sample mean. Since
are random variables, the mean
is also a random variable.
We hope that
approximates
well. We can compute:
and
Sample variances and sample standard deviations
We can also use the sample to estimate
.
The statistic
is called a sample variance. The sample standard deviation is
.
Note that the denominator is
instead of
.
We can show that
.
We note that since
and
are independent, we have that
.
Let's deal with the middle term here.
Let's work on the third term which ends up being the same as the middle term.
Let's put everything together:
\begin{array}{rcl}
\mathrm{E}[S^2]
&=& \frac{1}{n-1}\left( \sum_{i=1}^n E[X_i^2]
- (2/n)\cdot\sum_{i=1}^n \sum_{j=1}^n E\left[X_i\cdot X_j\right]
+ (1/n)\cdot E\left[\left(\sum_{j=1}^n X_j\right)^2\right] \right) \\
&=& \frac{1}{n-1}\left( n E[X^2]
- (2/n)(n E[X^2] + n(n-1)\mu^2)
+ (1/n)(n E[X^2] + n(n-1)\mu^2) \right) \\
&=& \frac{1}{n-1}\left( n E[X^2]
- 2E[X^2] - 2(n-1)\mu^2
+ E[X^2] + (n-1)\mu^2 \right) \\
&=& \frac{1}{n-1}\left((n-1) E[X^2] - (n-1)\mu^2 \right) \\
&=& E[X^2] - \mu^2 = \sigma^2\\
\end{array}
</math>
Distribution of sample means