- This is part of probstat
Consider a certain distribution. The mean
of the distribution is the expected value of a random variable
sample from the distribution. I.e.,
Wikimedia Error
Error
Too many requests (f061ab2)
.
Also recall that the variance of the distribution is
Wikimedia Error
Error
Too many requests (f061ab2)
And finally, the standard deviation is
Wikimedia Error
Error
Too many requests (f061ab2)
.
Sample Statistics
Suppose that you take
samples
independently from this distribution. (Note that
are random variables.)
Sample means
The statistic
Wikimedia Error
Error
Too many requests (f061ab2)
is called a sample mean. Since
Wikimedia Error
Error
Too many requests (f061ab2)
are random variables, the mean
is also a random variable.
We hope that
approximates
well. We can compute:
Wikimedia Error
Error
Too many requests (f061ab2)
and since
are independent, we have that
Wikimedia Error
Error
Too many requests (f061ab2)
Sample variances and sample standard deviations
We can also use the sample to estimate
.
The statistic
is called a sample variance. The sample standard deviation is
Wikimedia Error
Error
Too many requests (f061ab2)
.
Note that the denominator is
Wikimedia Error
Error
Too many requests (f061ab2)
instead of
.
We can show that
.
Wikimedia Error
Error
Too many requests (f061ab2)
We note that since
and
are independent, we have that
Wikimedia Error
Error
Too many requests (f061ab2)
.
Let's deal with the middle term here.
Wikimedia Error
Error
Too many requests (f061ab2)
Let's work on the third term which ends up being the same as the middle term.
Wikimedia Error
Error
Too many requests (f061ab2)
Let's put everything together:
Summary
Sample means:
Wikimedia Error
Error
Too many requests (f061ab2)
Sample variance:
Properties of sample means and sample variances
Distribution of sample means
While we know basic properties of sample means
, if we want to perform other statistical calculation (i.e., computing confidence intervals or testing hypotheses), it is very useful to know the exact distribution of
.
For a general population, it will be hard to deal the the distribution of
exactly. However, if the population is normal, we are in a very good shape.
Recall the definition of
:
Therefore,
is a sum of independent normally distributed random variables. A nice property of normal random variables is that the sum of normally distributed random variables remains a normal random variable. Since a normal random variable is uniquely determined by its mean and variance, we have the following observation.
Distribution of sample means of normal populations.
If the population is normally distributed with mean and variance , the distribution of is normal with mean
and variance
.
Examples
Ex1. Suppose that the population has mean
Wikimedia Error
Error
Too many requests (f061ab2)
and variance . If you select a sample of size 20, what is the probability that the sample mean
is greater than 17?
Solution:
The sample mean is normal with mean
and variance
Wikimedia Error
Error
Too many requests (f061ab2)
. Therefore,
is unit normal.
Note that
We can look at the standard normal table and find out that , for a unit normal random variable Z. Thus, the probability
Wikimedia Error
Error
Too many requests (f061ab2)
which is roughly 1%.
Ex2.
- To be added...
Why do we use normal distributions?
Normal random variables appear very often in our treatment of statistics. This is not just a coincidence. See limit theorems.