Probstat/notes/sample means and sample variances

จาก Theory Wiki
ไปยังการนำทาง ไปยังการค้นหา
This is part of probstat

Consider a certain distribution. The mean of the distribution is the expected value of a random variable sample from the distribution. I.e.,

.

Also recall that the variance of the distribution is

Error

Too many requests (f061ab2)

And finally, the standard deviation is .

Sample Statistics

Suppose that you take samples independently from this distribution. (Note that are random variables.)

Sample means

The statistic

is called a sample mean. Since

Error

Too many requests (f061ab2)

are random variables, the mean is also a random variable.

We hope that approximates well. We can compute:

Error

Too many requests (f061ab2)

and since are independent, we have that

Error

Too many requests (f061ab2)

Sample variances and sample standard deviations

We can also use the sample to estimate .

The statistic

is called a sample variance. The sample standard deviation is .

Note that the denominator is instead of .

We can show that .

We note that since and are independent, we have that

.

Let's deal with the middle term here.

Let's work on the third term which ends up being the same as the middle term.

Error

Too many requests (f061ab2)

Let's put everything together:

Error

Too many requests (f061ab2)

Summary

Sample means:

Sample variance:

Properties of sample means and sample variances

Distribution of sample means

While we know basic properties of sample means , if we want to perform other statistical calculation (i.e., computing confidence intervals or testing hypotheses), it is very useful to know the exact distribution of .

For a general population, it will be hard to deal the the distribution of exactly. However, if the population is normal, we are in a very good shape.

Recall the definition of :

Error

Too many requests (f061ab2)

Therefore, is a sum of independent normally distributed random variables. A nice property of normal random variables is that the sum of normally distributed random variables remains a normal random variable. Since a normal random variable is uniquely determined by its mean and variance, we have the following observation.

Distribution of sample means of normal populations.

If the population is normally distributed with mean and variance , the distribution of is normal with mean and variance

Error

Too many requests (f061ab2)

.

Examples

Ex1. Suppose that the population has mean and variance

Error

Too many requests (f061ab2)

. If you select a sample of size 20, what is the probability that the sample mean is greater than 17?

Solution:

The sample mean is normal with mean and variance . Therefore,

is unit normal.

Note that

We can look at the standard normal table and find out that , for a unit normal random variable Z. Thus, the probability

Error

Too many requests (f061ab2)

which is roughly 1%.

Ex2.

To be added...

Why do we use normal distributions?

Normal random variables appear very often in our treatment of statistics. This is not just a coincidence. See limit theorems.

รายการเลือกการนำทาง