Review of the Central Limit Theorem

If a random sample of N cases is drawn from a population with mean m and standard deviation s, then the sampling distribution of the mean (the distribution of all possible means for samples of size N)

1) has a mean equal to the population mean mx:

2) has a standard deviation (also called "standard error" or "standard error of the mean") equal to the population standard deviation, sx, divided by the square root of the sample size, N:

3) and the shape of the sampling distribution of the mean approaches normal as N increases.

This last point is especially important: The shape of the sampling distribution approaches normal as the size of the sample increases, whatever the shape of the population distribution. The facts represented in the Central Limit Theorem allow us to determine the likely accuracy of a sample mean, but only if the sampling distribution of the mean is approximately normal.

If the population distribution is normal, then the sampling distribution of the mean will be normal for any sample size N (even N = 1). If a population distribution is not normal, but it has a bump in the middle and no extreme scores and no strong skew, then a sample of even modest size (e.g., N = 30) will have a sampling distribution of the mean that is very close to normal. However, if the population distribution is far from normal (e.g., extreme outliers or strong skew), then to produce a sampling distribution of the mean that is close to normal it may be necessary to draw a very large sample (e.g., N = 500 or more).

Important note: You should not assume that the sampling distribution of the mean is normal without considering the shape of the population distribution and the size of your sample. A sample with N > 30 does not guarantee a normal sampling distribution if the population distribution is far from normal.

Close This Window