1 sampling distributions. central limit theorem*

27
1 Sampling Distributions

Upload: irene-wilcox

Post on 04-Jan-2016

237 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: 1 Sampling Distributions. Central Limit Theorem*

1

Sampling Distributions

Page 2: 1 Sampling Distributions. Central Limit Theorem*

Central Limit Theorem*

Page 3: 1 Sampling Distributions. Central Limit Theorem*

Distribution of Sample Means• Consider the following data as a Population

2, 4, 6, 8

– The population mean is 5– The population standard deviation is 2.236

• Now we are going to take ALL possible samples of n = 2 from this population.

• We will calculate the mean for each sample

Page 4: 1 Sampling Distributions. Central Limit Theorem*

Sampling Distribution of Means for Samples of n = 2

Pick 1 Pick 2 Mean Mean 2 Variance Standard Deviation

2 2 2 4 0 0.0002 4 3 9 2 1.4142 6 4 16 8 2.8282 8 5 25 18 4.2434 2 3 9 2 1.4144 4 4 16 0 0.0004 6 5 25 2 1.4144 8 6 36 8 2.8286 2 4 16 8 2.8286 4 5 25 2 1.4146 6 6 36 0 0.006 8 7 49 2 1.4148 2 5 25 18 4.2438 4 6 36 8 2.8288 6 7 49 2 1.4148 8 8 64 0 0.00 80 440

Page 5: 1 Sampling Distributions. Central Limit Theorem*

Central Limit Theorem Applied

Page 6: 1 Sampling Distributions. Central Limit Theorem*

Results from a survey of students who were asked how many hours they spend per week using a search engine on the Internet.

n = 400 μ = 3.88 σ = 2.40

Page 337

Page 7: 1 Sampling Distributions. Central Limit Theorem*

A sample of 32 students selected from the 400 on the previous slide.

The mean of this sample is x = 4.17.

1.1 7.8 6.8 4.9 3.0 6.5 5.2 2.2 5.1 3.4 4.7 7.0 3.8 5.7 6.5 2.7 2.6 1.4 7.1 5.5 3.1 5.0 6.8 6.5 1.7 2.1 1.2 0.3 0.9 2.4 2.5 7.8

Sample 1

A different sample of 32 students selected from the 400.

Now you have two sample means that don’t agree with each other, and neither one agrees with the true population mean.

1.8 0.4 4.0 2.4 0.8 6.2 0.8 6.6 5.7 7.9 2.5 3.6 5.2 5.7 6.5 1.2 5.4 5.7 7.2 5.1 3.2 3.1 5.0 3.1 0.5 3.9 3.1 5.8 2.9 7.2 0.9 4.0

Sample 2

For this sample is = 3.98.x̄

Page 8: 1 Sampling Distributions. Central Limit Theorem*

Figure 8.6 shows a histogram that results from 100 different samples, each with 32 students. Notice that this histogram is very close to a normal distribution and its mean is very close to the population mean, μ = 3.88.

Figure 8.6 A distribution of 100 sample means, with a sample size of n = 32, appears close to a normal distribution with a mean of 3.88.

Page 9: 1 Sampling Distributions. Central Limit Theorem*

• The distribution of sample means is approximately a normal distribution.

• The mean of the distribution of sample means is 3.88 (the mean of the population).

• The standard deviation of the distribution of sample means depends on the population standard deviation and the sample size. The population standard deviation is σ = 2.40 and the sample size is n = 32, so the standard deviation of sample means is

= = 0.42σn

2.4032

Central Limit Theorem application: If we were to include all possible samples of size n = 32, this distribution would have these characteristics:

Page 10: 1 Sampling Distributions. Central Limit Theorem*

10

Margin of Error for the MeanThe margin of error for the 95% confidence interval is

where s is the standard deviation of the sample.

We find the 95% confidence interval by adding and subtracting the margin of error from the sample mean. That is, the 95% confidence interval ranges

from (x – margin of error) to (x + margin of error)

We can write this confidence interval more formally as

– E < μ < + E

or more briefly as ± E

margin of error = E ≈2s n

x̄x̄

Page 11: 1 Sampling Distributions. Central Limit Theorem*

95% Confidence Interval

Page 12: 1 Sampling Distributions. Central Limit Theorem*

Constructing a Confidence Interval

Page 13: 1 Sampling Distributions. Central Limit Theorem*

Interpreting the Confidence Interval

Figure 8.10 This figure illustrates the idea behind confidence intervals. The central vertical line represents the true population mean, μ. Each of the 20 horizontal lines represents the 95% confidence interval for a particular sample, with the sample mean marked by the dot in the center of the confidence interval. With a 95% confidence interval, we expect that 95% of all samples will give a confidence interval that contains the population mean, as is the case in this figure, for 19 of the 20 confidence intervals do indeed contain the population mean. We expect that the population mean will not be within the confidence interval in 5% of the cases; here, 1 of the 20 confidence intervals (the sixth from the top) does not contain the population mean.

Page 14: 1 Sampling Distributions. Central Limit Theorem*

Using StatCrunch -Confidence Intervals

• In the data set; select:– STAT– Z Statistics– One-Sample– With Data – Select Variable– Click next– Select confidence interval and percent – Calculate

Page 15: 1 Sampling Distributions. Central Limit Theorem*

15

Page 16: 1 Sampling Distributions. Central Limit Theorem*

16

Determine Minimum Sample Size

• Solve the margin of error formula [E =2s/√n] for n.

• You want to study housing costs in the country by sampling recent house sales in various (representative) regions. Your goal is to provide a 95% confidence interval estimate of the housing cost. Previous studies suggest that the population standard deviation is about $7,200. What sample size (at a minimum) should be used to ensure that the sample mean is within

• a. $500 of the true population mean?

22

sn

E

4.8298.28500

200,722 222

n

E

Page 17: 1 Sampling Distributions. Central Limit Theorem*

Solution: a. With E = $500 and σ estimated as $7,200, the minimum sample size that

meets the requirements is

EXAMPLE Constructing a Confidence Interval

You want to study housing costs in the country by sampling recent house sales in various (representative) regions. Your goal is to provide a 95% confidence interval estimate of the housing cost. Previous studies suggest that the population standard deviation is about $7,200. What sample size (at a minimum) should be used to ensure that the sample mean is within

a. $500 of the true population mean?

b. $100 of the true population mean?

4.8298.28500

200,722 222

n

E

Page 18: 1 Sampling Distributions. Central Limit Theorem*

Solution:a. (cont.) Because the sample size must be a whole number, we conclude that

the sample should include at least 830 prices.

b. With E = $100 and σ = $7,200, the minimum sample size that meets the requirements is

EXAMPLE Constructing a Confidence Interval

736,20144100

200,722 222

n

E

Notice that to decrease the margin of error by a factor of 5 (from $500 to $100), we must increase the sample size by a factor of 25. That is why achieving greater accuracy generally comes with a high cost.

Page 19: 1 Sampling Distributions. Central Limit Theorem*

Distribution of Sample Proportions

Page 20: 1 Sampling Distributions. Central Limit Theorem*

Sample Proportions

In a survey where 400 students were asked if they own a car, 240 replied that they did.

The exact proportion of car owners is

p = = 0.6240400

This population proportion, p = 0.6, is another example of a population parameter.

Page 340

Page 21: 1 Sampling Distributions. Central Limit Theorem*

21

95% Confidence Interval for a Population Proportion

For a population proportion, the margin of error for the 95% confidence interval is

where is the sample proportion.

The 95% confidence interval ranges

from – margin of error to + margin of error

We can write this confidence interval more formally as

n

ppE

)ˆ1(ˆ2

EppEp ˆ–ˆ

p̂p̂

Page 22: 1 Sampling Distributions. Central Limit Theorem*

22

Choosing the Correct Sample Size

In order to estimate a population proportion with a 95% degree of confidence and a specified margin of error of E, the size of the sample should be at least

n > 1E 2

Page 23: 1 Sampling Distributions. Central Limit Theorem*

The Nielsen ratings for television use a random sample of households. A Nielsen survey results in an estimate that a women’s World Cup soccer game had 72.3% of the entire viewing audience. Assuming that the sample consists of n = 5,000 randomly selected households, find the margin of error and the 95% confidence interval for this estimate.

Solution: The sample proportion, = 72.3% = 0.723, is the best estimate of the population proportion.

The margin of error is

EXAMPLE TV Nielsen Ratings

013.0000,5

)723.01(723.02

)ˆ1(ˆ2

n

ppE

Page 24: 1 Sampling Distributions. Central Limit Theorem*

Solution: (cont.) The 95% confidence interval is

0.723 – 0.013 < p < 0.723 + 0.013,

or

With 95% confidence, we conclude that between 71.0% and 73.6% of the entire viewing audience watched the women’s World Cup soccer game.

EXAMPLE 2 TV Nielsen Ratings

0.710 < p < 0.736

Page 25: 1 Sampling Distributions. Central Limit Theorem*

25

You plan a survey to estimate the proportion of students on your campus who carry a cell phone regularly. How many students should be in the sample if you want (with 95% confidence) a margin of error of no more than 4 percentage points?

Solution: Note that 4 percentage points means a margin of error of 0.04. From the given formula, the minimum sample size is

You should survey at least 625 students.

EXAMPLE Minimum Sample Size for Survey

1E 2n = = = 625

10.04 2

Page 26: 1 Sampling Distributions. Central Limit Theorem*

Core Logic of Hypothesis Testing

• Considers the probability that the result of a study could have come about if the experimental procedure had no effect

• If this probability is low, scenario of no effect is rejected and the theory behind the experimental procedure is supported

Page 27: 1 Sampling Distributions. Central Limit Theorem*

Hypothesis Testing using Confidence Intervals

State the claim about the population mean Determine desired confidence level Select a random sample from the population Calculate the confidence interval for the desired level of

confidence. If the claim is contained within the interval, the claim is

reasonable; if it is not within the interval, the claim is not reasonable, at the given level of confidence.

See Testing a Claim document in Doc Sharing