Testing Claims about the Standard Deviation

Upload: jason-edington

Post on 21-Jan-2018

Page 1: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

Lecture 18

Page 2: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• Now we are going to test claims about a third parameter, the population standard deviation

• You might wonder why we’d want to do this

• The population mean and the population proportion make sense – you’d want to be able to draw inferences about them

• The standard deviation might seem irrelevant, but it isn’t

• There are situations in which it’s important to be able to draw conclusions about this measure of variation

Page 3: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• Here’s a situation in which it’s important for the standard deviation to be small

You’re manufacturing screws

The mean diameter of the screws is obviously important for the screws to fit, but if the diameters have a lot of variation many of the screws won’t be usable

Variation needs to be kept very small

• And here’s a situation in which it’s important for the standard deviation to be large

You’re giving a standardized test

If all the scores are similar (with little variation), your test won’t be useful for differentiating the test performances

Everybody’s score will be about the same

Page 4: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• So we want to be able to test claims about 𝜎, the population standard deviation, and to do so we have to know about the third of the four great statistical distributions, or probability density functions

• The first two were the normal, in particular the z-distribution, and the t-distribution, with its degrees of freedom

• Now we’re going to use the chi-square distribution, or 𝜒²-distribution

‘Chi’ is pronounced with a hard K sound and rhymes with ‘sigh’ or ‘pie’: ‘kigh’

• Whereas the z- and t-distributions are symmetrical and theoretically extend forever in both directions, the 𝜒²-distribution has neither characteristic

It starts at 0, extends forever to the right, and is not symmetrical, though it becomes more so as the number of degrees of freedom increases

• It has degrees of freedom, like the t-distribution, and you determine them the same way – by subtracting one from the sample size, n

Page 5: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• The reason that the 𝜒²-distribution starts at 0 is, as the name implies, because it involves adding things that are squared and thus cannot be negative

• The more of these “things” that are added, the larger the number of degrees of freedom and the larger the values of 𝜒² tend to be

• Here’s the pdf for a 𝜒² with three degrees of freedom (n = 4)

• Its hump occurs over 𝜒² = 1

• Note that we put a vertical axis at 0, because the pdf doesn’t extend to the left of 0
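The “adding squared things” idea can be illustrated with a small simulation. This is only a sketch: it assumes the “things” are independent standard normal values (the usual definition, which the slide does not spell out), and the function name, seed, and number of trials are illustrative choices.

```python
import random

def simulate_chi_square(df, trials, seed=18):
    """Draw `trials` values, each the sum of `df` squared standard normals."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    return [sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(df))
            for _ in range(trials)]

draws = simulate_chi_square(df=3, trials=20000)

smallest = min(draws)              # squares cannot be negative, so this is >= 0
average = sum(draws) / len(draws)  # should land near the degrees of freedom, 3

print(f"smallest draw: {smallest:.4f}")
print(f"average draw:  {average:.3f}")
```

Raising `df` adds more squared terms to each draw, which pushes the typical value of the sum to the right.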

Page 6: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• Here’s the pdf for a 𝜒² with seven degrees of freedom (n = 8)

• Its hump occurs over 𝜒² = 5

Page 7: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• Finally, here’s the pdf for a 𝜒² with 14 degrees of freedom (n = 15):

• Its hump occurs over 𝜒² = 12

Page 8: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• Here are all three 𝜒²’s on one graph

• As you can see, the curves become lower and more symmetrical as the number of degrees of freedom increases, and the humps move to the right and are located over d.f. – 2, or n – 3
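The hump locations can be checked numerically. The sketch below evaluates the standard chi-square density formula on a grid and reports where it is highest; the function names, grid spacing, and upper limit are illustrative assumptions, not part of the lecture.

```python
import math

def chi_square_pdf(x, df):
    """Height of the chi-square pdf at x for `df` degrees of freedom."""
    if x <= 0:
        return 0.0
    half = df / 2
    # Work with logarithms so large d.f. do not overflow.
    log_height = ((half - 1) * math.log(x) - x / 2
                  - half * math.log(2) - math.lgamma(half))
    return math.exp(log_height)

def hump_location(df, step=0.01, upper=60.0):
    """Scan a grid of x values and return the one where the pdf is highest."""
    grid = [i * step for i in range(1, int(upper / step) + 1)]
    return max(grid, key=lambda x: chi_square_pdf(x, df))

for df in (3, 7, 14):
    peak = hump_location(df)
    height = chi_square_pdf(peak, df)
    print(f"d.f. = {df:2d}: hump near {peak:.2f}, height {height:.3f}")
```

The peaks should come out near 1, 5, and 12, with the heights shrinking as the degrees of freedom grow, matching the three graphs described above.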

Page 9: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• Testing claims about the population standard deviation involves finding areas under 𝜒² pdfs

• We can’t use the calculator’s test menu

You might notice that there is a 𝜒²-Test listed on the menu, but it isn’t meant for this purpose and can’t be used for it

• Determining which area to find under the pdf involves a step we haven’t used yet in testing claims, though it’s used in the old-fashioned, classical method of testing all claims about parameters: finding the test value

Page 10: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• The test value comes from a formula involving the parameter (𝜎₀ in this case) and the statistic s, the standard deviation of our sample

• It is this test value that we place on the 𝜒²-distribution axis

• Before we get to that, though, let’s practice finding areas under 𝜒² pdfs

Page 11: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• Say you want to find the area to the left of 3.782 under a 𝜒² pdf for a sample size of 10. Here’s the diagram:

• I put 7 under the hump (d.f. = n – 1 = 10 – 1 = 9, and the hump is over d.f. – 2 = 9 – 2 = 7) so we could decide which side of the hump to put 3.782 on

• Remember to label the axis 𝜒²

Page 12: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• Say you want to find the area to the left of 3.782 under a 𝜒² pdf for a sample size of 10. Here’s the diagram:

• To find the shaded area, use 𝜒²cdf, followed by the left boundary of the shaded region, the right boundary of the shaded region, and the number of degrees of freedom:

• 𝜒²cdf(0, 3.782, 9) ≈ 0.075
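For readers without the calculator, the same area can be approximated in Python. The sketch below is not the calculator's own algorithm (which the lecture does not describe); it uses a standard series for the regularized lower incomplete gamma function, and `chi_square_cdf` is a hypothetical name chosen to mirror the calculator's argument order.

```python
import math

def chi_square_area_left(x, df, max_terms=10000, tol=1e-15):
    """Area under the chi-square pdf between 0 and x (series expansion)."""
    if x <= 0:
        return 0.0
    a, z = df / 2, x / 2
    # Series for the regularized lower incomplete gamma function P(a, z):
    # P = z^a e^(-z) / Gamma(a + 1) * (1 + z/(a+1) + z^2/((a+1)(a+2)) + ...)
    term, total = 1.0, 1.0
    for k in range(1, max_terms):
        term *= z / (a + k)
        total += term
        if term < tol * total:  # remaining terms no longer matter
            break
    log_front = a * math.log(z) - z - math.lgamma(a + 1)
    return math.exp(log_front) * total

def chi_square_cdf(lower, upper, df):
    """Hypothetical stand-in for the calculator call: left edge, right edge, d.f."""
    return chi_square_area_left(upper, df) - chi_square_area_left(lower, df)

area = chi_square_cdf(0, 3.782, 9)
print(round(area, 3))  # should agree with the slide's 0.075
```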

Page 13: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• One more: What is the area to the right of 31.621 under a 𝜒² pdf for a sample size of 25?

• 𝜒²cdf(31.621, 1000, 24) ≈ 0.137

• Notice how much more symmetrical and normal this curve looks than the one for d.f. = 9
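The right-tail area can also be treated literally as an area under the curve. The sketch below integrates the standard chi-square density from 31.621 to 1000 with Simpson's rule; the function names and the number of slices are illustrative assumptions. It also shows why 1000 works as a stand-in for "forever to the right": the curve is essentially at zero height by then.

```python
import math

def chi_square_pdf(x, df):
    """Height of the chi-square pdf at x for `df` degrees of freedom."""
    if x <= 0:
        return 0.0
    half = df / 2
    log_height = ((half - 1) * math.log(x) - x / 2
                  - half * math.log(2) - math.lgamma(half))
    return math.exp(log_height)

def area_by_simpson(lower, upper, df, slices=20000):
    """Approximate the area under the pdf from lower to upper (Simpson's rule)."""
    if slices % 2:
        slices += 1  # Simpson's rule needs an even number of slices
    h = (upper - lower) / slices
    total = chi_square_pdf(lower, df) + chi_square_pdf(upper, df)
    for i in range(1, slices):
        weight = 4 if i % 2 else 2  # interior points alternate 4, 2, 4, ...
        total += weight * chi_square_pdf(lower + i * h, df)
    return total * h / 3

right_area = area_by_simpson(31.621, 1000, 24)
height_at_1000 = chi_square_pdf(1000, 24)

print(round(right_area, 3))  # should agree with the slide's 0.137
print(height_at_1000)        # vanishingly small, so nothing is lost past 1000
```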

Page 14: Testing claims about the standard deviation

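Testing Claims about the Standard Deviation

• Now we’re ready to test claims

• The number that you’ll be putting on the 𝜒² axis, the test value, is given by this formula:

TEST VALUE: $\chi^2 = \dfrac{(n-1)s^2}{\sigma_0^2}$

Here n is the sample size, s is the sample standard deviation, and 𝜎₀ is the value of the standard deviation named in the hypotheses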

Page 15: Testing claims about the standard deviation

Testing Claims about the Standard Deviation

• Here are the directions for testing claims about 𝜎:

A. State the hypotheses and identify the claim

B. Find the test value, and determine the p-value

Make a diagram showing the distribution and the test value

Shade and label the p-value, which should be rounded to the nearest thousandth

C. Decide whether or not to reject the null hypothesis

D. Summarize the results

• As you can see, the directions are identical to those for testing claims about 𝜇 and p except for the second step

Page 16: Testing claims about the standard deviation

Testing the First Claim

• The standard deviation of the shoe sizes of Mendocino College students exceeds 1.8 sizes

Use 𝛼 = 0.05

• A) 𝐻₀: 𝜎 ≤ 1.8

𝐻₁: 𝜎 > 1.8 (Claim)

• B) So 1.8 is 𝜎₀, and from the calculator we get 2.032 for s, so

$$\chi^2 = \frac{(n-1)s^2}{\sigma_0^2} = \frac{(102-1)\cdot 2.032^2}{1.8^2} \approx 128.713$$

• We’re doing a right-tailed test, so the p-value is given by 𝜒²cdf(128.713, 1000, 101) ≈ 0.033
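Step B for this claim can be reproduced without the calculator. The sketch below computes the test value from the formula and then the right-tail area using a standard recurrence for the upper incomplete gamma function; this is an assumed substitute for the calculator's 𝜒²cdf, the function names are hypothetical, and the recurrence as written expects a whole number of degrees of freedom.

```python
import math

def chi_square_test_value(n, s, sigma0):
    """Test value (n - 1) * s^2 / sigma0^2."""
    return (n - 1) * s ** 2 / sigma0 ** 2

def chi_square_area_right(x, df):
    """Area under the chi-square pdf to the right of x (integer d.f.)."""
    if x <= 0:
        return 1.0
    z = x / 2
    if df % 2:   # odd d.f.: start from shape 1/2, where Q = erfc(sqrt(z))
        a, area = 0.5, math.erfc(math.sqrt(z))
    else:        # even d.f.: start from shape 1, where Q = e^(-z)
        a, area = 1.0, math.exp(-z)
    # Raise the shape one unit at a time until it reaches df / 2, using
    # Q(a + 1, z) = Q(a, z) + z^a e^(-z) / Gamma(a + 1).
    while a < df / 2:
        area += math.exp(a * math.log(z) - z - math.lgamma(a + 1))
        a += 1.0
    return area

test_value = chi_square_test_value(n=102, s=2.032, sigma0=1.8)
p_value = chi_square_area_right(test_value, df=102 - 1)

print(round(test_value, 3))  # 128.713
print(round(p_value, 3))     # should agree with the slide's 0.033
```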

Page 17: Testing claims about the standard deviation

Testing the First Claim

• C) 0.033 < 0.05, or 𝑝 < 𝛼, so reject 𝐻₀

• D) Lower-left rectangle: There is sufficient evidence to support the claim that the standard deviation of the shoe sizes of Mendocino College students exceeds 1.8 sizes

Page 18: Testing claims about the standard deviation

Testing the Second Claim

• The variance of the ages of Mendocino College students is at least 80

Use the 10% significance level

• Notice that the parameter mentioned in this claim is the variance, 𝜎², not the standard deviation, 𝜎

• This means that when you go to figure out the 𝜒² test value, you’ll use the 80 as is, without squaring it

• A) 𝐻₀: 𝜎² ≥ 80 (Claim)

𝐻₁: 𝜎² < 80

• B) So 80 is 𝜎₀², and the calculator gives 7.700 for s, so

$$\chi^2 = \frac{(n-1)s^2}{\sigma_0^2} = \frac{(102-1)\cdot 7.700^2}{80} \approx 74.854$$

• This is a left-tailed test, so the p-value is given by 𝜒²cdf(0, 74.854, 101) ≈ 0.024

Page 19: Testing claims about the standard deviation

Testing the Second Claim

• The variance of the ages of Mendocino College students is at least 80

Use the 10% significance level

• This is a left-tailed test, so the p-value is given by 𝜒²cdf(0, 74.854, 101) ≈ 0.024

• C) 0.024 < 0.10, or 𝑝 < 𝛼, so reject 𝐻₀

• D) Upper-left rectangle: There is sufficient evidence to reject the claim that the variance of the ages of Mendocino College students is at least 80

Page 20: Testing claims about the standard deviation

Testing the Third Claim

• Remember the coffee machine from Lecture #11, the one that was way out of whack?

• Well, perhaps it’s fixed now – we’ll see

By ‘fixed’ we mean that the standard deviation of the number of fluid ounces it deposits in cups is at most 0.05 fluid ounces

If the standard deviation is less than or equal to 0.05 fluid ounces, we say that the machine is working properly; otherwise we call the company that owns the machine to come and service it yet again

• So we take a random sample of eight such cups and measure their contents.

• These are the volumes in fluid ounces: 6.02, 5.89, 6.05, 6.01, 5.96, 5.87, 6.03, 5.92

Page 21: Testing claims about the standard deviation

Testing the Third Claim

• Let’s claim that the machine is working properly, i.e. that the standard deviation of the volumes is at most 0.05 fluid ounces

Use 𝛼 = 0.01

• A) 𝐻₀: 𝜎 ≤ 0.05 fluid ounces (Claim)

𝐻₁: 𝜎 > 0.05 fluid ounces

• We put the eight volumes in a list, and the calculator tells us that the sample standard deviation is 0.069
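The calculator step can be reproduced with Python's standard `statistics` module. The sketch below computes s from the eight volumes and then evaluates the test value formula twice, once with s rounded to 0.069 and once with full precision, to show how much the rounding matters. The variable names are illustrative.

```python
import statistics

volumes = [6.02, 5.89, 6.05, 6.01, 5.96, 5.87, 6.03, 5.92]  # fluid ounces
sigma0 = 0.05  # claimed upper limit for the standard deviation

n = len(volumes)
s = statistics.stdev(volumes)  # sample standard deviation (divides by n - 1)
s_rounded = round(s, 3)        # the three-decimal value read off the calculator

# Test value (n - 1) * s^2 / sigma0^2, with and without rounding s first.
value_from_rounded_s = (n - 1) * s_rounded ** 2 / sigma0 ** 2
value_from_full_s = (n - 1) * s ** 2 / sigma0 ** 2

print(s_rounded)                       # 0.069
print(round(value_from_rounded_s, 3))  # 13.331
print(round(value_from_full_s, 3))     # 13.235
```

Rounding s before squaring it shifts the test value by about 0.1 here, so it can be worth carrying the unrounded s through the formula when the p-value is close to 𝛼.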

Page 22: Testing claims about the standard deviation

Testing the Third Claim

• B) So 𝜎₀ is 0.05, and s is 0.069, so

$$\chi^2 = \frac{(n-1)s^2}{\sigma_0^2} = \frac{(8-1)\cdot 0.069^2}{0.05^2} \approx 13.331$$

• This is a right-tailed test, so the p-value is 𝜒²cdf(13.331, 1000, 7) ≈ 0.064

• C) 0.064 > 0.01, or 𝑝 > 𝛼, so do not reject 𝐻₀

• D) Upper-right rectangle: There isn’t sufficient evidence to reject the claim that the standard deviation of the volumes is at most 0.05 fluid ounces, or that the machine is working properly

No need to call for servicing
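Steps C and D in the three claims follow a pattern that can be sketched in code. The wording for three of the four "rectangles" below is taken from the summaries in this lecture; the fourth (claim in the alternative hypothesis, null not rejected) does not appear here and is filled in as an assumption, as is the choice not to reject when p equals 𝛼 exactly. The function names are hypothetical.

```python
def decide(p_value, alpha):
    """Step C: reject H0 when the p-value is below alpha.

    Assumption: a p-value exactly equal to alpha is treated as 'do not reject'.
    """
    return p_value < alpha

def summarize(claim_is_null, reject_null, claim_text):
    """Step D: choose the wording from the 2-by-2 table of cases."""
    verdict = "is sufficient" if reject_null else "isn't sufficient"
    # A claim sitting in H0 can only be rejected or not rejected;
    # a claim sitting in H1 can only be supported or not supported.
    action = "reject" if claim_is_null else "support"
    return f"There {verdict} evidence to {action} the claim that {claim_text}"

# (p-value, alpha, claim is H0?, claim) for the three worked examples
examples = [
    (0.033, 0.05, False, "the standard deviation of the shoe sizes exceeds 1.8 sizes"),
    (0.024, 0.10, True, "the variance of the ages is at least 80"),
    (0.064, 0.01, True, "the standard deviation of the volumes is at most 0.05 fluid ounces"),
]

summaries = [summarize(is_null, decide(p, alpha), text)
             for p, alpha, is_null, text in examples]

for line in summaries:
    print(line)
```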

Page 23: Testing claims about the standard deviation

Activity 18: Testing Claims about the Standard Deviation

• Perform hypothesis tests on these claims. Include these steps in each test:

• A) State the hypotheses and identify the claim.

• B) Find the test value, and determine the p-value.

Make a diagram showing the distribution and the test value. Shade and label the p-value, which should be rounded to the nearest thousandth.

• C) Decide whether or not to reject the null hypothesis.

• D) Summarize the results.

1. At the 5% significance level, test the claim that the standard deviation of the heights of Mendocino College students is at most 4 inches.

2. At the 1% level of significance, test the claim that the variance of the number of pets owned by students at Mendocino College is less than 5.

3. A machine is supposed to fill bags with exactly 16 ounces of pasta. The machine is said to be working properly if the standard deviation of the weights of the pasta in the bags is at most 0.1 ounce. Six randomly-selected bags are weighed, and the resulting weights in ounces are: 16.15, 15.88, 15.82, 16.03, 16.10, 15.76

Test the claim that the machine is working properly at the 5% significance level.

• TEST VALUE: $\chi^2 = \dfrac{(n-1)s^2}{\sigma_0^2}$