confidence intervals and the missing link. youve taught students the normal curve and the central...

21
HOW CONFIDENT ARE YOU? Confidence Intervals and the Missing Link

Upload: ruby-chatt

Post on 31-Mar-2015

219 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

HOW CONFIDENT ARE YOU?

Confidence Intervals and the Missing Link

Page 2: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

What Do Students Need to Know?

You’ve taught students the normal curve and the central limit theorem, but they just don’t get confidence intervals.

Page 3: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Z-Scores – the missing link

What are z-scores? Sue got her math test back with a mark

of 82%. The teacher said the marks were normally distributed with a mean of 72% and a standard deviation of 10. How many standard deviations was Sue’s mark above the mean mark?

deviationsstdofdevstd

meanmark.#

.

Page 4: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Z-Scores are the number of standard deviations some x-value is above or below the mean. If z is

positive, x is above the mean; negative, x is below the mean.

x

z

Page 5: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

In Sue’s case, her z-score is 1. That is, her mark is one standard deviation above the mean her mark is being compared to. Using the 68-95-99.7 rule, we can conclude that Sue had a higher mark than 84% of the class.

Page 6: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

But what if the z-score does not work out to be a whole number? How can we estimate percentages if all we have is the 68-95-99.7 rule?

Z-Scores use a chart that let’s us easily look-up the values we want. Convenient tables allow us to find the numbers needed

http://www.epatric.com/documentation/statistics/z-score_table.html

Page 7: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Example Standard Normal Distribution:  On

Human Gestation Studies have determined that the average gestation period (time between conception and giving birth) for humans to be approximately 266 days, while the standard deviation (σ) is about 16 days.  If the length of gestation is considered to follow a normal distribution, then find the percent of human pregnancies which would be expected to last more than 39 weeks?

Using the z-score chart we find the % below this z value is 67%. Since we want the % above 39 weeks, we subtract this from 100%. 33% of pregnancies would be expected to last longer than 39 weeks.

44.016

266)39*7(

z

Page 8: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

What percent would be expected to last between 266 days and 286 days?

For this example, we need to calculate 2 z-score values and find the % in between.

Again, using the table, there is 89.44% below a z-value of 1.25. Subtracting 50% from this, we have 39.44% of pregnancies are expected to last between 266 and 286 days.

016

266266

z

25.116

266286

z

Page 9: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Example Coach recorded all the practice times for

the Horton track team. These times were N(5.6, 0.76) minutes – meaning a normal distribution with mean of 5.6 and standard deviation of 0.76. The coach says if you are in the lowest 5% you can compete in provincials. What time do you need?

5% matches a z-value of -1.64 Using the formula, we can solve for x You must have a time of < 4.35 mins.

x

x

35.476.0

6.564.1

Page 10: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Making the Connection

Central Limit Theorem

Page 11: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Example

The average height of a NS grade 11 student is 63” with a standard deviation of 4”; heights are normally distributed.

If a single grade 11 student is chosen at random what is the probability their height will be >66”?

2266.07734.01

75.04

6366

tableFrom

z

Page 12: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Sample vs. individual…

If 16 grade 11 students are chosen at random, what is the probability that their average height ( ) is greater than 66”?

x

0013.09987.01

316/4

6366

tableFrom

z

Page 13: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Example – not normal…

The average price of a house sold in NS in 2007 was $150000 with a standard deviation of $15000. Housing prices are right skewed. If you randomly selected one house from those sold, what is the probability it sold for < $125000?

Sol’n - We can’t use z-scoresbecause the distribution is not normal. Given our background, This cannot be found.

Page 14: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Non –normal continued…

You randomly choose 40 houses from those sold in 2007. what is the probability that the average selling price of these 40 houses is < $125000?

Sol’n – Since the sample size is ≥ 30, the sampling distribution of all possible

sample means is normal.

This value is “off the chart”. Basically, the probability is zero!

54.1040/15000

150000125000

z

Page 15: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Confidence Intervals

The recent scores of literacy tests for Grade 9 NS students was N(400, 36). If we choose one student at random , what is the probability their score is between 375 & 425?

69.036

400375

z

69.036

400425

z

5098.02451.07549.0 tableFrom

Page 16: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Taking a sample from the population…

If we take one sample of size 25 from all these scores, what is the probability the mean of the sample will be between 375 & 425?

From table -

47.325/36

400375

z

47.325/36

400425

z

9994.00003.09997.0

Page 17: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Creating an interval…

What if we take another sample of 25? How can we create an interval, using the mean, that will include µ?

If the mean of our sample was 407, we could add/subtractone std. dev. And get an intervalthat contains µ=400. The problemis we could get almost any value forthe mean of our sample AND we usually

don’t know the value of µ.

Page 18: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

What if we want to be 90% sure?

What if we want to be 90% sure that the interval we calculate from our sample mean will contain µ? How many standard deviations do we need to add and subtract to create this interval?

Using z-scores, we need to look-up a z-value with 95% below it. Z = 1.645We have to ± 1.645 std. dev.to the sample mean to get our interval.

n

zxCI

Page 19: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Applications …

The average weight of infants born in Canada in 1947 was N(3, 0.6) kg. We think babies are getting heavier so we took a SRS of 36 babies born in 2007 and got a sample mean of 3.4 kg. Are babies getting heavier?

Sol’n – we need a CI for the population of newborns in 2007. We decide to use a 90% confidence level. We can…

)56.3,24.3(

36

6.0645.14.3

Page 20: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

OR using the calculator…

Page 21: Confidence Intervals and the Missing Link. Youve taught students the normal curve and the central limit theorem, but they just dont get confidence intervals

Conclusion…

Since our population mean of 3 kg from 1947 is not contained in the interval estimate for the 2007 population , we can conclude with 90% confidence that newborns are getting heavier.