s1: chapter 9 the normal distribution dr j frost ([email protected]) last modified: 4 th...

31
S1: Chapter 9 The Normal Distribution Dr J Frost ([email protected]) Last modified: 4 th March 2014

Upload: eric-francis

Post on 24-Dec-2015

251 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

S1: Chapter 9The Normal Distribution

Dr J Frost ([email protected])

Last modified: 4th March 2014

Page 2: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

What does it look like?

The following shows what the probability distribution might look like for a random variable X, if X is the height of a randomly chosen person.

Height in cm (x)

p(x)

180cm

We expect this ‘bell-curve’ shape, where we’re most likely to pick someone with a height around the mean of 180cm, with the probability diminishing symmetrically either side of the mean.

A variable with this kind of distribution is said to have a normal distribution. For normal distributions we tend to draw

the axis at the mean for symmetry.

Page 3: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

What does it look like?

We can set the mean and the standard deviation of the Normal Distribution.

Page 4: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Height in cm (x)180cm

For a Normal Distribution to be used, the variable has to be:continuous

With a discrete variable, all the probabilities had to add up to 1.For a continuous variable, similarly:the area under the probability graph has to be 1.

To find P(170cm < X < 190cm), we could:find the area between these values.

Would we ever want to find P(X = 200cm) say?Since height is continuous, the probability someone is ‘exactly’ 200cm is infinitesimally small. So not a ‘probability’ in the normal sense.

Normal Distribution Q & A

Q1

Q2

Q3

190cm170cm

?

?

?

p(x)

Q4

?

Page 5: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Notation

If a variable is ‘normally distributed’ (i.e. its probability function uses a normal distribution), then we write:

𝑋 𝑁 (𝜇 ,𝜎 2 )

The random variable X...

...is distributed...

...using a Normal distribution with mean and variance

Page 6: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Z valueThe Z value is the number of standard deviations a value is above the mean.

100

IQ, by definition, is normally distributed for a given population. By definition, and i.e.

IQ (x)

Example

IQ Z

100 0

130 2

85 -1

165 4.333

62.5 -2.5

?????

p(x)

! ?

Page 7: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

You may be wondering why we have to look up the values in a table, rather than being able to calculate it directly. The reason is that calculating the area under the graph involves integrating (see C2), but the probability function for the normal distribution (which you won’t see here) cannot be integrated!

A z-table allows us to find the probability that the outcome will be less than a particular z value.For IQ, P(Z < 2) would mean “the probability your IQ is less than 130”. (You can find these values at the back of your textbook, and in a formula booklet in the exam.)

P(Z < 2) = 0.9772?

Z table

IQ (x)

100 130115

z=0 z=1 z=2

Expand

Minimise

Page 8: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Use of the z-tableSuppose we’ve already worked out the z value, i.e. the number of standard deviations above or below the mean.

Bro Tip: We can only use the z-table when:a) The z value is

positive (i.e. we’re on the right half of the graph)

b) We’re finding the probability to the left of this z value.

1

𝑧=−0.3

This is clear by symmetry.?

𝑧

2

𝑧=−0.3

?

𝑧

3

𝑧=1

?

𝑧

4

𝑧=−2

?

𝑧

Page 9: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Exercise Determine the following probabilities, ensuring you show how you have manipulated your probabilities (as per the previous examples).

123456789

10

??

??

??

???

?

Page 10: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

‘Standardising’

We have seen that in order to look up a value in the table, we needed to first convert our IQ into a value. We call this ‘standardising’ our variable, because we’re turning our normally distributed variable into another one where the mean is 0 and the standard deviation is 1.

𝑋 𝑍𝑍=

𝑋−𝜇𝜎

e.g. 𝑍 𝑁 (0,12 )

Standardise

Why is ? Well consider a z value of 3 for example. We understand that to mean 3 standard deviations above the mean. But if and , the 3 is 3 lots of 1 above 0!

Page 11: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

‘Standardising’

Here’s how they’d expect you to lay out your working in an exam:

𝑋 𝑁 (160 ,102 )

The heights in a population are normally distributed with mean 160cm and standard deviation 10cm. Find the probability of a randomly chosen person having a height less than 180cm.

No marks attached with this, but good practice!

A statement of the problem.

M1 for “attempt to standardise”

Look up in z-table

?

?

??

Page 12: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Test your UnderstandingSteel rods are produced by with a mean weight of 10kg, and a standard deviation of 1.6kg. Determine the probability that a randomly chosen steel rod has a weight below 9kg.

𝑋 𝑁 (10 ,1.62 )?

Page 13: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Exam QuestionsEdexcel S1 May 2012

Edexcel S1 May 2013 (Retracted)

Edexcel S1 Jan 2011

P(Z > -1.6) = P(Z < 1.6) = 0.9452

P(Z < -0.5) = P(Z > 0.5) = 1 – P(Z < 0.5)= 0.3085

P(Z > 1.6)= 1 – P(Z < 1.6)= 0.0548

?

?

?

Page 14: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Quickfire ProbabilitiesThese are for further practice if you’re viewing these slides outside of class. You don’t need a calculator, as the values are obvious!Let X represent the IQ of a randomly chosen person, where Use your Z-table to find:

Example:P(X < 115) = P(Z < 1) = 0.8413

P(X < 107.5) = P(Z < 0.5) = 0.6915P(X > 122.5) = P(Z > 1.5) = 1 – P(Z < 1.5) = 0.0668P(X > 85) = P(X < 115) = P(Z < 1) = 0.8413P(X < 106) = P(Z < 0.4) = 0.6554P(X > 95) = P(X < 105) = P(Z < 0.33) = 0.6293P(X < 92.5) = P(Z < -0.5) = P(Z > 0.5) = 1 – P(Z < 0.5) = 0.3085P(X < 80) = P(Z < -1.33) = P(Z > 1.33) = 1 – P(Z < 1.33) = 0.0918

12

345

67

??

???

??

(For further practice outside class)

Page 15: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Probabilities for RangesAgain, let X represent the IQ of a randomly chosen person, where Thinking about the graph of the normal distribution, find:

P(96 < X < 112)

IQ (x)

100

z=0

96 112

This easiest way is to find P(X < 112) and ‘cut out’ the area corresponding to P(X < 96):

?

We can see that, in general:P(a < Z < b) = P(Z < b) – P(Z < a)

Page 16: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Quickfire ProbabilitiesLet X represent the IQ of a randomly chosen person, where Use your Z-table to find:

P(100 < X < 107.5) = P(Z < 0.5) – 0.5 = 0.1915

P(145 < X < 151) = P(3 < Z < 3.4) = P(Z < 3.4) – P(Z < 3) = 0.001

P(116 < X < 120) = P(1.07 < X < 1.33) = 0.9082 – 0.8577 = 0.0505

P(80 < X < 110) = P(-1.33 < X < 0.67) = 0.7486 – (1 – 0.9082) = 0.6568

P(70 < X < 90) = P(-2 < Z < -0.67) = (1 – 0.7486) – (1 – 0.9772) = 0.2286

1

2

3

4

5

?

?

?

?

?

Page 17: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Summary so far

Original value

z-value(num standard

deviations above mean)Probability

Use Z-tableor

This essentially says that our value is standard deviations above the mean.

must be positive.

must be <

Page 18: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

The reverse: Finding the z-value for a probability

Sometimes we’re given the probability, and need to find the value, so that we can determine a missing value or the standard deviation/mean.Just use the z-table backwards!

Bro Tip: Remember that either flipping the inequality, or changing the sign of will cause your probability to become 1 minus it.

????

??

????

?

??

For nice ‘round’ probabilities, we have to look in the second z-table. You’ll lose a mark otherwise.

?

Page 19: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

The reverse: Finding the z-value for a probability

Again, let X represent the IQ of a randomly chosen person, where

What IQ corresponds to the top 22% of the population?

State the problem in probabilistic terms.

𝑃 (𝑍<𝑥−10015 )=0.78

Find the closest probability in the z-table.𝑧=0.77

We can use this value to find the value of .

?

?

0.78

Bro Tip: Draw a diagram for these types of questions if it helps.

z = 0.77

𝑃 (𝑋<𝑥 )=0.78

‘Standardise’.

?

?

Page 20: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

The reverse: Finding the z-value for a probability

Again, let X represent the IQ of a randomly chosen person, where

What IQ corresponds to the top 10% of the population?

‘Standardise’

𝑃 (𝑍<𝑥−10015 )=0.9

Find the closest probability in the z-table.

z = 1.2816

Convert from value to IQ.

?

?

?0.9

z = 1.2816

State the problem in probabilistic terms.𝑃 (𝑋<𝑥 )=0.9?

Page 21: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

𝑃 (𝑍 <−𝑧)=0.7

The reverse: Finding the z-value for a probability

Again, let X represent the IQ of a randomly chosen person, where

What IQ corresponds to the bottom 30% of the population?

𝑃 (𝑋<𝑥 )=0.3

We can’t look up probabilities less than 0.5 in the table, so manipulate:

z = -0.5244

Convert back into an IQ.

?

?

Find the closest probability in the z-table.Again, use the second table.

?

𝑃 (𝑍<𝑧 )=0.3

‘Standardise’

State the problem in probabilistic terms.

(I haven’t written yet because it’ll make the manipulation more tedious. I’ve used a variable to represent the value)?

?

Page 22: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Test Your Understanding

Remember:1. State your problem in probabilistic terms.2. Standardise. Manipulate if necessary so that your probability is above 0.5 and you’re finding .3. Use your z-table backwards to find the z-table.4. Use to find your value of .

On the planet Frostopolis, the mean height of a Frongal is 1.57m and the standard deviation 0.2m. Determine:

a) The height for which 65% of Frongals have a height less than.

b) The height for which 40% of Frongals have a height more than.

c) The height for which 23% of Frongals have a height less than.

?

?

?

Page 23: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Exercise 9C Page 184

Given a) b)

Given , find such that

Given that , find such that

. Find the value of and such that:a)

b)

c)

1

6

7

9

??

?

?

??

?

Page 24: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Quartiles

IQ (x)

P(X = x)

Q3 = 110 Q1 = 90 ? ?Q2 = 100?

𝑋 𝑁 (100 ,152 )

i.e. The upper quartile is two-thirds of a standard deviation above a mean (useful to remember!)

Page 25: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Edexcel S1 Jan 2011

Exam Questions

z = -2.3263 (using 2nd z-table)w = 160 – (5 x 2.3263) = 148.3685

Edexcel S1 May 2012

P(Z < z) = 0.6z = 0.2533 (using 2nd table)W = 162 + (7.5 x 0.2533) = 163.89975cm

?

?

Edexcel S1 May 2010

a) P(D > 20) = P(Z > -1.25) = P(Z < 1.25) = 0.8944

b) P(Z < z) = 0.75 -> z = 0.67Q3 = 30 + (0.67 x 8) = 35.36

c) Q1 = 30 – (0.67 x 8) = 24.64

?

Page 26: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Finding missing and

The random variable Given that P(X > 20) = 0.20, find the value of .

(Normalising)

The random variable Given that P(X < 46) = 0.2119, find the value of .

If your standard deviation is negative, you know you’ve done something wrong!

?

?

Page 27: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Finding missing and

The random variable Given that P(X > 35) = 0.025 and P(X < 15) = 0.1469, find the value of and the value of

P(X < 35) = 1 – 0.025 = 0.975If P(Z < z) = 0.975, then z = 1.96.

?

First deal with P(X > 35) = 0.025 Next deal with P(X < 15) = 0.1469

We now have two simultaneous equations. Solving gives:

?

?

Page 28: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Test your understanding

For the weights of a population of squirrels, which are normally distributed, Q1 = 0.55kg and Q3 = 0.73kg.Find the standard deviation of their weights.

Due to symmetry, = (0.55 + 0.73)/2 = 0.64kgIf P(Z < z) = 0.75, then z = 0.67.0.64 + 0.67 = 0.73 = 0.134

= 0.114 = 0.124 = 0.134 = 0.144

Only 10% of maths teachers live more than 80 years. Triple that number live less than 75 years. Given that life expectancy of maths teachers is normally distributed, calculate the standard deviation and mean life expectancy.

= 2.77 = 2.78 = 79 = 2.80

= 76.15 = 76.25 = 76.35 = 76.45Similarly – 0.5244 = 75

RIP

A IngallHe loved math

Page 29: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Exam Questions

P(Z < z) = 0.85, then z = 1.04.So 115 minutes is 1.04 standard deviations above the mean.

Edexcel S1 May 2013 (Retracted)

Edexcel S1 Jan 2011 Using z-table, 160 is 2.32 standard deviations above the mean. So:

Similarly, z-value for 0.9 is 1.28. By symmetry, 152 has z value of -1.28. So:

Solving, and

Edexcel S1 Jan 2002

a) z-value for 0.975 is 1.96. By symmetry, 235 is 1.96 standard deviations below mean.So . The result follows.

b) P(Z < z) = 0.85. So z = 1.04

c) Solving, , d) If 0.683 in the middle, (0.683/2)+0.5=0.8415

prob below value above mean. Thus z = 1.So values are 154.8 – 2.22 and 154.8 + 2.22

?

?

?

Page 30: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Summary

If a random variable X is normally distributed with mean 50 and standard deviation 2, we would write:

A z-value is:The number of standard deviations above the mean.

A z-table is:A cumulative distribution function for a normal distribution with mean 0 and standard deviation 1.

P(IQ < 115) = 0.8413

We can treat quartiles and percentiles as probabilities.

For IQ, what is the 85th percentile?100 + (1.04 x 15) = 115.6

We can form simultaneous equations to find the mean and standard deviation, given known values with their probabilities.

B

C

D

G

H

?

?

?

?

?

F P(a < X < b) = P(X < b) – P(X < a)?

A normal distribution is good for modelling data which:tails off symmetrically about some mean/ has a bell-curve like distribution.

A

?

E We need to use the second z-table whenever:we’re looking up the z value for certain ‘round’ probabilities.?

Page 31: S1: Chapter 9 The Normal Distribution Dr J Frost (jfrost@tiffin.kingston.sch.uk) Last modified: 4 th March 2014

Mixed QuestionsEdexcel S1 June 2001