the normal distribution. n = 20,290 = 2622.0 = 2037.9 population

49
The Normal Distribution QuickTime™ and a TIFF (Uncompressed) decompressor are needed to see this picture.

Post on 20-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

The Normal Distribution

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Page 2: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

n = 20,290 = 2622.0 = 2037.9

Population

Page 3: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Y = 2675.4s = 1539.2

Y = 2588.8s = 1620.5

Y = 2702.4s = 1727.1

Y = 2767.2s = 2044.7

SAMPLES

Page 4: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Y = 2675.4s = 1539.2

Y = 2588.8s = 1620.5

Y = 2702.4s = 1727.1

Y = 2767.2s = 2044.7

Sampling distribution of the mean

Page 5: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

1000 samples

Sampling distribution of the mean

Page 6: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Sampling distribution of the mean

Page 7: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Sampling distribution of the mean

Non-normal

Approximately normal

Page 8: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population
Page 9: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Sample means are normally distributed

• The mean of the sample means is .

• The standard deviation of the sample means is

Y

n

*If the variable itself is normally distributed,or sample size (n) is large

Page 10: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Standard error

The standard error of an estimate of a mean is the standard deviation of the distribution of sample means

Y

n

We can approximate this by

SEY

=s

n

Page 11: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Distribution of means of samples with n =10

Page 12: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Larger samples equal smaller standard

errors

Page 13: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Central limit theorem

Page 14: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Button pushing times

Frequency

Time (ms)

Page 15: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Distribution of means

Page 16: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Binomial Distribution

Page 17: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Normal approximation to the binomial distribution

The binomial distribution, when number of trials n is large

and probability of success p is not close to 0 or 1, is

approximated by a normal distribution having mean np

and standard deviation

np1−p( ).

Page 18: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Example

A scientist wants to determine if a loonie is a fair coin. He carries out an experiment where he flips the coin1,000,000 times, and counts the number of heads. Heads come up 543,123 times. Using these data, test the fairnessof the loonie.

Page 19: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Inference about means

Z =Y − μ

σY

=Y − μ

σ n

Because is normally distributed:

Y

Page 20: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

But... We don’t know

t =Y − μ

s / n

A good approximation to the standard normal is then:

Because we estimated s, t is not exactly a standard normal!

Page 21: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

t has a Student’s t distribution

}

Page 22: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Degrees of freedom

df = n - 1

Degrees of freedom for the student’s t distribution for a sample mean:

Page 23: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Confidence interval for a mean

Y ± 2 * SEY

Page 24: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Confidence interval for a mean

Y ± SEY

tα 2( ), df

(2) = 2-tailed significance level

Df = degrees of freedom

SEY = standard error of the mean

Page 25: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

95% confidence interval for a meanExample: Paradise flying snakes

0.9, 1.4, 1.2, 1.2, 1.3, 2.0, 1.4, 1.6

Undulation rates (in Hz)

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Page 26: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Estimate the mean and standard deviation

Y =1.375

s = 0.324

n = 8

Page 27: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Find the standard error

Y ± SEY

tα 2( ),df

SEY

=s

n=

0.324

8= 0.115

Page 28: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

df (1)=0.1

(2)=0.2

(1)=0.05

(2)=0.10

(1)=0.025

(2)=0.05

(1)=0.01

(2)=0.02

(1)=0.005

(2)=0.01

(1)=0.001

(2)=0.0021 3.08 6.31 12.71 31.82 63.66 318.312 1.89 2.92 4.3 6.96 9.92 22.333 1.64 2.35 3.18 4.54 5.84 10.214 1.53 2.13 2.78 3.75 4.6 7.175 1.48 2.02 2.57 3.36 4.03 5.896 1.44 1.94 2.45 3.14 3.71 5.217 1.41 1.89 2.36 3. 3.5 4.798 1.4 1.86 2.31 2.9 3.36 4.5

Table A3.3

Page 29: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Find the critical value of t

df = n −1 = 7

tα 2( ),df = t0.05 2( ),7 = 2.36

Page 30: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Putting it all together...

Y ± SEY

tα 2( ),df =1.375 ± 0.115 2.36( )

=1.375 ± 0.271

1.10 < μ <1.65

Page 31: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

99% confidence interval

tα 2( ),df = t0.01 2( ),7 = 3.50

Y ± SEY

tα 2( ),df =1.375 ± 0.115 3.50( )

=1.375 ± 0.403

0.97 < μ <1.78

Page 32: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Confidence interval for the variance

df s2

χ α

2,df

2≤ σ 2 ≤

df s2

χ1−

α

2,df

2

Page 33: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

χ2

Frequency

χ2

1- α /2

χ2

α /2

2.5%

2.5%

α = 0.05

Page 34: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

χ2

,df

2 = χ 0.025,72 =16.01

χ1−

α

2,df

2 = χ 0.975,72 =1.69

dfX 0.999 0.995 0.99 0.975 0.95 0.05 0.025 0.01 0.005 0.001

1 1.6E-6

3.9E-5 0.00016 0.00098 0.00393 3.84 5.02 6.63 7.88 10.83

2 0 0.01 0.02 0.05 0.1 5.99 7.38 9.21 10.6 13.823 0.02 0.07 0.11 0.22 0.35 7.81 9.35 11.34 12.84 16.274 0.09 0.21 0.3 0.48 0.71 9.49 11.14 13.28 14.86 18.475 0.21 0.41 0.55 0.83 1.15 11.07 12.83 15.09 16.75 20.526 0.38 0.68 0.87 1.24 1.64 12.59 14.45 16.81 18.55 22.467 0.6 0.99 1.24 1.69 2.17 14.07 16.01 18.48 20.28 24.328 0.86 1.34 1.65 2.18 2.73 15.51 17.53 20.09 21.95 26.12

Table A3.1

Page 35: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

95% confidence interval for the variance of flying snake

undulation rate

df s2

χ α

2,df

2≤ σ 2 ≤

df s2

χ1−

α

2,df

2

7 0.324( )2

16.01≤ σ 2 ≤

7 0.324( )2

1.69

0.0459 ≤ σ 2 ≤ 0.435

Page 36: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

95% confidence interval for the standard deviation of

flying snake undulation rate

df s2

χ α

2,df

2≤ σ ≤

df s2

χ1−

α

2,df

2

0.0459 ≤ σ ≤ 0.435

0.21≤ σ ≤ 0.66

Page 37: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

One-sample t-test

The one-sample t-test compares the mean of a random

sample from a normal population with the population mean

proposed in a null hypothesis.

Page 38: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Hypotheses for one-sample t-tests

H0 : The mean of the population is 0.

HA: The mean of the population is not 0.

Page 39: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Test statistic for one-sample t-test

t =Y − μ 0

s / n

0 is the mean value proposed by H0

Page 40: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Example: Human body temperature

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

H0 : Mean healthy human body temperature is 98.6ºF

HA: Mean healthy human body temperature is not 98.6ºF

Page 41: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Human body temperature

n = 24

Y = 98.28

s = 0.940

t =Y − μ0

s / n=

98.28 − 98.6

0.940 / 24= −1.67

Page 42: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Degrees of freedom

df = n-1 = 23

Page 43: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Comparing t to its distribution to find

the P-value

Page 44: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

A portion of the t table

df (1)=0.1

(2)=0.2

(1)=0.05

(2)=0.10

(1)=0.025

(2)=0.05

(1)=0.01

(2)=0.02

(1)=0.005

(2)=0.01... ... ... ... ... ...20 1.33 1.72 2.09 2.53 2.8521 1.32 1.72 2.08 2.52 2.8322 1.32 1.72 2.07 2.51 2.8223 1.32 1.71 2.07 2.5 2.8124 1.32 1.71 2.06 2.49 2.825 1.32 1.71 2.06 2.49 2.79

Page 45: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population
Page 46: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

-1.67 is closer to 0 than -2.07, so P >

With these data, we cannot reject the null hypothesis that the mean human body temperature is 98.6.

Page 47: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Body temperature revisited: n = 130

n =130

Y = 98.25

s = 0.733

t =Y − μ0

s / n=

98.25 − 98.6

0.733/ 130= −5.44

Page 48: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

Body temperature revisited: n = 130

t = −5.44

t0.05(2),129 = ±1.98

t is further out in the tail than the critical value, so we could reject the null hypothesis. Human body temperature is not 98.6ºF.

Page 49: The Normal Distribution. n = 20,290  = 2622.0  = 2037.9 Population

One-sample t-test: Assumptions

• The variable is normally distributed.

• The sample is a random sample.