© 2002 prentice-hall, inc.chap 14-1 introduction to multiple regression model

68
© 2002 Prentice-Hall, Inc. Chap 14-1 Introduction to Multiple Regression Model

Upload: clinton-gilbert-underwood

Post on 27-Dec-2015

227 views

Category:

Documents


1 download

TRANSCRIPT

© 2002 Prentice-Hall, Inc. Chap 14-1

Introduction to Multiple Regression Model

© 2002 Prentice-Hall, Inc. Chap 14-2

Chapter Topics Multiple linear regression (MLR) model Residual analysis Influence analysis Testing for the significance of the

regression model Inferences on the population regression

coefficients Testing portions of the multiple

regression model

© 2002 Prentice-Hall, Inc. Chap 14-3

0 1 1 2 2i i i k ki iY b b X b X b X e

Population Y-intercept

Population slopesRandom Error

Multiple Linear Regression Model

A relationship between one dependent and two or more independent variables is a

linear function

Dependent (Response) variable for sample

Independent (Explanatory) variables for sample model

1 2i i i k ki iY X X X

Residual

© 2002 Prentice-Hall, Inc. Chap 14-4

Population Multiple Regression Model

X2

Y

X1YX = 0 + 1X 1i + 2X 2i

0

Y i = 0 + 1X 1i + 2X 2i + i

ResponsePlane

(X 1i,X 2i)

(O bserved Y )

i

X2

Y

X1YX = 0 + 1X 1i + 2X 2i

0

Y i = 0 + 1X 1i + 2X 2i + i

ResponsePlane

(X 1i,X 2i)

(O bserved Y )

i

Bivariate model

© 2002 Prentice-Hall, Inc. Chap 14-5

Sample Multiple Regression Model

X2

Y

X1

b0

Y i = b0 + b1X 1 i + b2X 2 i + e i

ResponsePlane

(X 1i, X 2i)

(O bserved Y)

^

e i

Y i = b0 + b1X 1 i + b2X 2 i

X2

Y

X1

b0

Y i = b0 + b1X 1 i + b2X 2 i + e i

ResponsePlane

(X 1i, X 2i)

(O bserved Y)

^

e i

Y i = b0 + b1X 1 i + b2X 2 i

Bivariate model

Sample Regression PlaneSample Regression Plane

© 2002 Prentice-Hall, Inc. Chap 14-6

Simple and Multiple LinearRegression Compared:

Example

Two simple regressions:

Multiple regression:

0 1

0 1

Oil Temp

Oil Insulation

0 1 2Oil Temp Insulation

© 2002 Prentice-Hall, Inc. Chap 14-7

Multiple Linear Regression Equation

Too complicated

by hand! Ouch!

© 2002 Prentice-Hall, Inc. Chap 14-8

Interpretation of Estimated Coefficients

Slope (bi) Estimated that the average value of Y changes

by bi for each one unit increase in Xi holding all other variables constant (ceterus paribus)

Example: if b1 = -2, then fuel oil usage (Y) is expected to decrease by an estimated two gallons for each one degree increase in temperature (X1) given the inches of insulation (X2)

Y-intercept (b0) The estimated average value of Y when all Xi = 0

© 2002 Prentice-Hall, Inc. Chap 14-9

Multiple Regression Model: Example

Oil (Gal) Temp Insulation275.30 40 3363.80 27 3164.30 40 1040.80 73 694.30 64 6

230.90 34 6366.70 9 6300.60 8 10237.80 23 10121.40 63 331.40 65 10

203.50 41 6441.10 21 3323.00 38 352.50 58 10

(0F)

Develop a model for estimating heating oil used for a single family home in the month of January based on average temperature and amount of insulation in inches.

© 2002 Prentice-Hall, Inc. Chap 14-10

1 2ˆ 562.151 5.437 20.012i i iY X X

Sample Multiple Regression Equation: Example

CoefficientsIntercept 562.1510092X Variable 1 -5.436580588X Variable 2 -20.01232067

Excel Output

For each degree increase in temperature, the estimated average amount of heating oil used is decreased by 5.437 gallons, holding insulation constant.

For each increase in one inch of insulation, the estimated average use of heating oil is decreased by 20.012 gallons, holding temperature constant.

0 1 1 2 2i i i k kiY b b X b X b X

© 2002 Prentice-Hall, Inc. Chap 14-11

Venn Diagrams and Explanatory Power of

Regression

Oil

Temp

Variations in oil explained by temp or variations in temp used in explaining variation in oil

Variations in oil explained by the error term

Variations in temp not used in explaining variation in Oil

SSE

SSR

© 2002 Prentice-Hall, Inc. Chap 14-12

Venn Diagrams and Explanatory Power of

Regression

Oil

Temp

2

r

SSR

SSR SSE

(continued)

© 2002 Prentice-Hall, Inc. Chap 14-13

Venn Diagrams and Explanatory Power of

Regression

Oil

TempInsulation

Overlapping Overlapping variation in both Temp and Insulation are used in explaining the variationvariation in Oil but NOTNOT in the estimationestimation of nor

12

Variation NOTNOT explained by Temp nor Insulation SSE

© 2002 Prentice-Hall, Inc. Chap 14-14

Coefficient of Multiple Determination

Proportion of total variation in Y explained by all X variables taken together

Never decreases when a new X variable is added to model Disadvantage when comparing models

212

Explained Variation

Total VariationY k

SSRr

SST

© 2002 Prentice-Hall, Inc. Chap 14-15

Venn Diagrams and Explanatory Power of

Regression

Oil

TempInsulation

212

Yr

SSR

SSR SSE

© 2002 Prentice-Hall, Inc. Chap 14-16

Adjusted Coefficient of Multiple Determination

Proportion of variation in Y explained by all X variables adjusted for the number of X variables used

Penalize excessive use of independent variables

Smaller than Useful in comparing among models

2 212

11 1

1adj Y k

nr r

n k

212Y kr

© 2002 Prentice-Hall, Inc. Chap 14-17

Coefficient of Multiple Determination

Regression StatisticsMultiple R 0.982654757R Square 0.965610371Adjusted R Square 0.959878766Standard Error 26.01378323Observations 15

Excel Output

SST

SSRr ,Y 2

12

Adjusted r2

reflects the number of explanatory variables and sample size

is smaller than r2

© 2002 Prentice-Hall, Inc. Chap 14-18

Interpretation of Coefficient of Multiple Determination

96.56% of the total variation in heating oil can be explained by difference in temperature and amount of insulation

95.99% of the total fluctuation in heating oil can be explained by difference in temperature and amount of insulation after adjusting for the number of explanatory variables and sample size

2,12 .9656Y

SSRr

SST

2adj .9599r

© 2002 Prentice-Hall, Inc. Chap 14-19

Using The Model to Make Predictions

Predict the amount of heating oil used for a home if the average temperature is 300 and the insulation is six inches.

The predicted heating oil used is 278.97 gallons

1 2

ˆ 562.151 5.437 20.012

562.151 5.437 30 20.012 6

278.969

i i iY X X

© 2002 Prentice-Hall, Inc. Chap 14-20

Residual Plots

Residuals vs. May need to transform Y variable

Residuals vs. May need to transform variable

Residuals vs. May need to transform variable

Residuals vs. time May have autocorrelation

Y

1X

2X1X

2X

© 2002 Prentice-Hall, Inc. Chap 14-21

Residual Plots: Example

Insulation Residual Plot

0 2 4 6 8 10 12

No discernable pattern

Temperature Residual Plot

-60

-40

-20

0

20

40

60

0 20 40 60 80

Re

sid

ua

ls

May be some non-linear relationship

© 2002 Prentice-Hall, Inc. Chap 14-22

Influence Analysis To determine observations that have

influential effect on the fitted model Potentially influential points become

candidates for removal from the model Criteria used are

The hat matrix elements hi

The Studentized deleted residuals ti*

Cook’s distance statistic Di

All three criteria are complementary Only when all three criteria provide

consistent results should an observation be removed

© 2002 Prentice-Hall, Inc. Chap 14-23

The Hat Matrix Element hi

If , Xi is an Influential Point Xi may be considered a candidate for

removal from the model

2

2

1

1 i

i n

ii

X Xh

n X X

2 1 /ih k n

© 2002 Prentice-Hall, Inc. Chap 14-24

The Hat Matrix Element hi :Heating Oil Example

Oil (Gal) Temp Insulation h i

275.30 40 3 0.1567363.80 27 3 0.1852164.30 40 10 0.175740.80 73 6 0.246794.30 64 6 0.1618

230.90 34 6 0.0741366.70 9 6 0.2306300.60 8 10 0.3521237.80 23 10 0.2268121.40 63 3 0.244631.40 65 10 0.2759

203.50 41 6 0.0676441.10 21 3 0.2174323.00 38 3 0.157452.50 58 10 0.2268

No hi > 0.4 No observation appears to be a candidate for removal from the model

15 2

2 1 / 0.4

n k

k n

© 2002 Prentice-Hall, Inc. Chap 14-25

The Studentized Deleted Residuals ti

*

: difference between the observed and predicted based on a model that includes all observations except observation i

: standard error of the estimate for a model that includes all observations except observation i

An observation is considered influential if is the critical value of a two-tail test at a

alpha level of significance

* 1

i

i

ii

et

S h

iY

iY

iS

ie

*2i n kt t

2n kt

© 2002 Prentice-Hall, Inc. Chap 14-26

The Studentized Deleted Residuals ti

* :ExampleOil (Gal) Temp Insulation t i

*

275.30 40 3 -0.3772363.80 27 3 0.3474164.30 40 10 0.824340.80 73 6 -0.187194.30 64 6 0.0066

230.90 34 6 -1.0571366.70 9 6 -1.1776300.60 8 10 -0.8464237.80 23 10 0.0341121.40 63 3 -1.853631.40 65 10 1.0304

203.50 41 6 -0.6075441.10 21 3 2.9674323.00 38 3 1.168152.50 58 10 0.2432

2 11

15 2

1.7957n k

n k

t t

t10* and t13

* are influential points for potential removal from the model

*10t

*13t

© 2002 Prentice-Hall, Inc. Chap 14-27

Cook’s Distance Statistic Di

is the Studentized residual

If , an observation is considered influential

is the critical value of the F distribution at a 50% level of significance

2

2 1i i

ii

SR hD

h

1i

i

YX i

eSR

S h

1, 1i k n kD F

1, 1k n kF

© 2002 Prentice-Hall, Inc. Chap 14-28

Cook’s Distance Statistic Di : Heating Oil Example

Oil (Gal) Temp Insulation D i

275.30 40 3 0.0094363.80 27 3 0.0098164.30 40 10 0.049640.80 73 6 0.004194.30 64 6 0.0001

230.90 34 6 0.0295366.70 9 6 0.1342300.60 8 10 0.1328237.80 23 10 0.0001121.40 63 3 0.308331.40 65 10 0.1342

203.50 41 6 0.0094441.10 21 3 0.4941323.00 38 3 0.082452.50 58 10 0.0062

No Di > 0.835 No observation appears to be candidate for removal from the modelUsing the three criteria, there is insufficient evidence for the removal of any observation from the model

1, 1 3,12

15 2

0.835k n k

n k

F F

© 2002 Prentice-Hall, Inc. Chap 14-29

Testing for Overall Significance

Show if there is a linear relationship between all of the X variables together and Y

Use F test statistic Hypotheses:

H0: …k = 0 (no linear relationship) H1: at least one i ( at least one independent

variable affects Y ) The null hypothesis is a very strong statement Almost always reject the null hypothesis

© 2002 Prentice-Hall, Inc. Chap 14-30

Testing for Overall Significance

Test statistic:

where F has p numerator and (n-p-1) denominator degrees of freedom

(continued)

all /

all

SSR pMSRF

MSE MSE

© 2002 Prentice-Hall, Inc. Chap 14-31

Test for Overall SignificanceExcel Output: Example

ANOVAdf SS MS F Significance F

Regression 2 228014.6 114007.3 168.4712 1.65411E-09Residual 12 8120.603 676.7169Total 14 236135.2

p = 2, the number of explanatory variables n - 1

p value

Test StatisticMSR

FMSE

© 2002 Prentice-Hall, Inc. Chap 14-32

Test for Overall SignificanceExample Solution

F0 3.89

H0: 1 = 2 = … = p = 0

H1: At least one i 0 = .05df = 2 and 12

Critical Value(s):

Test statistic:

Decision:

Conclusion:

Reject at = 0.05

There is evidence that at least one independent variable affects Y

= 0.05

F 168.47(Excel Output)

© 2002 Prentice-Hall, Inc. Chap 14-33

Test for Significance:Individual Variables

Show whether there is a linear relationship between the variable Xi and Y

Use t Test Statistic Hypotheses:

H0: i 0 (No linear relationship) H1: i 0 (Linear relationship between Xi and

Y)

© 2002 Prentice-Hall, Inc. Chap 14-34

t Test StatisticExcel Output: Example

Coefficients Standard Error t StatIntercept 562.1510092 21.09310433 26.65093769X Variable 1 -5.436580588 0.336216167 -16.16989642X Variable 2 -20.01232067 2.342505227 -8.543127434

t Test Statistic for X1 (Temperature)

t Test Statistic for X2 (Insulation)

i

i

b

bt

S

© 2002 Prentice-Hall, Inc. Chap 14-35

t Test : Example Solution

H0: 1 = 0

H1: 1 0

df = 12

Critical Value(s):

Test Statistic:

Decision:

Conclusion:

Reject H0 at = 0.05

There is evidence of a significant effect of temperature on oil consumption.

t0 2.1788-2.1788

.025

Reject H0 Reject H0

.025

Does temperature have a significant effect on monthly consumption of heating oil? Test at = 0.05.

t Test Statistic = -16.1699

© 2002 Prentice-Hall, Inc. Chap 14-36

Venn Diagrams and Estimation of Regression

Model

Oil

TempInsulation

Only this information is used in the estimation of 2

Only this information is used in the estimation of

1This information is NOT used in the estimation of nor1 2

© 2002 Prentice-Hall, Inc. Chap 14-37

Confidence Interval Estimate

for the Slope

Provide the 95% confidence interval for the population slope 1 (the effect of temperature on oil consumption).

11 1n p bb t S

Coefficients Lower 95% Upper 95%Intercept 562.151009 516.1930837 608.108935X Variable 1 -5.4365806 -6.169132673 -4.7040285X Variable 2 -20.012321 -25.11620102 -14.90844

-6.169 1 -4.704

The estimated average consumption of oil is reduced by between 4.7 gallons to 6.17 gallons per each increase of 10 F.

© 2002 Prentice-Hall, Inc. Chap 14-38

Contribution of a Single Independent Variable

Let Xk be the independent variable of interest

Measures the contribution of Xk in explaining the total variation in Y (SST)

kX

| all others except

all all others except

k k

k

SSR X X

SSR SSR X

© 2002 Prentice-Hall, Inc. Chap 14-39

Contribution of a Single Independent Variable kX

1 2 3

1 2 3 2 3

| and

, and and

SSR X X X

SSR X X X SSR X X

Measures the contribution of in explaining SST

1X

From ANOVA section of regression for

From ANOVA section of regression for

0 1 1 2 2 3 3i i i iY b b X b X b X 0 2 2 3 3i i iY b b X b X

© 2002 Prentice-Hall, Inc. Chap 14-40

Coefficient of Partial Determination of

Measures the proportion of variation in the dependent variable that is explained by Xk while controlling for (holding constant) the other independent variables

2 all others

| all others

all | all others

Yk

k

k

r

SSR X

SST SSR SSR X

kX

© 2002 Prentice-Hall, Inc. Chap 14-41

Coefficient of Partial Determination for kX

(continued)

1 221 2

1 2 1 2

|

, |Y

SSR X Xr

SST SSR X X SSR X X

Example: Two Independent Variable Model

© 2002 Prentice-Hall, Inc. Chap 14-42

Venn Diagrams and Coefficient of Partial Determination forkX

Oil

TempInsulation

1 2|SSR X X

21 2

1 2

1 2 1 2

|

, |

Yr

SSR X X

SST SSR X X SSR X X

=

© 2002 Prentice-Hall, Inc. Chap 14-43

Contribution of a Subset of Independent Variables

Let Xs be the subset of independent variables of interest

Measures the contribution of the subset xs in explaining SST

| all others except

all all others except

s s

s

SSR X X

SSR SSR X

© 2002 Prentice-Hall, Inc. Chap 14-44

Contribution of a Subset of Independent Variables:

Example

Let Xs be X1 and X3

1 3 2

1 2 3 2

and |

, and

SSR X X X

SSR X X X SSR X

From ANOVA section of regression for

From ANOVA section of regression for

0 1 1 2 2 3 3i i i iY b b X b X b X 0 2 2i iY b b X

© 2002 Prentice-Hall, Inc. Chap 14-45

Testing Portions of Model

Examines the contribution of a subset Xs of explanatory variables to the relationship with Y

Null hypothesis: Variables in the subset do not

significantly improve the model when all other variables are included

Alternative hypothesis: At least one variable is significant

© 2002 Prentice-Hall, Inc. Chap 14-46

Testing Portions of Model

Always one-tailed rejection region Requires comparison of two regressions

One regression includes everything Another regression includes everything

except the portion to be tested

(continued)

© 2002 Prentice-Hall, Inc. Chap 14-47

Partial F Test For Contribution of Subset of X

variables Hypotheses:

H0 : Variables Xs do not significantly improve the model given all others variables included

H1 : Variables Xs significantly improve the model given all others included

Test Statistic:

with df = m and (n-p-1) m = # of variables in the subset Xs

| all others /

allsSSR X m

FMSE

© 2002 Prentice-Hall, Inc. Chap 14-48

Partial F Test For Contribution of A Single

Hypotheses: H0 : Variable Xj does not significantly improve

the model given all others included H1 : Variable Xj significantly improves the

model given all others included Test Statistic:

With df = 1 and (n-p-1) m = 1 here

jX

| all others

alljSSR X

FMSE

© 2002 Prentice-Hall, Inc. Chap 14-49

Testing Portions of Model: Example

Test at the = .05 level to determine whether the variable of average temperature significantly improves the model given that insulation is included.

© 2002 Prentice-Hall, Inc. Chap 14-50

Testing Portions of Model: Example

H0: X1 (temperature) does not improve model with X2 (insulation) included

H1: X1 does improve model

= .05, df = 1 and 12

Critical Value = 4.75

ANOVASS

Regression 51076.47Residual 185058.8Total 236135.2

ANOVASS MS

Regression 228014.6263 114007.313Residual 8120.603016 676.716918Total 236135.2293

(For X1 and X2) (For X2)

Conclusion: Reject H0; X1 does improve model

1 2

1 2

| 228,015 51,076261.47

, 676.717

SSR X XF

MSE X X

© 2002 Prentice-Hall, Inc. Chap 14-51

When to Use the F test

The F test for the inclusion of a single variable after all other variables are included in the model is IDENTICAL to the t test of the slope for that variable

The only reason to do an F test is to test several variables together

© 2002 Prentice-Hall, Inc. Chap 14-52

Chapter Summary

Developed the multiple regression model

Discussed residual plots Presented influence analysis Addressed testing the significance

of the multiple regression model Discussed inferences on population

regression coefficients Addressed testing portion of the

multiple regression model

© 2002 Prentice-Hall, Inc. Chap 14-53

nieXXY iippii ,,2,1,110

Multiple Linear Regression

npnpn

p

p

n e

e

e

e

XX

XX

XX

X

Y

Y

Y

whereY

2

1

1

0

1

221

111

2

1

,,

1

1

1

,

Data Model:

Matrix Model: eXY

© 2002 Prentice-Hall, Inc. Chap 14-54

ppp

n

iippiip

p

XXXXwhereXYXXY

XXYQ

Q

FittingSquaresLeast

,,,,1

,,,

,,,,min.1

:

21

22

110

1

211010

10

ˆˆˆˆˆ

,,,minˆ,,ˆ.2

110

100

XXXY

valuefittedtheThen

QimizeLet

pp

pp

noningularisXXiff

YXXXXYXXYYSinceT

TTT

)(

)(ˆ0)ˆ(ˆ 1

© 2002 Prentice-Hall, Inc. Chap 14-55

YYXXY rRt ˆ,,,; 1

Multiple Correlation Coefficient:

Multiple Coefficient of Determination: may be interpreted as the proportion of variance explained by the regression of Y on X.

2ˆ,

2,,;

2

2

1

,

YYXXY

eYY

rRR

SSS

Moreover

t

2

2ˆ2

Y

Y

S

SR

© 2002 Prentice-Hall, Inc. Chap 14-56

)

0

0

..(,0~2

2

2..

eVareiINeassume

eXYModel

dii

YXXXEESolve TT 1ˆ:

12,~ˆ XXN TTheorem:

YEXXX TT 1

XXXX

XEXXX

eXEXXX

TT

TT

TT

1

1

1

© 2002 Prentice-Hall, Inc. Chap 14-57

12,~ˆ XXN T

ˆ,ˆˆ: CovVarSolve

YXXXYXXXCov TTTT 11,

TTTTT XXXYYCovXXX11

,

TTTTT XXXeXVarXXX11

TTTT XXXIXXX121

112 XXXXXX TTT

12 XX T

1

ˆ1

1

1

ˆˆ

1

21

2

2

pn

RSSYY

pnpn

ewhere

n

iii

n

ii

© 2002 Prentice-Hall, Inc. Chap 14-58

DATA;INPUT X1 X2 Y;CARDS;68 60 7549 94 6360 91 57.77 78 72;PROC PRINT; PROC REG;MODEL Y=X1 X2 / COVB CORRB R INFLUENCE; RUN;

© 2002 Prentice-Hall, Inc. Chap 14-59

Model: MODEL1 Dependent Variable: Y Analysis of Variance  Sum of Mean Source DF Squares Square F Value Pr > F  Model 2 1966.20840 983.10420 14.86 0.0002 Error 17 1124.79160 66.16421 Corrected Total 19 3091.00000   Root MSE 8.13414 R-Square 0.6361 Dependent Mean 74.50000 Adj R-Sq 0.5933 Coeff Var 10.91831   Parameter Estimates  Parameter Standard Variable DF Estimate Error t Value Pr > |t|  Intercept 1 14.49614 14.20435 1.02 0.3218 X1 1 0.56319 0.11801 4.77 0.0002 X2 1 0.26736 0.15704 1.70 0.1069

© 2002 Prentice-Hall, Inc. Chap 14-60

Covariance of Estimates 

COVB Intercept X1 X2  Intercept 201.7635339 -0.635820247 -1.851491131 X1 -0.635820247 0.0139252459 -0.003440529 X2 -1.851491131 -0.003440529 0.0246625524 

Correlation of Estimates 

COVB Intercept X1 X2  Intercept 1.0000 -0.3793 -0.8300 X1 -0.3793 1.0000 -0.1857 X2 -0.8300 -0.1857 1.0000

© 2002 Prentice-Hall, Inc. Chap 14-61

Dep Var Predicted Std Error Std Error Student Cook's

Obs Y Value Predict Residual Residual Residual -2-1 0 1 2 D

1 75.0000 68.8346 4.2678 6.1654 6.925 0.890 | |* | 0.100

2 63.0000 67.2242 3.3214 -4.2242 7.425 -0.569 | *| | 0.022

3 57.0000 72.6172 2.2988 -15.6172 7.803 -2.002 | ****| | 0.116

4 88.0000 74.4491 1.9107 13.5509 7.907 1.714 | |*** | 0.057

5 88.0000 90.5143 4.2002 -2.5143 6.966 -0.361 | | | 0.016

6 79.0000 85.2747 2.6984 -6.2747 7.674 -0.818 | *| | 0.028

7 82.0000 67.5089 2.4898 14.4911 7.744 1.871 | |*** | 0.121

8 73.0000 66.4506 2.8567 6.5494 7.616 0.860 | |* | 0.035

9 90.0000 81.2755 2.5928 8.7245 7.710 1.132 | |** | 0.048

10 62.0000 59.7208 3.8097 2.2792 7.187 0.317 | | | 0.009

11 70.0000 77.4755 1.8990 -7.4755 7.909 -0.945 | *| | 0.017

12 96.0000 93.1309 3.8760 2.8691 7.151 0.401 | | | 0.016

13 76.0000 73.9825 2.5281 2.0175 7.731 0.261 | | | 0.002

14 75.0000 80.1776 2.3793 -5.1776 7.778 -0.666 | *| | 0.014

15 85.0000 84.6150 3.2590 0.3850 7.453 0.0517 | | | 0.000

16 40.0000 50.3917 5.9936 -10.3917 5.499 -1.890 | ***| | 1.414

17 74.0000 76.2637 2.1866 -2.2637 7.835 -0.289 | | | 0.002

18 70.0000 69.0846 2.0768 0.9154 7.865 0.116 | | | 0.000

19 75.0000 72.2929 2.6787 2.7071 7.680 0.352 | | | 0.005

20 72.0000 78.7158 2.5093 -6.7158 7.737 -0.868 | *| | 0.026

21 . 83.7560 3.0157 . . . .

© 2002 Prentice-Hall, Inc. Chap 14-62

Hat DiagObs Residual RStudent H 1 6.1654 0.8846 0.2753 2 -4.2242 -0.5572 0.1667 3 -15.6172 -2.2211 0.0799 4 13.5509 1.8281 0.0552 5 -2.5143 -0.3515 0.2666 6 -6.2747 -0.8094 0.1100 7 14.4911 2.0374 0.0937 8 6.5494 0.8530 0.1233 9 8.7245 1.1417 0.1016 10 2.2792 0.3086 0.2194 11 -7.4755 -0.9420 0.0545 12 2.8691 0.3911 0.2271 13 2.0175 0.2537 0.0966 14 -5.1776 -0.6543 0.0856 15 0.3850 0.0501 0.1605 16 -10.3917 -2.0627 0.5429 17 -2.2637 -0.2810 0.0723 18 0.9154 0.1130 0.0652 19 2.7071 0.3432 0.1084 20 -6.7158 -0.8613 0.0952

© 2002 Prentice-Hall, Inc. Chap 14-63

|

100 +

|

| o

| o o o o o

| o o o

90 + o

| o

| o

H |

O | o o

M 80 + o o

E | o

W |

O | o

R |

K 70 +

|

|

|

|

60 + o

|

|

|

|

50 + o

|

-+------------+------------+------------+------------+------------+------------+------------+

30 40 50 60 70 80 90 100

MIDTERM

© 2002 Prentice-Hall, Inc. Chap 14-64

0

00

11 ,.1

rrH

rrH

hypothesisthetesttowantweccrLet

a

pp

::

rdtspntr

rforICaConstruct

cccrLet pp

ˆˆ1ˆ

..%95

.2

025.0

1100

TccVarcVarrVar

andpntrdts

rErtisstatistictesttheThen

ˆˆˆ

,1~ˆˆˆˆ

ˆ

ˆ),,,(,ˆˆˆ0

000

p

ppp cccwhereccrchooseWe

© 2002 Prentice-Hall, Inc. Chap 14-65

setdataaforelbetterachoosetoHow mod

Goodness of Fit

1,~1 0

pnqpFRSS

RSSRSS

qp

pnF

pqjoneleastatH

H

eldrestriceteatoeledunrestrictanreducetoHow

j

pq

,,1,0

0

?modmod

1

10

eXXY

elstricted

eXXY

eledUnrestrict

pqAssume

qq

pp

110

110

modRe

mod

© 2002 Prentice-Hall, Inc. Chap 14-66

?mod datathefittoelbetterachoosetoHow

eYelrestricted

eXYeledunrestrictEx

mod

mod:

00 0 :H

2,1~1

2

0.0

20

10

nRSS

RSSRSSnF

HsvHSolve

:::

.ˆˆ1

2

1

2

0

n

iii

n

ii

XYRSS

YYRSSwhere

© 2002 Prentice-Hall, Inc. Chap 14-67

Regression Effect

eY

elstricted

eXXY

eledUnrestrict

pp

0

110

modRe

mod

2ˆ0 11

:e

Y

S

S

p

pn

RSS

RSSRSS

p

pnFSolve

,0

0210

ja

p

oneleastatH

H

::

2

2

1

1

R

R

p

pnFthen

22ˆ

22ˆ1

Ye

YY

SS

SS

p

pn

2

2

1

1

R

R

p

pn

© 2002 Prentice-Hall, Inc. Chap 14-68

i

ijiij

ijiij

njkifor

RSSeYeledUnrestrict

RSSeXYelstricted

,,2,1,,2,1

:mod

:modRe 0

k

iinn

RSS

RSSRSS

k

knF

1

0 ,2

jiif

jiifdkidddLet ijikkiii ,0

,1,,,2,1,2211

Goodness of Fit for using replicate observations

)(

mod

2211 RSSedddY

asrewrittenbecaneledunrestricttheThen

ijikkiiij