2nd half review anova (ch. 11) non-parametric (7.11, 9.5) regression (ch. 12) ancova categorical...

2nd Half Review

ANOVA (Ch. 11)

Non-Parametric (7.11, 9.5)

Regression (Ch. 12)

ANCOVA

Categorical (Ch. 10)

Correlation (Ch. 12)

The Exam

• Thursday, April 27, 9:00am– TC 348 Abdi to Middleton– TC 348a Minto to Shetty– TC 348b Siddiqui to Zdravic

• 50 Questions• Not Cumulative• 3hr• Bring a calculator• No formula Sheets

• Variance Partitions– Total = Among + Within

• Grand Mean, Group Mean and associated Deviations

• When do we reject based on variance ratio???

ANOVA

ANOVA Table

Source df SS MS F

Among Treatments k-1 SSamong

dfamong

MSamong

MSwithin

Within Treatments n-k SSwithin

dfwithin

Total n-1

k

j

n

ijij XX

1 1

2)(

k

jjj XXn

1

2

k

j

n

iij XX

1 1

2)(

ANOVA

• When do we use??

• Model I vs Model II vs Model III??

• Multi-Factors??

• Main Effects vs Interactions??

Example From TextQuestion #11.40, p. 518

10 women in an aerobic exercise class, 10 women in a modern dance class, and a control group of 9 women were studied. One measurement made on each women was change in fat-free mass over the course of the 16-week training period. Summary statistics are given in the table. The ANOVA SS(between) is 2.465 and the SS(within) is 50.133.

Aerobics Dance Control

Mean 0 0.44 0.71

SD 1.31 1.17 1.68

n 10 10 9

a) State the null hypothesis

b) Construct the ANOVA table and test the null hypothesis (α = 0.05)

Example From TextQuestion #11.57, p. 522

A new investigational drug was given to 4 male and 4 female dogs, at doses 8 mg/kg and 25 mg/kg. The variable recorded was alkaline phosphatase level (U/Li).

Dose (mg/kg) Male Female

8 171 150

154 127

104 152

143 105

Avg 143 133.5

25 80 101

149 113

138 161

131 197

Avg 124.5 143

(SS(sex) = 81, SS(dose) = 81, SS(interaction) = 784, and SS(within) = 12604

a) Construct the ANOVA Tableb) Carry out an F test for

interactions: use (α = 0.05).c) Test the null hypothesis that

does has no effect on alkaline phosphatase level. (α = 0.05)

Extra Questions from the Text

• 11.4-11.6

• 11.9-11.11

• 11.17, 11.19

• 11.42, 11.43, 11.50, 11.54

Non-Parametric

• When to use??– Normality– Homogenous of Variance– Independent Observations

• What about Power??

• What do they use to compare data??

Mann-Whitney Test

• Compares two samples

• Replaces two-sample t-test

U n nn n

R

1 21 1

1

1

2

( )

U n n

n nR2 1

2 22

1

2

( )

If either U or U’ is greater than the critical value of U, then you should reject the Ho

Critical U

If either U or U’ is greater than the critical value of U, then you should reject the Ho

U UCritical n n 0 05 2 1 2. ,( ), , if n1 < n2

U UCritical n n 0 05 2 2 1. ,( ), , if n1 > n2

One-tailed Mann-Whitney U test

Use U or U’ depending on whether you expect sample 1 or sample 2 to be bigger

Ho: G1 G2 HA: G1 < G2

Ho: G1 G2 HA: G1 > G2

Ranking is low to high

U U’

Ranking is high to low

U’ U

Wilcoxon paired sample test

• Compares two paired samples

• Replaces Paired t-test

Deer12345678910

FrontLeg142140144144142146149150142148

BackLeg138136147139143141143145136146

Diff44-35-156562

Rank|d|

SignedRank |d|

Critical TSum the positive ranks - T+

Sum the negative ranks - T-

If either T+ or T- is less than or equal to T0.05, (2),n then reject Ho

Can also do these one tailed:

Ho: Measurement 1 Measurement 2HA: Measurement 1 > Measurement 2

--> reject Ho if T- T0.05, (1),n

Ho: Measurement 1 Measurement 2HA: Measurement 1 < Measurement 2

--> reject Ho if T+ T0.05, (1),n

Kruskal Wallis Test

• Test for three or more groups

• Replaces ANOVA

HN N

R

nNi

ii

k

12

13 1

2

1( )( )

• Rank each individual sample across all groups

• Sum ranks within each group = R

• Critical value - 21,05.0 k

Correction factor for tied ranks

Ct

N N

1

118

24 241 0 0013

0 9987

3

3

.

.

t t ti ii

m

3

1

3 3 32 2 2 2 2 2

18

HH

Cc 44123

0 998744180

.

..

Critical Value

Practice Questions

• Mann-Whitney– 7.79, 7.80, 7.82-7.84

• Wilcoxon– 9.30 – 9.33

• Kruskal-Wallis– 11.54, 11.57 use Kruskal-Wallis instead of

ANOVA

Regression

• Two or more continuous variables• Linear relationship between

XY 10

Intercept Slope

Least Squares

• Line with smallest residual sum of squares

YYiˆResidual

ˆ tois as X tois

ANOVA Table

Source of Variation SS DF MS F

Regression

Residual

Total

( )Y Yi 2

( )Y Yi 2

( )Y Yi 2

n-2

1

n-1

SS

DFgression

gression

Re

Re

SS

DFsidual

sidual

Re

Re

MS

MSgression

sidual

Re

Re

Coefficient of Determination

Total

gression

SS

SSr Re2

- proportion of variation explained

Coefficientsa

-1.047 .243 -4.314 .0015.80E-02 .010 .858 6.013 .000

(Constant)DIAM

Model1

B Std. Error

UnstandardizedCoefficients

Beta

Standardized

Coefficients

t Sig.

Dependent Variable: STRENGTHa.

. .Y X 1047 0 058

SlopeIntercept

Confidence Interval for Slope

1for %95 CI

ˆ2),2(1 *ˆ st n

XY 10ˆˆˆ

Practice Questions

• 12.45

• 12.49-12.54

ANCOVA

• Continuous Dependent

• Continuous and Discrete Independents

• Compares relationship of two variables across two groups

Categorical Data

• Discrete Response variable

• Interested in Frequencies

Chi-Squared Test

• Observed vs Expected Frequencies

22

1

( )exp

exp

f f

fobserved ected

ectedi

k

f expected is hypothesized ratio e.g. 50:50 males to females

21,05.0

2 kcritical

Contingency Table

• Test for independence among variables

• 2x2, 2x3, 3x3 etc.

• 2 variables with 2 levels, 2 variables with 3 levels, 3 variables with 3 levels etc.

22

1

( )exp

exp

f f

fobserved ected

ectedi

k

21,05.0

2 kcritical

Variable AColumn 1 Column 2 Column 3 Total

Row 1 O11 O21 O31 R1

Variable B

Row 2 O12 O22 O31 R2

C1 C2 C3 Total

R1 = sum of observed in Row 1R2 = sum of observed in Row 2C1 = sum of observed in Column 1C2 = sum of observed in Column 2C3 = sum of observed in Column 3Total = sum of all observed

E R C Totalij i j * *

Expected calculation

Variable A Column 1 Column 2 Column 3 Total

Row 1

E11 E21 E31 R1

Variable B Row 2

E12 E22 E32 R2

C1 C2 C3 Total

/

Practice Questions

• 10.73-10.78

• 10.84-10.88

Logistic Regression

• Discrete dependent – usually dichotomous

• Continuous independent

2nd half review anova (ch. 11) non-parametric (7.11, 9.5) regression (ch. 12) ancova categorical...

Documents

sample ttestif

critical value of u

positive ranks t

negative ranks t

g1 g2ha

vs model

individual sample

use kruskalwallis