hypothesis testing 5th - 9th december 2011, rome

32
Hypothesis testing 5th - 9th December 2011, Rome

Upload: jerry-bennis

Post on 01-Apr-2015

217 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Hypothesis testing 5th - 9th December 2011, Rome

Hypothesis testing

5th - 9th December 2011, Rome

Page 2: Hypothesis testing 5th - 9th December 2011, Rome

Hypothesis testing

Hypothesis testing involves:1. defining research questions and2. assessing whether changes in an independent variable are

associated with changes in the dependent variable by conducting a statistical test

Dependent and independent variables Dependent variables are the outcome variables Independent variables are the predictive/ explanatory variables

Page 3: Hypothesis testing 5th - 9th December 2011, Rome

Examples…

Research question: Is educational level of the mother related to birthweight?

What is the dependent and independent variable?

Research question: Is access to roads related to educational level of mothers?

Now?

Page 4: Hypothesis testing 5th - 9th December 2011, Rome

Tests statistics To test hypotheses, we rely on test statistics… Test statistics are simply the result of a particular

statistical test

The most common include: T-tests calculate T-statistics ANOVAs calculate F-statistics Correlations calculate the pearson correlation coefficient

Page 5: Hypothesis testing 5th - 9th December 2011, Rome

Significant test statistic Is the relationship observed by chance, or because there actually is a

relationship between the variables???

This probability is referred to as a p-value and is expressed a decimal percent (ie. p=0.05)

If the probability of obtaining the value of our test statistic by chance is less than 5% then we generally accept the experimental hypothesis as true: there is an effect on the population

Ex: if p=0.1-- What does this mean? Do we accept the experimental hypothesis?

This probability is also referred to as significance level (sig.)

Page 6: Hypothesis testing 5th - 9th December 2011, Rome

Statistical significance

Page 7: Hypothesis testing 5th - 9th December 2011, Rome

Hypothesis testing Part 1: Continuous variables

Page 8: Hypothesis testing 5th - 9th December 2011, Rome

Topics to be covered in this presentation

T- test One way analysis of variance (ANOVA) Correlation

Page 9: Hypothesis testing 5th - 9th December 2011, Rome

Hypothesis testing…WFP tests a variety of hypothesis…

Some of the most common include:

1. Looking at differences between groups of people (comparisons of means)

Ex. Are different livelihood groups more likely to have different levels food consumption??

2. Looking at the relationship between two variables…Ex. Is asset wealth associated with food consumption??

Page 10: Hypothesis testing 5th - 9th December 2011, Rome

How to assess differences in two means statistically

T-tests

Page 11: Hypothesis testing 5th - 9th December 2011, Rome

T-testA test using the t-statistic that establishes whether two means differ significantly.

Independent means t-test: It is used in situations in which there are two experimental

conditions and different participants have been used in each condition.

Dependent or paired means t-test: This test is used when there are two experimental

conditions and the same participants took part in both conditions of experiment.

Page 12: Hypothesis testing 5th - 9th December 2011, Rome

T-test: assumptions

Independent T-tests works well if:

continuous variables groups to compare are composed of different people within each group, variable’s values are normally distributed there is the same level of homogeneity in the 2 groups.

Page 13: Hypothesis testing 5th - 9th December 2011, Rome

Normal distribution

Normal distributions are perfect symmetrical around the mean (mean is equal to zero)

Values close to the mean (zero) have higher frequency.

Values very far from the mean are less likely to occur (lower frequency)

Page 14: Hypothesis testing 5th - 9th December 2011, Rome

Variance

Variance measures how cases are similar on a specific variable (level of homogeneity)

V = sum of all the squared distances from the Mean / N

Variance is low → cases are very similar to the mean of the distribution (and to each other). The group of cases is therefore homogeneous (on this variable)

Variance is high → cases tend to be very far from the mean (and different from each other). The group of cases is therefore heterogeneous (on this variable)

Page 15: Hypothesis testing 5th - 9th December 2011, Rome

Homogeneity of Variance

T-test works well if the two groups have the same homogeneity (variance) on the variable. If one group is very homogeneous and the another is not, T-test fails.

Page 16: Hypothesis testing 5th - 9th December 2011, Rome

The independent t-test

The independent t-test compares two means, when those means have come from different groups of people;

Page 17: Hypothesis testing 5th - 9th December 2011, Rome

To conduct an independent t-test in SPSS

1. Click on “Analyze” drop down menu2. Click on “Compare Means”3. Click on “Independent- Sample T-Test…”4. Move the independent and dependent variable into

proper boxes5. Click “OK”

Page 18: Hypothesis testing 5th - 9th December 2011, Rome

T-test: SPSS procedure

Drag the variables into the proper boxes

define values for the independent variable

Page 19: Hypothesis testing 5th - 9th December 2011, Rome

One note of caution about independent t-testsIt is important to ensure that the assumption of homogeneity of variance is met:

To do so:

Look at the column labelled Levene’s Test for Equality of Variance.

If the Sig. value is less than .05 then the assumption of homogeneity of variance has been broken and you should look at the row in the table labelled Equal variances not assumed.

If the Sig. value of Levene’s test is bigger than .05 then you should look at the row in the table labelled Equal variances assumed.

Page 20: Hypothesis testing 5th - 9th December 2011, Rome

T-test: SPSS output

Look at the Levene’s Test …

If the Sig. value of the test is less than .05, groups have different variance. Read the row “Equal variances not assumed”

If the Sig. value of test is bigger than .05, read the row labelled “Equal variances assumed”

Independent Samples Test

.004 .950 -.791 1147 .429 -1.47311 1.86149 -5.12542 2.17921

-.791 1140.469 .429 -1.47311 1.86261 -5.12764 2.18143

Equal variancesassumed

Equal variancesnot assumed

coping strategies indexF Sig.

Levene's Test forEquality of Variances

t df Sig. (2-tailed)Mean

DifferenceStd. ErrorDifference Lower Upper

95% ConfidenceInterval of the

Difference

t-test for Equality of Means

Group Statistics

581 40.9019 30.70829 1.27399

568 42.3750 32.38332 1.35877

beneficiary householdas per CP records1

2

coping strategies indexN Mean Std. Deviation

Std. ErrorMean

Page 21: Hypothesis testing 5th - 9th December 2011, Rome

What to do if we want to statistically compare differences in three means?

Analysis of variance

(ANOVA)

Page 22: Hypothesis testing 5th - 9th December 2011, Rome

Analysis of Variance (ANOVA) ANOVAs test tells us if there are any difference among the

different means but not how (or which) means differ.

ANOVAs are similar to t-tests and in fact an ANOVA conducted to compare two means will give the same answer as a t-test.

Page 23: Hypothesis testing 5th - 9th December 2011, Rome

Calculating an ANOVA

ANOVA formulas: calculating an ANOVA by hand is complicated and knowing the formulas are not necessary…

Instead, we will rely on SPSS to calculate ANOVAs…

Page 24: Hypothesis testing 5th - 9th December 2011, Rome

Example of One-Way ANOVAs

Report

WAZNEW

-1.3147 736 1.32604

-1.0176 3247 1.21521

-.5525 907 1.25238

-.1921 172 1.33764

-.9494 5062 1.27035

Mother's education level

No education

Primary

Secondary

Higher

Total

Mean N Std. Deviation

ANOVA

WAZNEW

354.567 3 118.189 76.507 .000

7812.148 5057 1.545

8166.715 5060

Between Groups

Within Groups

Total

Sum of Squares df Mean Square F Sig.

Research question: Do mean child malnutrition (GAM) rates differ according to mother’s educational level (none, primary, or secondary/ higher)?

Page 25: Hypothesis testing 5th - 9th December 2011, Rome

To calculate one-way ANOVAs in SPSSIn SPSS, one-way ANOVAs are run using the following steps: Click on “Analyze” drop down menu

1. Click on “Compare Means”

2. Click on “One-Way ANOVA…”

3. Move the independent and dependent variable into proper boxes

4. Click “OK”

Page 26: Hypothesis testing 5th - 9th December 2011, Rome

ANOVA: SPSS procedure

1. Analyze; compare means; one-way ANOVA

2. Drag the independent and dependent variable into proper boxes

3. Ask for the descriptive

4. Click on ok

Page 27: Hypothesis testing 5th - 9th December 2011, Rome

ANOVA: SPSS output

ANOVA

coping strategies index

25600.110 10 2560.011 2.609 .004

1116564 1138 981.163

1142164 1148

Between Groups

Within Groups

Total

Sum ofSquares df Mean Square F Sig.

Along with the mean for each group, ANOVA produces the F-statistic. It tells us if there are differences between the means. It does not tell which means are different.

Look at the F’s value and at the Sig. level

Page 28: Hypothesis testing 5th - 9th December 2011, Rome

Determining where differences existIn addition to determining that differences exist among the means, you may want to know which means differ.

There is one type of test for comparing means: Post hoc tests are run after the experiment has been

conducted (if you don’t have specific hypothesis).

Page 29: Hypothesis testing 5th - 9th December 2011, Rome

ANOVA post hoc testsOnce you have determined that differences exist among the means, post hoc range tests and pairwise multiple comparisons can determine which means differ.

Tukeys post hoc test is the amongst the most popular and are adequate for our purposes…so we will focus on this test…

Page 30: Hypothesis testing 5th - 9th December 2011, Rome

To calculate Tukeys test in SPSSIn SPSS, Tukeys post hoc tests are run using the following

steps:1. Click on “Analyze” drop down menu2. Click on “Compare Means”3. Click on “One-Way ANOVA…”4. Move the independent and dependent variable into proper boxes5. Click on “Post Hoc…”6. Check box beside “Tukey”7. Click “Continue”8. Click “OK”

Page 31: Hypothesis testing 5th - 9th December 2011, Rome

Determining where differences exist in SPSS

Once you have determined that differences exist among the means → you may want to know which means differ…

Different types of tests exist for pairwise multiple comparisons

Page 32: Hypothesis testing 5th - 9th December 2011, Rome

Pairwise comparisons: SPSS outputOnce you have decided which post-hoc test is appropriate

Look at the column “mean difference” to know the difference between each pair

Look at the column Sig.: if the value is less than .05 then the means of the two pairs are significantly different

Multiple Comparisons

Dependent Variable: coping strategies index

Tukey HSD

8.5403* 1.6796 .000 4.599 12.481

22.5906* 2.7341 .000 16.175 29.006

-8.5403* 1.6796 .000 -12.481 -4.599

14.0503* 2.5873 .000 7.979 20.121

-22.5906* 2.7341 .000 -29.006 -16.175

-14.0503* 2.5873 .000 -20.121 -7.979

(J) asset wealthasset medium

asset rich

asset poor

asset rich

asset poor

asset medium

(I) asset wealthasset poor

asset medium

asset rich

MeanDifference

(I-J) Std. Error Sig. Lower Bound Upper Bound

95% Confidence Interval

The mean difference is significant at the .05 level.*.