copyright © 2014 by nelson education limited. 11-1 chapter 11 introduction to bivariate association...

42
Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured at the Nominal Level

Upload: angelina-morton

Post on 17-Jan-2018

232 views

Category:

Documents


0 download

DESCRIPTION

Copyright © 2014 by Nelson Education Limited Two variables are said to be associated when they vary together, when one changes as the other changes. If variables are associated, score on one variable can be predicted from the score of the other variable. The stronger the association, the more accurate the predictions. Introduction to Bivariate Association

TRANSCRIPT

Page 1: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-1

Chapter 11Introduction to Bivariate Association and

Measures of Association for Variables Measured at the Nominal Level

Page 2: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

• Association Between Variables and the Bivariate Table (Crosstab)

• Three Characteristics of Bivariate Associations• Chi Square-Based Measures of Association:

Phi and Cramer’s V • Lambda: A PRE Measure of Association

Copyright © 2014 by Nelson Education Limited.

11-2

In this presentation you will learn about:

Page 3: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-3

• Two variables are said to be associated when they vary together, when one changes as the other changes.

• If variables are associated, score on one variable can be predicted from the score of the other variable.

• The stronger the association, the more accurate the predictions.

Introduction to Bivariate Association

Page 4: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-4

• Association can be important evidence for causal relationships, particularly if the association is strong.

• Bivariate association can be investigated by finding answers to three questions:1.Does an association exist?2.How strong is the association?3.What is the pattern or direction of the association?

Introduction to Bivariate Association (continued)

Page 5: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-5

• To detect association within bivariate tables:

1. Calculate percentages within the categories of the independent variable.

2. Compare percentages across the categories of the independent variable.

Does an association exist?

Page 6: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-6

• When the independent variable is the column variable (as is usually the case):1. Calculate percentages within the columns

(vertically). Column percentages are conditional distributions of Y for each value of X.

2. Compare percentages across the columns (horizontally).

Does an association exist? (continued)

Page 7: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-7

• In summary, to detect association (when the independent variable is in the column of a bivariate table) follow this rule:

“Percentage Down, Compare Across”

Does an association exist? (continued)

Page 8: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-8

• Forty-four males and females were asked whether or not they watched the last CFL Grey Cup game.

Association and Bivariate Tables: An Example

Page 9: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-9

o The table below shows the relationship between sex (X) and Grey Cup watching (Y) for 44 people. Is there an association between these variables?

Association and Bivariate Tables: An Example (continued)

Watched Grey Cup

Sex ____

Male Female

Totals

No 10 13 23 Yes 17 4 21

Totals 27 17 44

Page 10: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-10

• An association exists if the conditional distributions of one variable change across the values of the other variable.– With bivariate tables, column percentages are the conditional

distributions of Y for each value of X.

– If the column percentages change, the variables are associated.

Association and Bivariate Tables: An Example (continued)

Page 11: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-11

• To calculate column percentages, each cell frequency is divided by the column total, then multiplied by 100:

◦(10/27)*100 = 37.04%◦(13/17)*100 = 76.47%◦(17/27)*100 = 62.96%◦( 4/17)*100 = 23.53%

Association and Bivariate Tables: An Example (continued)

Page 12: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-12

Grey Cup Watching by Sex, Frequencies (Percentages)

Association and Bivariate Tables: An Example (continued)

Watched Grey Cup

Sex Male Female

Totals

No 10 (37.04%) 13 (76.47%) 23 Yes 17 (62.96%) 4 (23.53%) 21

Totals 27 (100.00%) 17 (100.00%) 44

Page 13: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-13

• The column percentages show sex of respondents by Grey Cup watching. o The column percentages do change (differ across columns), so these

variables are associated.

Grey Cup Watching by Sex, Percentages

Association and Bivariate Tables: An Example (continued)

Watched Grey Cup

Sex Male Female

No 37.04% 76.47% Yes 62.96% 23.53%

Totals 100.00% 100.00%

Page 14: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-14

• The stronger the relationship, the greater the change in column percentages (or conditional distributions).– In weak relationships, there is little or no change in

column percentages.– In strong relationships, there is marked change in

column percentages.

How Strong is the Association?

Page 15: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-15

• One way to measure strength is to find the “maximum difference,” the biggest difference in column percentages for any row of the table.

Note, the “maximum difference” method provides an easy way of characterizing the strength of relationships, but it is also limited.

How Strong is the Association? (continued)

Page 16: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-16

• The scale presented Table 11.5 can be used to describe (only arbitrary and approximately though) the strength of the relationship”

How Strong is the Association? (continued)

Page 17: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-17

• The “Maximum Difference” is: – 76.47–37.04=39.43 percentage points.– This is a strong relationship.

Grey Cup Watching by Sex, Percentages

How Strong is the Association? (continued)

Watched Grey Cup

Sex ____ Male Female

No 37.04% 76.47% Yes 62.96% 23.53%

Totals 100.00% 100.00%

Page 18: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-18

• “Pattern” = which scores of the variables go together?

• To detect, find the cell in each column which has the highest column percentage.

What is the Pattern of the Relationship?

Page 19: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-19

• The majority of males watched the last Grey Cup game.

• The majority of females did not watch the last Grey Cup game.

What is the Pattern of the Relationship? (continued)

Grey Cup Watching by Sex, Percentages

Watched Grey Cup

Sex ____ Male Female

No 37.04% 76.47% Yes 62.96% 23.53%

Totals 100.00% 100.00%

Page 20: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-20

• If both variables are ordinal, we can discuss direction as well as pattern.

• In positive relationships, the variables vary in the same direction. – Low on X is associated with low on Y.– High on X is associated with high on Y.– As X increase, Y increases.

• In negative relationships, the variables vary in opposite directions.– As one increases, the other decreases.

What is the Direction of the Relationship?

Page 21: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-21

• The relationship is positive.– As education increases, library use increases.

Library Use by Education, Percentages

What is the Direction of the Relationship? (continued)

Page 22: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-22

• The relationship is negative.– As education increases, TV viewing decreases

TV Viewing by Education, Percentages

What is the Direction of the Relationship? (continued)

Page 23: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-23

• Conditional distributions, column percentages, and the maximum difference provide useful information about the bivariate association, and should always be computed and analyzed.

• But, they can be awkward and cumbersome to use.• Measures of association, on the other hand, characterize

the strength (and for ordinal- and interval-ratio level variables, the direction) of bivariate relationships in a single number.

Measures of Association

Page 24: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-24

• For nominal level variables, there are two commonly used measures of association:–Phi (φ) or Cramer’s V (Chi square-based measures)–Lambda (λ) (PRE measure)

• While more suitable for nominal-level variables, Phi, Cramer’s V, and Lambda can be also used to measure the strength of the relationship between ordinal-level variables in a bivariate table.

Nominal Level Measures of Association

Page 25: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-25

• Phi is used for 2x2 tables.• Formula for phi:

where the obtained chi square, χ2, is divided by n, then the square root of the result taken.

Chi Square-Based Measures of Association

Page 26: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-26

• Cramer’s V is used for tables larger than 2x2.• Formula for Cramer’s V:

Chi Square-Based Measures of Association (continued)

Page 27: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-27

• Phi and Cramer’s V range in value from 0 (no association) to 1.00 (perfect association).

o Phi and V are symmetrical measures; that is, the value of Phi and V will be the same regardless of which variable is taken as independent.

Chi Square-Based Measures of Association (continued)

Page 28: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-28

o General guidelines for interpreting the value of Phi and V are provided in Table 11.14o These are similar to the guidelines used for interpreting the maximum difference in column percentages in Table 11.5.o The scale in Table 11.14 is arbitrary and meant only as a general guideline.

Chi Square-Based Measures of Association (continued)

Page 29: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-29

• The following problem was considered in Chapter 11 of the textbook:

A random sample of 100 social work graduates has been classified in terms of whether the Canadian Association of Schools of Social Work (CASSW) has accredited their undergraduate programs (independent variable) and whether they were hired in social work positions within three months of graduation (dependent variable).

Chi Square-Based Measures of Association: An Example

Page 30: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-30

• We saw in Chapter 10 that this relationship was statistically significant:

o Chi square = 10.78, which was significant at the .05 level.

• However, what about the strength of this association?

Chi Square-Based Measures of Association: An Example

(continued)

Page 31: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-31

• To assess the strength of the association between CASSW accreditation and employment, phi is compute as:

o A phi of .33 indicates a strong association between these two variables.

Chi Square-Based Measures of Association: An Example

(continued)

Page 32: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-32

•To help identify the pattern of this relationship, column percentages are presented in Table 11.11:

•The table shows that graduates of CASSW-accredited programs were more often employed as social workers.

Chi Square-Based Measures of Association: An Example

(continued)

Page 33: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-33

• Phi is used for 2x2 tables only. – For larger tables, the maximum value of phi

depends on table size and can exceed 1.0.– Use Cramer’s V for larger tables.

• Phi (and V ) is an index of the strength of the relationship only. It does not identify the pattern.

• To analyze the pattern of the relationship, see the column percentages in the bivariate table.

Limitations of Chi Square-Based Measures of Association

Page 34: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-34

• Logic of PRE measures is based on two predictions:

1. First prediction: Ignore information about the independent variable and make many errors (E1) in predicting the value of the dependent variable.

2. Second prediction: Take into account information about the independent variable in predicting the value of the dependent. If the variables are associated we should make fewer errors (E2).

Proportional Reduction in Error (PRE) Measures

Page 35: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-35

• Like Phi (and V ), Lambda (λ) is used to measure the strength of the relationship between nominal variables in bivariate tables.

• Like Phi (and V ), the value of lambda ranges from 0.00 to 1.00.

• Unlike Phi (and V ), Lambda is a PRE measure and its value has a more direct interpretation. – While Phi (and V) is only an index of strength, the value of

Lambda tells us the improvement in predicting Y while taking X into account.

Lambda

Page 36: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-36

• Formula for Lambda:

Lambda (continued)

Page 37: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-37

• Let's again consider the relationship between sex (X) and Grey Cup watching (Y) for 44 people.– Here, we will analyze the association between these

variables with a measure of association, Lambda:

Lambda: An Example

Page 38: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-38

• To compute Lambda, we must first find E1 and E2:

– E1 = n – largest row total = 44 – 23 = 21

– E2 = For each column, subtract the largest cell frequency from the

column total = (27 – 17) + (17 – 13) = 10 + 4 = 14

Lambda: An Example (continued)

Watched Grey Cup

Sex ____

Male Female

Totals

No 10 13 23 Yes 17 4 21

Totals 27 17 44

Page 39: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-39

• Using Formula 11.3 to compute lambda, we get: λ = 21- 14/21 = 7/21 = .33

• A lambda of .33 means that sex (X) increases our ability to predict Grey Cup viewing (Y) by 33%.

• According to the guidelines suggested in Table 11.14, a Lambda of 0.33 indicates a strong relationship.

Lambda: An Example (continued)

Page 40: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-40

1. Lambda is asymmetric: Value will vary depending on which variable is independent. Need care in designating independent variable.

2. When row totals are very unequal, lambda can be zero even when there is an association between the variables. For very unequal row marginals, better to use a chi-square based measure of association.

The Limitations of Lambda

Page 41: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-41

3. Lambda gives an indication of the strength of the relationship only.

– It does not give information about pattern.– To analyze the pattern of the relationship, use

the column percentages in the bivariate table.

The Limitations of Lambda (continued)

Page 42: Copyright © 2014 by Nelson Education Limited. 11-1 Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured

Copyright © 2014 by Nelson Education Limited.

11-42

• Correlation and causation are not the same things.

• Strong associations may be used as evidence of causal relationships but they do not prove variables are causally related.– Hence, association is just one of the criteria needed

to establish a causal relationship between variables.

Correlation vs. Causation