1 an ordinal irt model for a circular representation of polytomous data wijbrandt h. van schuur...

1

An ordinal IRT model for a circular representation

of polytomous data

Wijbrandt H. van SchuurUniversity of Groningen

25th Workshop on Item Response Theory University of TwenteOctober 12-15, 2009

2

Overview

A. From dominance model to proximity model

and from monotone to circular proximity

B. From dichotomous to polytomous data

(C. From deterministic to probabilistic model)

3

IRF’s of three dominance items

Figure 7: Three doubly monotone items

0

0,5

1

-8 -6 -4 -2 0 2 4 6 8

latent continuum

pro

bab

ilit

y p

osi

tive

res

po

nse

4

Example dominance model (World Values Study)

Subjects A: Hell B: Devil C: Heaven D: God

1 0 0 0 0

2 1 0 0 0

3 1 1 0 0

4 1 1 1 0

5 1 1 1 1

Do you believe in …Item A HellItem B The DevilItem C HeavenItem D God

5

IRF’s of six monotone proximity items

CN

9,15

8,07

7,18

6,27

5,21

4,41

3,39

2,59

1,74

,62

-,46

-1,72

-2,87

-4,04

-5,18

-6,09

-7,08

-7,91

-9,11

-9,98

Me

an1,2

1,0

,8

,6

,4

,2

0,0

P40

P41

P42

P43

P44

P45

6

Example Monotone proximity model(Electoral compass)

item Clinton Obama Edwards Richard-son

McCain Hucka-bee

Romney Thomson Giuliani

1 1 1 0 0 0 0 0 0 0

2 1 1 1 1 1 0 0 0 0

3 0 0 0 1 1 0 0 0 0

4 0 0 0 1 1 1 1 1 0

5 0 0 0 0 1 1 1 1 0

6 0 0 0 0 0 0 1 1 1

Item 1 The best way to reduce the federal deficit is to increase taxes Item 2 Mortgage lenders should be more tightly controlled Item 3 The US should decrease its spending on defense Item 4 Stricter gun control will not reduce crime Item 5 Abortion should be made completely illegal Item 6 The US should never sign international treaties on climate change

that limit economic growth

7

7,006,005,004,003,002,001,000,00

1,00

0,80

0,60

0,40

0,20

0,00

Vertical: probability of positive response

Horizontal: items i, j and k scale values (in radians between 0 and 2π)

IRF’s of three circular proximity items

0o 60o 120o 180o 240o 300o 360o = 0o

8Larsen, R.J. & Diener (1992), E. Promises and problems with the circumplex model of emotion, p. 31. In: M.S. Clark & J.R. Averill (eds.). Emotion: Review of personality and social psychology (Vol. 13, pp. 25-59), Newbury Park, CA: Sage.

Larsen & Diener

9

Brown, M.W. (1992). Circumplex models for correlation matrices. Psychometrika, 57, 470, 479

Brown: Vocational Interests

R

I

A

S

E

C

10

Schwartz: Universals in the content and structure of values

Stability

Other Self

Change

Security

Conformity

Benevolence

Universalism

Self Direction

Stimulation

Hedonism

Achievement

Power

Schwartz, S.H. (1992). Universals in the content and structure of values: theoretical advances and empirical tests in 20 countries.

In: M.P. Zanna (ed.), Advances in experimental social psychology, Vol. 25 (p. 1-65). San Diego/London: Academic Press..

11

Example circular proximity model (First two dimensions of Big FIVE)

Active (N)

Lively (NE)

Glad (E)

Calm (SE)

Still (S)

Tired (SW)

Sad (W)

Anxious (NW)

1 1 1 1 0 0 0 0 0

2 0 1 1 1 0 0 0 0

3 0 0 1 1 1 0 0 0

4 0 0 0 1 1 1 0 0

5 0 0 0 0 1 1 1 0

6 0 0 0 0 0 1 1 1

7 1 0 0 0 0 0 1 1

8 1 1 0 0 0 0 0 1

12

Violations of deterministic models

Dominance model: 1 subject and 2 items: 01-response to item pair (Mokken, 1971)

Monotone proximity model: 1 subject and 3 items: 101-response to item triple (Van Schuur, 1984)

Circular proximity model: 1 subject and 4 items: 1010- or 0101 response

to item quadruple (Leeferink, 1997, Mokken, van Schuur & Leeferink, 2001)

13

Homogeneity (Loevinger)For each elementary scale (pair, triple, quadruple): H = 1 - E(obs)/E(exp) = φ/φmax E(exp): product of relevant probabilities * N (for dominance data)

For each item: Hi = 1 – Σ E(obs)/Σ E(exp) Summation over all elementary scales that contain item i

For whole scale: H = 1 – Σ E(obs)/Σ E(exp) Summation over all elementary scales

Person fit: number of elementary scales in response pattern that contain a model violation

14

ExplorationBottom-up hierarchical clustering procedure:

1. “Best” elementary scale

2. “Best” next item

15

Item steps and subject scale valuesDominance model: 2 items i and j, 2 categories i: 0 1 1 j: 0 0 1 sum: 0 1 2 ──────┬─────┴────┬───┴───┬──────

θ0 δi01 θ1 δj01 θ2

Proximity models: 3 items i,j,k 2 categories i: 0 1 1 1 0 0 0 j: 0 0 1 1 1 0 0 k: 0 0 0 1 1 1 0 sum: 0 1 2 3 4 5 6

───┬─┴─┬─┴──┬─┴─┬─┴─┬─┴─┬┴─┬ δi01 δj01 δk01 δi10 δj10 δk10

16

Item steps and subject scale valuesDominance model: 2 items i and j, 2 categories i: 0 1 1 j: 0 0 1 sum: 0 1 2 ──────┬─────┴────┬───┴───┬──────

θ0 δi01 θ1 δj01 θ2

Proximity models: 3 items i,j,k 2 categories i: 0 1 1 1 2 2 2 j: 0 0 1 1 1 2 2 k: 0 0 0 1 1 1 2 sum: 0 1 2 3 4 5 6

───┬─┴─┬─┴──┬─┴─┬─┴─┬─┴─┬┴─┬ δi01 δj01 δk01 δi10 δj10 δk10

17

I01

I10

J01

J10

K10

K01

L01

L10

1100=2

1110=3

0110=4

0111=50011=6

0001=7

1001=0 or 8

1000=1 I

J

K

L

Scale values of subjects

18

Scale value of items: dominance model

ORDER of the items is generally based on popularity in sample

19

Scale value of items: unfolding model

ORDER of the items steps is based on uniqueness of representation (popularity is irrelevant)

Which item is middle item: BAC, ABC, or ACB? Requirement for “best” triple: “unique” triple: Positive homogeneity in only one permutation

20

Scale value of items: circumplex model

For circular proximity model: a. Each item can be the first in an ordered quadruple: ABCD = BCDA = CDAB = DABC b. Clockwise and counter clockwise: ABCD=DCBA

So, arbitrarily beginning with item A: Which item is middle item among remaining three items : CBD, BCD, or BDC? (or quadruples ACBD, ABCD, or ABDC) Requirement for “best” quadruple: “unique” quadruple: Positive homogeneity in only one permutation

21

Polytomous itemsDominance Monotone Circular

Model Proximity Model Proximity ModelCumulative scale Unfolding scale Circumplex scaleA B C D E F A B C D E F G A B C D E F G 0 0 0 0 0 0 2 1 1 0 0 0 0 1 2 1 0 0 0 01 0 0 0 0 0 1 2 1 0 0 0 0 0 1 2 1 0 0 01 1 0 0 0 0 0 1 2 1 0 0 0 0 0 1 2 1 0 02 2 1 0 0 0 0 0 1 2 1 0 0 0 0 0 1 2 1 02 2 1 1 0 0 0 0 0 1 2 1 0 0 0 0 0 1 2 12 2 2 2 1 1 0 0 0 0 1 2 1 1 0 0 0 0 1 22 2 2 2 2 2 0 0 0 0 1 1 2 2 1 0 0 0 0 1

For dominance model: Molenaar 1983For unfolding model: Van Schuur 1993

22

Item steps of polytomous itemsDominance model: 2 items i and j, 4 categories i: 0 1 1 2 3 3 3 j: 0 0 1 1 1 2 3 ──┴────┴─────┴────┴───┴──┴───

δi01 δj01 δi12 δi23 δj12 δj23

Proximity models: 1 item i, 4 categories i: 0 1 2 3 2 1 0

──┴────┴─────┴────┴───┴──┴─── δi01 δi12 δi23 δi32 δi21 δi10

23

Model violations for polytomous dominance data

Dominance model: 1 subject and 2 item steps

Weight of seriousness of model violation: i: 0 1 1 2 3 3 3 j: 0 0 1 1 1 2 3 ──┴────┴─────┴────┴───┴──┴───

δi01 δj01 δi12 δi23 δj12 δj23

(i=0,j=1) is less serious than (i=0, j=3)

24

Model violations for polytomous unfolding data

Monotone Proximity model: Response pattern ABC,323 less bad than ABC,302

Concept of ‘implicit error”: Given ABC,302: AB=30, so C must be 0 and C=1 and C=2 are errors. C=1 is an implicit error, C=2 is the explicit error

AC=32, so B must be 2 or 3, and B=1, B=0 in error B=1: implicit; B=0: explicit

BC=02, so A must be 0, and A=1, A=2, A=3: error A=1 and A=2: implicit; A=3: explicit

Weight of errors in triple: sum of implicit and explicit errors in pairs of triple.

25

Model violations for polytomous circumplex data

Circular Proximity model: Response pattern ABCD,3232 less bad than ABCD,3021

Concept of ‘implicit error”: Given ABCD,3021: ABC=302, so D must be 2 or 3; D=1 is the explicit error

ABD=301, so C must be 0 or 1, and C=2 explicit error

ACD=321, so B must be 2 or 3, and B=1 or B=0 are errors B=0 is explicit error and B=1 is implicit error

BCD=021, so A must be 0 or 1, and A=2 or A=3 are errors A=3: explicit and A=2: implicit

Weight of errors in quadruple: sum of implicit and explicit errors in triples of quadruple.

26

Homogeneity for polytomous circumplex data

For elementary scale: H = 1 – Σ W* E(obs) / Σ W*E(exp) Summation over relevant elementary item step combinations

For item or whole scale: H = 1 – ΣΣ W*E(obs) / ΣΣ W*E(exp) + Summation over relevant triples

27

J 1

L 1

K 1

K 1

K 2

L 2

K

L

k(21)

k(12)

l(12)j(10)

l(01)

j(21)

i(10)

I

J

k(01)

I 1

I 2

I 1

J 2

J 1i(12)

i(21)

i(01)

j(12)

L 1

j(01)

k(10)

l(21)

l(10)

1001=0 or 16

0110=8

2001=1

2101=22100=3

2200=4

1200=5

1210=6

0210=7

0120=9

0121=10

0021=11

0022=12

0012=13

1012=14

1011=15

28

Problems with scale values for subjects200000 unfalsifiable: no model error possible

202020 symmetrical: unscalable

101100 no highest value (2): ambiguous (change to 202200)

Imperfect patterns: calculation clockwise and counterclockwise should give the same result. If not: response pattern is symmetrical (unscalable) or take mean of both values

29

Probabilistic model: Use diagnostic matrices

Shape of Correlation matrix (high-low-high values)

Similarly: shape of (Conditional) Adjacency matrix shape of Dominance matrix

In development: criteria analogous to criteria developed for the Mokken scale by Molenaar and Sijtsma

30

What can we do with circular subject scores?

Biologists: compass and clock Mardia (1972): Statistics of directional data Batschelet (1981): Circular Statistics in biology Fisher (1993): Statistical analysis of circular data

Compare distributions (uniform, unimodal) Use circular scale values as dependent or independent variable in regression analyses

31

THANK YOU

32

Correlation Matrix: values decrease from the diagonal towards the lowest value (underlined), and then increase again towards the diagonal.

A B C D E F G H I A 1.00 0.50 0.07 -0.14 -0.32 -0.29 -0.18 0.07 0.54 B 0.50 1.00 0.57 0.07 -0.18 -0.36 -0.32 -0.14 0.11 C 0.07 0.57 1.00 0.50 0.11 -0.21 -0.32 -0.29 -0.18 D -0.14 0.07 0.50 1.00 0.54 0.14 -0.18 -0.36 -0.32 E -0.32 -0.18 0.11 0.54 1.00 0.54 0.07 -0.18 -0.29 F -0.29 -0.36 -0.21 0.14 0.54 1.00 0.54 0.07 -0.18 G -0.18 -0.32 -0.32 -0.18 0.07 0.54 1.00 0.54 0.07 H 0.07 -0.14 -0.29 -0.36 -0.18 0.07 0.54 1.00 0.54 I 0.54 0.11 -0.18 -0.32 -0.29 -0.18 0.07 0.54 1.00

33

Conditional Adjacency Matrix:Response value (>=): 1

A B C D E F G H I A 1.00 0.72 0.52 0.45 0.34 0.38 0.41 0.52 0.76 B 0.78 1.00 0.78 0.56 0.41 0.33 0.33 0.41 0.56 C 0.56 0.78 1.00 0.78 0.56 0.41 0.33 0.33 0.41 D 0.45 0.52 0.72 1.00 0.76 0.59 0.41 0.31 0.34 E 0.36 0.39 0.54 0.79 1.00 0.79 0.54 0.39 0.36 F 0.38 0.31 0.38 0.59 0.76 1.00 0.76 0.52 0.41 G 0.43 0.32 0.32 0.43 0.54 0.79 1.00 0.75 0.54 H 0.56 0.41 0.33 0.33 0.41 0.56 0.78 1.00 0.78 I 0.79 0.54 0.39 0.36 0.36 0.43 0.54 0.75 1.00

34

Dominance Matrix:Response value (>=): 1

A B C D E F G H I A 0.00 0.14 0.25 0.29 0.34 0.32 0.30 0.25 0.13 B 0.11 0.00 0.11 0.21 0.29 0.32 0.32 0.29 0.21 C 0.21 0.11 0.00 0.11 0.21 0.29 0.32 0.32 0.29 D 0.29 0.25 0.14 0.00 0.13 0.21 0.30 0.36 0.34 E 0.32 0.30 0.23 0.11 0.00 0.11 0.23 0.30 0.32 F 0.32 0.36 0.32 0.21 0.13 0.00 0.13 0.25 0.30 G 0.29 0.34 0.34 0.29 0.23 0.11 0.00 0.13 0.23 H 0.21 0.29 0.32 0.32 0.29 0.21 0.11 0.00 0.11 I 0.11 0.23 0.30 0.32 0.32 0.29 0.23 0.13 0.00

35

Score group matrix. Response value (>=): 1score N scale A B C D E F G H I group range 1 6 (17- 0) 1.00 0.67 0.33 0.00 0.00 0.17 0.50 0.83 1.00 2 6 ( 1- 2) 1.00 1.00 0.67 0.33 0.00 0.00 0.17 0.50 0.83 3 6 ( 3- 4) 0.83 1.00 1.00 0.67 0.33 0.00 0.00 0.17 0.50 4 6 ( 5- 6) 0.50 0.83 1.00 1.00 0.67 0.33 0.00 0.00 0.17 5 10 ( 7- 9) 0.20 0.40 0.70 1.00 1.00 0.80 0.40 0.10 0.10 6 10 (10-12) 0.10 0.00 0.20 0.60 0.80 1.00 0.90 0.50 0.20 7 6 (13-14) 0.33 0.00 0.00 0.17 0.50 0.83 1.00 1.00 0.67 8 6 (15-16) 0.67 0.33 0.00 0.00 0.17 0.50 0.83 1.00 1.00

36

ExplorationBottom-up hierarchical clustering procedure: 1. “Best” elementary scale 2. “Best” next item

Ad 1: - High(est) homogeneity - highest number of subjects who use items of elementary scale in acceptable pattern (for proximity models) - unique representation (for proximity models)

Ad 2: - High(est) homogeneity - unique representation (for proximity models)

1 an ordinal irt model for a circular representation of polytomous data wijbrandt h. van schuur...

Documents