![Page 1: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/1.jpg)
Ordinal Data
Sarah Medland
![Page 2: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/2.jpg)
A plug for OpenMx?
• Very few packages can handle ordinal data adequately…
• OpenMx can also be used for more than just genetic analyses
  – Regression
  – Polychoric correlations
  – Factor analysis...
![Page 3: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/3.jpg)
![Page 4: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/4.jpg)
Two approaches to the liability threshold model
• Problem
  – Ordinal data has one fewer degree of freedom
    • MZcov, DZcov, Prevalence
    • No information on the variance
  – Thinking about our ACE/ADE model
    • 4 parameters being estimated
    • A C E mean
  – ACE/ADE model is unidentified without adding a constraint
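One way to see why the data carry no information on the variance, sketched for a binary trait. The symbols are introduced here for illustration and are not from the slides: liability $L \sim N(\mu, \sigma^2)$, threshold $t$, and an arbitrary rescaling factor $k > 0$.

$$
P(\text{affected}) = P(L > t) = 1 - \Phi\!\left(\frac{t - \mu}{\sigma}\right),
\qquad
\frac{kt - k\mu}{k\sigma} = \frac{t - \mu}{\sigma}
$$

Rescaling the threshold, mean and standard deviation by the same $k$ leaves every observed proportion unchanged, so one quantity has to be fixed before the rest can be estimated.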
![Page 5: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/5.jpg)
Two approaches to the liability threshold model
• Solution?
• Traditional
  – Maps data to a standard normal distribution
  – Total variance constrained to be 1
• Alternate
  – Fixes an alternate parameter (usually E)
  – Estimates the remaining parameters
![Page 6: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/6.jpg)
Traditional Approach
• Imagine we have a set of binary data
• Trait – lifetime cannabis use
  – Never Smoked/Ever Smoked
![Page 7: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/7.jpg)
Twin 1 cannabis use
• 0 = never used
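twin1

| | | Frequency | Percent | Valid Percent | Cumulative Percent |
|---|---|---|---|---|---|
| Valid | 0 | 822 | 47.5 | 53.0 | 53.0 |
| | 1 | 730 | 42.2 | 47.0 | 100.0 |
| | Total | 1552 | 89.7 | 100.0 | |
| Missing | System | 179 | 10.3 | | |
| Total | | 1731 | 100.0 | | |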
![Page 8: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/8.jpg)
Twin 1 cannabis use
![Page 9: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/9.jpg)
Twin 1 cannabis use
Liability or ‘risk’ of initiation distribution
Just because an individual has never used cannabis does not mean their ‘risk’ of initiation is zero
![Page 10: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/10.jpg)
Mean = .47, SD = .499, Non-smokers = 53%
The observed phenotype is an imperfect measurement of an underlying continuous distribution
e.g. obesity vs BMI; MDD vs quantitative depression scales
![Page 11: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/11.jpg)
Raw data distribution: Mean = .47, SD = .499, Non-smokers = 53%, Threshold = .53
Standard normal distribution: Mean = 0, SD = 1, Non-smokers = 53%, Threshold = .074
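The raw-data summary can be reproduced from the twin 1 frequency table. This is a minimal Python sketch, not the workshop's own script. The counts come from the table; the variable names are illustrative.

```python
from math import sqrt

# Counts from the twin 1 frequency table (valid cases only)
never_used = 822   # coded 0
ever_used = 730    # coded 1
n_valid = never_used + ever_used

# For a 0/1 variable the mean is the proportion coded 1,
# and the (population) SD is sqrt(p * (1 - p)).
mean = ever_used / n_valid
sd = sqrt(mean * (1 - mean))
prop_never = never_used / n_valid

print(f"Mean = {mean:.2f}, SD = {sd:.3f}, Non-smokers = {prop_never:.0%}")
```

On the raw 0/1 scale, the cut point between the two categories sits at the cumulative proportion of non-smokers.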
![Page 12: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/12.jpg)
Threshold = .074 – Huh what?
• How can I work this out? – Excel
  • =NORMSINV()
  • Thresholds.xls
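The same calculation can be sketched outside Excel. The Python example below uses the standard library's inverse normal CDF in place of =NORMSINV(). It is an illustration only and is not the contents of Thresholds.xls or Threshold.R.

```python
from statistics import NormalDist

# Proportion of valid twin 1 responses coded 0 (never used)
prop_never = 822 / 1552

# The threshold is the z-value that cuts off this proportion
# in the lower tail of a standard normal distribution.
threshold = NormalDist(mu=0.0, sigma=1.0).inv_cdf(prop_never)

print(f"Threshold = {threshold:.3f}")
```

Using the exact proportion gives a value that rounds to .074; using the rounded proportion 0.53 gives a value that rounds to .075.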
![Page 13: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/13.jpg)
Why rescale the data this way?
• Convenience
  – Variance always 1
  – Mean is always 0
  – We can interpret the area under a curve between two z-values as a probability or percentage
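The area interpretation can be checked directly. This is a minimal Python sketch; the helper name `area_between` is hypothetical.

```python
from statistics import NormalDist

z = NormalDist()  # standard normal: mean 0, SD 1

def area_between(lower, upper):
    """Probability that a standard normal value falls in [lower, upper]."""
    return z.cdf(upper) - z.cdf(lower)

# Area below the cannabis threshold: the proportion who never used
below_threshold = z.cdf(0.074)

# Area within one SD of the mean, for comparison
within_one_sd = area_between(-1.0, 1.0)

print(f"{below_threshold:.1%} below the threshold")
print(f"{within_one_sd:.1%} within one SD of the mean")
```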
![Page 14: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/14.jpg)
Threshold.R
![Page 15: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/15.jpg)
Threshold.R
![Page 16: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/16.jpg)
Threshold = .075 – Huh what?
![Page 17: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/17.jpg)
What about more than 2 categories?
• Very similar
  – We create a matrix containing the 1st threshold and the displacements between subsequent thresholds
  – We then add the 1st threshold and the displacement to obtain the subsequent thresholds
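A minimal Python sketch of the idea for a three-category variable. The category counts are hypothetical, invented purely for illustration; only the approach (first threshold plus displacements) follows the slide.

```python
from itertools import accumulate
from statistics import NormalDist

# HYPOTHETICAL counts for categories 0, 1, 2 (not from the slides)
counts = [500, 300, 200]
total = sum(counts)

# Cumulative proportions below each cut point (the last one is always 1, so drop it)
cum_props = [c / total for c in accumulate(counts)][:-1]

# Thresholds on the standard normal scale
thresholds = [NormalDist().inv_cdf(p) for p in cum_props]

# Parameterise as the 1st threshold plus displacements to each later threshold
first_threshold = thresholds[0]
increments = [b - a for a, b in zip(thresholds, thresholds[1:])]

# Adding the displacements back recovers the thresholds
rebuilt = list(accumulate([first_threshold] + increments))

print(first_threshold, increments, rebuilt)
```

Keeping the displacements positive is what keeps the thresholds in order.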
![Page 18: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/18.jpg)
Mx Threshold Specification: 3+ Cat.
[Figure: liability distribution on an axis from -3 to 3; marked values -1, 0, 1.2 and 2.2]
Threshold matrix: T Full 2 2 Free
• Columns: Twin 1, Twin 2
• Rows: 1st threshold, increment
![Page 19: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/19.jpg)
Mx Threshold Specification: 3+ Cat.
[Figure: liability distribution on an axis from -3 to 3; marked values -1, 0, 1.2 and 2.2]
Threshold matrix: T Full 2 2 Free
• Columns: Twin 1, Twin 2
• Rows: 1st threshold, increment
MxAlgebra L%*%T
![Page 20: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/20.jpg)
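Mx Threshold Specification: 3+ Cat.
[Figure: liability distribution on an axis from -3 to 3; marked values -1, 0, 1.2 and 2.2]
Threshold matrix: T Full 2 2 Free
• Columns: Twin 1, Twin 2
• Rows: 1st threshold, increment
MxAlgebra L%*%T (gives the 2nd threshold: 1st threshold + increment)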
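The algebra can be sketched in plain Python. Two assumptions are made here that the slide does not state: L is taken to be a lower-triangular matrix of ones, and both twins share the values read off the figure (1st threshold -1, increment 2.2). The helper `matmul` is illustrative.

```python
# ASSUMPTION: L is a lower-triangular matrix of ones
L = [[1, 0],
     [1, 1]]

# Rows: 1st threshold, increment. Columns: Twin 1, Twin 2.
# ASSUMPTION: values read off the slide figure, same for both twins.
T = [[-1.0, -1.0],
     [2.2, 2.2]]

def matmul(a, b):
    """Plain-Python matrix product, the equivalent of a %*% b."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

# Row 1 stays the 1st threshold; row 2 becomes 1st threshold + increment
thresholds = matmul(L, T)

for label, row in zip(["1st threshold", "2nd threshold"], thresholds):
    print(label, [round(v, 2) for v in row])
```

With these values the 2nd threshold comes out at 1.2, matching the value marked on the figure.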
![Page 21: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/21.jpg)
Check the xls spreadsheet…
![Page 22: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/22.jpg)
Two approaches to the liability threshold model
• Solution?
• Traditional
  – Maps data to a standard normal distribution
  – Total variance constrained to be 1
• Alternate
  – Fixes an alternate parameter
    • Binary or ordinal data: fix E
    • Ordinal data: fix 1st two thresholds (aka invariant threshold approach)
  – Estimates the remaining parameters
![Page 23: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/23.jpg)
Models are equivalent, but…
• Alternate approach means the data is no longer mapped to a standard normal
• No easy conversion to %
• Makes it difficult to compare between groups as the scaling is now arbitrary
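A minimal Python sketch of why an extra rescaling step is needed under the fixed-E approach. All estimates below are hypothetical, and the sketch assumes the liability mean is fixed at 0 and that A, C and E are variance components.

```python
from math import sqrt
from statistics import NormalDist

# HYPOTHETICAL fixed-E estimates (E fixed to 1, A and C estimated)
A, C, E = 2.0, 1.0, 1.0
threshold = 0.15            # on the arbitrary fixed-E scale

total_var = A + C + E       # no longer 1, so z-tables do not apply directly

# Rescale to the traditional parameterisation (total variance = 1)
standardised = {name: value / total_var
                for name, value in {"A": A, "C": C, "E": E}.items()}
std_threshold = threshold / sqrt(total_var)

# Only after rescaling does the threshold convert to a percentage
prop_below = NormalDist().cdf(std_threshold)

print(standardised, round(std_threshold, 3), f"{prop_below:.1%}")
```

The raw threshold of 0.15 means nothing on its own; its interpretation depends on the total variance of that particular model, which is why comparisons between groups are harder.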
![Page 24: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/24.jpg)
• We are going to run traditional and FixedE ACE models with binary data
  – twinAceBin-Traditional.R
  – twinAceBin-FixE.R
• There are scripts in the folder that have multiple threshold models in them – take a look later
  – twinAceOrd-Traditional.R
  – twinAceOrd-FixE.R
  – twinAceOrd-FixThresholds.R
![Page 25: Ordinal Data](https://reader034.vdocument.in/reader034/viewer/2022051518/56816178550346895dd106d7/html5/thumbnails/25.jpg)
Lisbon Castle