an overview of structural equationmodeling using mplus · i mixed e ect models for longitudinal...

23

Upload: others

Post on 31-May-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Harvard Catalyst Biostatistical Seminar

Neuropsychological Pro�les in Alzheimer's Disease and

Cerebral Infarction: A Longitudinal MIMIC Model

An Overview of Structural EquationModeling using Mplus

Richard N. Jones, Sc.D.

[email protected]

Institute for Aging Research, Hebrew SeniorLife

Beth Israel Deaconess Medical Center, Harvard Medical School

HSPH Kresge G2 October 5, 2011

1 / 23

Page 2: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Objective Overview of SEM in Cognitive Aging & Health Research

Objective

IntroduceI the concepts and terminology relevant to structural equation modeling

(SEM) as applied to health research

Speci�c ExampleI Cognitive EpidemiologyI Mplus software 1

EmphasisI on a broad survey of applications, results, challenges, and opportunities

1www.statmodel.com

2 / 23

Page 3: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Use of Mplus and SEM Applications in Epidemiology

Mplus and SEM Trends in Epidemiology

Relative to the frequency

with which Cox Regression

and Epidemiology appear

in Google Scholar...

Citations matching Mplus

and Epidemiology are

increasing

Although speci�c

applications are decreasing

Mplus use is increasing

and applications are

becoming more diverse

3 / 23

Page 4: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction What is Mplus, Anyways?

Mplus: General statistical analysis software good for...

Analysis with latent variables

Clustered and correlated dataI Complex sampling, weightingI Repeated measuresI Multicomponent variables (i.e., scales, composite outcomes)I Correlated observations (e.g., twins, families)I Multilevel contexts

Particular strengthsI Missing data modelingI Bayesian data analysisI Complex models

F Joint models of change and event occurrenceF Mixture models (population heterogeneity)F Longitudinal factor analysis

Where Mplus is not strongI Data managementI Graphics

4 / 23

Page 5: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction SEM in Epidemiologic Research

Structural Equation Modeling in Epidemiologic Research

5 / 23

Page 6: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Everything You Need to Know about SEM in 1 Slide

Structural Equation Modeling in Epidemiologic Research

DeStavola et al (2005), bottom panel Figure 3

6 / 23

Page 7: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction What is SEM?

Structural Equation Modeling (SEM) is

A general multivariate regression modeling frameworkI General - �exible model typesI Multivariate - multiple dependent variablesI Regression - it's just regression. Regression can be viewed as a special

case of SEM

SEMs often include latent variablesI Continuous latent variables (i.e., factors)I Categorical latent variables (i.e., classes, mixtures)

7 / 23

Page 8: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Varieties of Covariance Structure Modeling

Continuous Latent Variables

No Yes

Regressions among

No

Regression Factor

Dependent or Latent (Multivari-able\-ate) Analysis

Variables y = ν + ΓX+ ε y = ν + Λη + ε

Yes

Path Structural Equa-

Analysis tion Modeling

y = ν + BY + ΓX+ ε y = ν + Λη +KX+ εη = α+ Bη + ΓX+ ζ

8 / 23

Page 9: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Get yourself started

SEM Prerequisites

General linear modelI Linear, logistic, & probit regressionI Multivariable regressionI Mixed e�ect models for longitudinal dataI Survival and event occurrence (Cox, parametric survival)

Missing data theoryI Little and Rubin, Statistical Analysis with Missing Data. 1987

Factor AnalysisI Brown, Con�rmatory Factor Analysis for Applied Research. 2006

Item Response TheoryI Embretson and Reise, Item Response Theory for Psychologists. 2000

Path AnalysisI Kerlinger and Pedhazur, Multiple Regression in Behavioral Research.

1973

Structural Equation ModelingI Jöreskog and Sörbom, LISREL 8: User's Reference Guide. 1996

9 / 23

Page 10: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction What you have to look forward to

Mplus Work�ow

10 / 23

Page 11: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction But life can be better

Mplus Work�ow

11 / 23

Page 12: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Weaving with Stata and a good Text Editor is Fun and Easy

Mplus Work�ow - Weaving = Reproducible Research

12 / 23

Page 13: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Path Diagrams and Equations - Variables

Multivariate Regression

y = ν + ΓX + ε

13 / 23

Page 14: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Path Diagrams and Equations - Variables

Multivariate Regression

Mplus Syntax

TITLE: Multivariate regression

DATA: FILE = data.dat ;

VARIABLE: NAMES = y1 y2 x1 x2 ;

MODEL: y1 y2 on x1 x2 ;

14 / 23

Page 15: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Path Diagrams and Equations - Variables

Con�rmatory Factor Analysis

y = ν + Λη + ε

15 / 23

Page 16: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Path Diagrams and Equations - Variables

Con�rmatory Factor Analysis

Mplus Syntax

TITLE: Confirmatory Factor Analysis

DATA: FILE = data.dat ;

VARIABLE: NAMES = y1 y2 y3 ;

MODEL: eta by y1 y2 y3 ;

16 / 23

Page 17: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Path Diagrams and Equations - Variables

Structural Equation Modeling

y = ν + Λη +KX + ε

η = α +Bη + ΓX + ζ

17 / 23

Page 18: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Path Diagrams and Equations - Variables

Structural Equation Modeling

Mplus Syntax

TITLE: Structural Equation Model

DATA: FILE = data.dat ;

VARIABLE: NAMES = y1-y6 x1 ;

MODEL: eta1 by y1-y3 ; ! measurement model for eta1

eta2 by y4-y6 ; ! measurement model for eta2

eta2 on eta1 ; ! a structural regression

eta1 on x1 ; ! an "indirect effect"

y1 on x1 ; ! a "direct effect"

18 / 23

Page 19: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Introduction Path Diagrams and Equations - Variables

Structural Equation Modeling

Mplus Syntax

. runmplus y1-y6 x1 , model(eta1 by y1-y3 ; eta2 by y4-y6 ; ///eta2 on eta1 ; eta1 on x1 ; y1 on x1 ;)

19 / 23

Page 20: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Latent Variables What is a Latent Variable?

What Latent Variables Are

Latent variables are mathematical abstractions

that account for covariation among observed

variables

Latent variables may be continuous or

categorical

But what do they mean?

20 / 23

Page 21: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Latent Variables What is a Latent Variable?

What is the Meaning Behind a Latent Variable?

The answer depends on theI scienti�c questionI philosophical position 2

Two broad classes of latent variable (LV) applicationsI Instrumentalist

F the LV is a mathematical abstraction

I RealistF the LV existsF the LV re�ects some unmeasurable quantity or quality that really exists

in natureF the LV exists independently of our measurement of it

Realist or Instrumentalist interpretations are a matter of statistical

inference

2Borsboom, Mellenbergh et al., 2003 Psychol Rev 110:203-18

21 / 23

Page 22: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Conclusion

Parting Words

Why use SEM?I More easily specify analysis to answer research questionI Gain statistical power

Not sure if SEM is right for you?I Stata Corp recently added SEM to Stata (version 12)I ... and pitching SEM to EconomistsI http://www.stata.com/news/statanews.26.3.pdf

Why use Mplus?I Regression-based framework for exogenous variablesI Categorical dependent variablesI Categorical latent variablesI Complex sampling weightsI Survival analysisI Bayesian data analysis

22 / 23

Page 23: An Overview of Structural EquationModeling using Mplus · I Mixed e ect models for longitudinal data I Survival and event occurrence (Cox, parametric survival) Missing data theory

Conclusion Penetrance of SEM Concepts and Software

Google Scholar Hits 1998-2011 (5 Oct 2011)

Keyword phrase NEJM JAMA+ AJE

analysis 10,000 16,000 4,500

logistic regression 1,000 2,000 2,000linear regression 620 1,700 1,700cox proportional 610 760 650random e�ects 100 540 340generalized estimating 110 280 320

factor analysis 90 190 170structural equation 21 25 49pro�le mixture OR latent class 13 12 21item response theory 2 12 3path analysis 5 10 19latent growth curve 0 1 4

SAS Institute 440 2,500 1,400SPSS 210 1,100 240Stata 190 800 590R Foundation OR R Development 36 100 87

Mplus 0 12 15LISREL 1 3 13

Note: Hits include the text matches in the reference list.

Values greater than 100 rounded to two signi�cant digits.

23 / 23