saiyidi mat roni - 2014 - partial least square in a nutshell
DESCRIPTION
Partial least square (PLS) a robust structural equation modelling (SEM) approach. It issometimes referred as component-based SEM or simply PLS-SEM. PLS-SEM is a causal modelling statistical approach with an aim to maximise explained variance of dependent latent variables (Chin, 1998b; Hair, Ringle, & Sarstedt, 2011).UnlikeTRANSCRIPT
1 Partial least square in a nutshell | Saiyidi MAT RONI 1 November 2014
2 0 1 4
S a i y i d i M A T R O N I
2 Partial least square in a nutshell | Saiyidi MAT RONI 1 November 2014
1 Partial least square in a nutshell
Partial least square (PLS) a robust structural equation modelling (SEM) approach. It is
sometimes referred as component-based SEM or simply PLS-SEM. PLS-SEM is a causal
modelling statistical approach with an aim to maximise explained variance of dependent
latent variables (Chin, 1998b; Hair, Ringle, & Sarstedt, 2011).
Unlike covariance-based SEM, e.g. provided by AMOS and LISREL, PLS-SEM does
not put emphasis on normality of data distribution (Ringle, Sarstedt, & Straub, 2012), which
allows researchers to use their raw data in the analysis without having to transform their data
to make it at least, approximate normally distributed, prior to running SEM. Although some
studies suggest choosing PLS-SEM over covariance-based SEM on a basis of non-normal
data distribution is a weak argument, the fact that violation of normality assumption can
produce unintended biases in the final statistical result (or no solution at all), it is prudent to
opt for PLS-SEM to alleviate two serious issues with covariance-based SEM: improper
solutions where solution is beyond what is gauged by parameters, and factor indeterminacy
(Fornell & Bookstein, 1982).
For more in-depth review of PLS-SEM features Hair et al. (2011), Chin (1998b), Wold
(1985) and Hair Jr, Sarstedt, Hopkins, and Kuppelwieser (2014) are good references to begin
with.
3 Approach to structural equation modelling (SEM) | Saiyidi MAT RONI 1 November 2014
Screen data •Delete monotones
Missing value analysis •Choose MI or EM
Check outliers •Trim or Winsor
Check normality •transform if necessary
Run factor analysis •Extraction: PCA, PAF or ML •Rotation: Orthogonal or oblique
Reliability & validity •Cronbach's alpha • AVE
Address biases •CMB -> Harman's single factor score •NRB -> split half, then test means difference
2 Approach to structural equation modelling (SEM)
2.1 Preliminary data analysis
As in most cases, before an actual analysis can be performed, it is advisable to run
preliminary data analysis (PDA). PDA is recommended to ensure the dataset is ‘cleaned’ and
‘cleansed’ of substantial noise that attract biases in the final results of the actual analysis. The
following diagram illustrates a common approach to PDA.
AVE = average variance extracted CMB = common method bias EM = Expected maximisation MI = Multiple imputations ML = Maximum likelihood NRB = Non-response bias PAF = Principal axis factoring PCA = Principal component analysis
4 Approach to structural equation modelling (SEM) | Saiyidi MAT RONI 1 November 2014
2.2 Measurement model
Once PDA is done, the next step to make before the interpretation of the output of a
structural model, is to look at the measurement model. Assessments of measurement model
than later interpret the result of the structural model are two-stage SEM approach suggested
by Anderson and Gerbing (1988) and Hair et al. (2011). Measurement model stage involves
an assessment of model parameters which included reliability test using Cronbach’s alpha
and composite reliability (Kock, 2013; Urbach & Ahlemann, 2010), validity analyses through
average variance extracted (AVE) and standardised factor loadings and cross-loadings
(Fornell & Larcker, 1981; Hair, Black, Babin, & Anderson, 2010; Kline, 2010; Schumacker
& Lomax, 2012), and also an assessment to determine the nature of construct being a
formative or reflective (Chin, 1998a). The quality of the measurement model is further
assessed on lateral and vertical collinearity using variance inflation factor (VIF) (Kock &
Lynn, 2012). The criteria for the assessment is summarised in Table 1 below.
Table 1: Measurement model assessment criteria.
Assessment Criterion Note Reference Item reliability Individual item standardised
loading on parent factor. Min. of .50 Hair et al. (2010)
Convergent validity
Individual item standardised loading on parent factor, and Loadings with sig. p-value
Min. of .50 p < .05
Hair et al. (2010) Gefen and Straub (2005)
Composite reliability > .70 Fornell and Larcker (1981) Nunnally and Bernstein (1994) Hair et al. (2010)
Average variance extracted (AVE)
> .50 Hair et al. (2010) Urbach and Ahlemann (2010)
Discriminant validity
Square-root of AVE More than the correlations of the latent variables.
Hair et al. (2010)
5 Approach to structural equation modelling (SEM) | Saiyidi MAT RONI 1 November 2014
Reliability Cronbach’s alpha
> .70
Nunnally and Bernstein (1994) Urbach and Ahlemann (2010) Hair et al. (2010)
Variance inflation factor (VIF) < 10 < 5
Hair et al. (2010) Kock and Lynn (2012)
Nature of construct
Formative / reflective: Chin (1998a) Coltman, Devinney, Midgley, and Veniak (2008)
6 Approach to structural equation modelling (SEM) | Saiyidi MAT RONI 1 November 2014
2.3 Structural model
After measurement model is found to be satisfactory, structural model parameter
estimates then can be used for analyses and interpretation. The structural model is evaluated
through coefficient of determination, R2(Chin, 1998a, 1998b), predictive relevance, Q2
(Geisser, 1975; Stone, 1974), effect size, f2 (Cohen, 2013), and path coefficients (see Hair et
al., 2011; Mohamadali, 2012). These criteria are summarised in Table 2 below.
Table 2: Structural model criteria.
Criterion Note Reference Coefficient of determination, R2 .67 substantial
.33 average
.19 weak
Chin (1998a)
Predictive relevance, Q2 > 0 Stone-Geisser test
Geisser (1975) Stone (1974)
Effect size, f 2 .02 small .15 medium .35 large
Cohen (2013)
Path coefficient Magnitude Sign p-value
Hair et al. (2010)
7 WarpPLS | Saiyidi MAT RONI 1 November 2014
3 WarpPLS
3.1 Start WarpPLS Double-click on the icon > Proceed to use software
8 WarpPLS | Saiyidi MAT RONI 1 November 2014
3.2 Transfer data set from Excel to WarpPLS readable format Data file: PLS.FullData.xlsx If your dataset is in SPSS format (.sav): File > Save As > Type in file name > Save as type: > select Excel 2007 through 2010 (*.xlsx) > Save.
Open PLS.FullData.xlsx > Your data should look like this:
9 WarpPLS | Saiyidi MAT RONI 1 November 2014
On WarpPLS: Proceed to Step 1 > Create project file > File name > Type BasicModel > Save.
Proceed to Step 2 > Read from file > Files of type: > (*.xlsx) > Select PLS.FullData > Open.
10 WarpPLS | Saiyidi MAT RONI 1 November 2014
Your data preview should look like this. Next > Finish > Check your data set > OK > Yes.
Proceed to Step 3 > Pre-process data > OK.
11 WarpPLS | Saiyidi MAT RONI 1 November 2014
Check your data > Yes.
On main window: Project > Save Project.
12 Basic model | Saiyidi MAT RONI 1 November 2014
4 Basic model
Data file: BasicModel.prj
3 Predictors (independent variables) and 1 criterion (dependent variable).
4.1 Create latent variables Proceed to Step 4 > Define SEM Model > Latent variable options > Create latent variable > Click anywhere on the canvas (white area).
13 Basic model | Saiyidi MAT RONI 1 November 2014
Latent variable name: > type EASE > Add indicators: Click PEOU1 > Add. Repeat the Add indicators steps for PEOU2, …, PEOU6 > Save > Save latent variable settings > OK.
14 Basic model | Saiyidi MAT RONI 1 November 2014
4.2 Create direct links among latent variables Proceed to Step 4 > Define SEM Model > Direct link options > Create direct link. Click EASE > Click INTENT > WarpPLS creates the line in the model. Repeat the process for USEFUL-INTENT and ATTITUDE-INTENT.
Model options > Save model and close > OK. Proceed to Step 5 > Perform SEM analysis > your result should look like this.
15 Basic model | Saiyidi MAT RONI 1 November 2014
4.3 Measurement model On main window > View/save analysis results.
Examine the value of each criterion and compare them against the thresholds listed in
the table below.
Assessment Criterion Note Steps Item reliability Individual item standardised
loading on parent factor. Min. of .50 View > View indicator
loadings and cross-loadings > View combined loadings and cross-loadings.
Convergent validity
Individual item standardised loading on parent factor. Loadings with sig. p-value
Min. of .50 < .05
View > View indicator loadings and cross-loadings > View combined loadings and cross-loadings
Composite reliability
> .70
View > View latent variable coefficients.
Average variance extracted (AVE)
> .50
View > View latent variable coefficients.
Discriminant validity
Square-root of AVE More than the correlations of the latent variables.
View > View correlations among latent variables and errors > View correlations among latent variables with sq. rts. of AVEs.
Reliability Cronbach’s alpha > .70 View > View latent variable coefficients.
Variance inflation factor (VIF)
< .50 View > View latent variable coefficients.
16 Basic model | Saiyidi MAT RONI 1 November 2014
Nature of construct
Formative / reflective: Look for Simpson’s paradox indication
View > View causality assessment coefficients > View path-correlation sign
4.4 Structural model On main window > View/save analysis results.
Examine the value of each criterion and compare them against the thresholds listed in
the table below.
Criterion Note Steps Coefficient of determination, R2 .67 substantial
.33 average
.19 weak
View > View latent variable coefficients.
Predictive relevance, Q2 > 0 Stone-Geisser test
View > View latent variable coefficients.
Effect size, f 2 .02 small .15 medium .35 large
View > View standard errors and effect sizes for path coefficients.
Path coefficient Magnitude Sign p-value
From the main structural model, or View > View path coefficients and P values.
17 Model with moderating variable | Saiyidi MAT RONI 1 November 2014
5 Model with moderating variable
Data file: ModeratingVariable.prj
Moderating
variables
18 Model with mediator variable | Saiyidi MAT RONI 1 November 2014
6 Model with mediator variable
Data file: MediatingVariable.prj
Mediating variable
19 Model with second-order factor | Saiyidi MAT RONI 1 November 2014
7 Model with second-order factor
Data file: SecondOrderFactor.prj .
Proceed to Step 5 > Perform SEM analysis > Close window.
On the main window > Modify > Add one or more latent variable…
Second-order factor comprises of identify and identity latent variables.
Moderating variable.
20 Model with second-order factor | Saiyidi MAT RONI 1 November 2014
Latent variable to be added: > select internal > Add > OK.
> select identify > Add > OK > Close.
Proceed to Step 4 > Yes > Define SEM model.
Latent variable name: > Type SOCIAL > Add indicators: > choose lv_internal > Add > choose lv_identify > Add.
Save > Save latent variable settings.
21 Model with control variable | Saiyidi MAT RONI 1 November 2014
8 Model with control variable
Data file: ControlVariable.prj
Control variable
Moderating variable
22 References | Saiyidi MAT RONI 1 November 2014
9 References
Anderson, J. C., & Gerbing, D. W. (1988). Structural equation modeling in practice: A
review and recommended two-step approach. Psychological Bulletin, 103(3), 411-
423. doi: 10.1037/0033-2909.103.3.411
Chin, W. W. (1998a). Issues and opinion on structural equation modeling. MIS Quarterly,
22(1), VII-XVI.
Chin, W. W. (1998b). The partial least squares approach to structural equation modelling. In
G. A. Marcoulides (Ed.), Modern methods for business research (pp. 295-336).
Mahwah, New Jersey: Lawrence Erlbaum Associates.
Cohen, J. (2013). Statistical Power Analysis for the Behavioral Sciences (2 ed.). Hoboken:
Taylor and Francis.
Coltman, T., Devinney, T. M., Midgley, D. F., & Veniak, S. (2008). Formative versus
reflective measurement models: Two applications of formative measurement. Journal
of Business Research, 61(12), 1250-1262.
Fornell, C., & Bookstein, F. (1982). Two structural equation models: LISREL and PLS
applied to consumer exit-voice theory. Journal of Marketing Research, 19, 440-452.
Fornell, C., & Larcker, D. F. (1981). Evaluating Structural Equation Models with
Unobservable Variables and Measurement Error. JMR, Journal of Marketing
Research, 18(1), 39.
Gefen, D., & Straub, D. (2005). A practical guide to factorial validity using PLS-graph:
Tutorial and annotated example. Communications of the Association for Information
Systems, 16, 91-109.
Geisser, S. (1975). The Predictive Sample Reuse Method with Applications. Journal of the
American Statistical Association, 70(350), 320-328. doi:
10.1080/01621459.1975.10479865
Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2010). Multivariate data analysis
(7 ed.). Upper Saddle River, NJ, USA: Prentice-Hall, Inc.
Hair, J. F., Ringle, C. M., & Sarstedt, M. (2011). PLS-SEM: Indeed a silver bullet. Journal of
Marketing Theory and Practice, 19(2), 139-151. doi: 10.273/MTP1069-6679190202
Hair Jr, J. F., Sarstedt, M., Hopkins, L., & Kuppelwieser, V. G. (2014). Partial least squares
structural equation modeling (PLS-SEM). European Business Review, 26(2), 106-121.
doi: doi:10.1108/EBR-10-2013-0128
23 References | Saiyidi MAT RONI 1 November 2014
Kline, R. B. (2010). Principles and Practice of Structural Equation Modeling, Third Edition
(3 ed.). New York: Guilford Publications.
Kock, N. (2013). WarpPLS 4.0 User Manual. Loredo, Texas: ScriptWarp Systems.
Kock, N., & Lynn, G. S. (2012). Lateral Collinearity and Misleading Results in Variance-
Based SEM: An Illustration and Recommendations. Journal of the Association for
Information Systems, 13(7), 546-580.
Mohamadali, N. A. K. (2012). Exploring new factors and the question of ‘which’ in user
acceptance studies of healthcare software. (Doctor of Philosophy), University of
Nottingham, Nottingham. Retrieved from
Nunnally, J. C., & Bernstein, I. H. (1994). Psychometric theory. New York: McGraw Hill.
Ringle, C. M., Sarstedt, M., & Straub, D. W. (2012). Editor's comments: a critical look at the
use of PLS-SEM in MIS quarterly. MIS Quarterly, 36(1), iii-xiv.
Schumacker, R. E., & Lomax, R. G. (2012). A Beginner's Guide to Structural Equation
Modeling : Third Edition (3 ed.). Hoboken: Taylor and Francis.
Stone, M. (1974). Cross-Validatory Choice and Assessment of Statistical Predictions. Journal
of the Royal Statistical Society. Series B (Methodological), 36(2), 111-147. doi:
10.2307/2984809
Urbach, N., & Ahlemann, F. (2010). Structural Equation Modeling in Information Systems
Research Using Partial Least Squares. Journal of Information Technology Theory and
Application, 11(2), 5-40.
Wold, H. (1985). Partial least square. In S. Kotz & N. L. Johnson (Eds.), Encyclopedia of
statistical sciences (Vol. 6, pp. 581-591). New York: Wiley.