Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Structural equation modeling (SEM) with lavaan
Rating: 4.2 out of 5(405 ratings)
3,106 students

Structural equation modeling (SEM) with lavaan

Learn how to specify, estimate and interpret SEM models with no-cost professional R software used by experts worldwide.
Last updated 5/2021
English

What you'll learn

  • Specify and estimate parameters in a structural equation model using the R lavaan package and interpret and report on the SEM model results.
  • Perform exploratory and confirmatory factors analyses (EFAs and CFAs) using their own datasets.
  • Use a variety of multiple imputation techniques to "fill in," and correct for, missing data.
  • Specify and estimate mediated and other indirect SEM effects using traditional parametric confidence intervals, as well as using bootstrapped and/or bias-corrected and accelerated non-parametric approaches.
  • Specify and estimate the fit of multi-group SEM models, as well as determine levels of measurement invariance (metric, scalar, configural).
  • Output beautiful multi-color plots of fitted SEM models for use in reports and publications.
  • Understand how to set-up, specify, estimate and interpret a latent (growth) curve model, using alternate random intercept and slope model specifications.

Course content

8 sections73 lectures11h 16m total length
  • Introduction to Course and to R (slides)7:04

    R is a programming language and software environment for statistical computing and graphics. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. Polls, surveys of data miners, and studies of scholarly literature databases show that R's popularity has increased substantially in recent years.

    R is an implementation of the S programming language combined with lexical scoping semantics inspired by Scheme. S was created by John Chambers while at Bell Labs. There are some important differences, but much of the code written for S runs unaltered.

    R was created by Ross Ihaka and Robert Gentleman at the University of Auckland, New Zealand, and is currently developed by the R Development Core Team, of which Chambers is a member. R is named partly after the first names of the first two R authors and partly as a play on the name of S.

    R is a GNU project. The source code for the R software environment is written primarily in C, Fortran, and R. R is freely available under the GNU General Public License, and pre-compiled binary versions are provided for various operating systems. R uses a command line interface; there are also several graphical front-ends for it.

  • Introduction to Path Modeling and SEM (slides, part 1)9:31

    Structural equation modeling (SEM) is a family of statistical methods designed to test a conceptual or theoretical model.[ Some common SEM methods include confirmatory factor analysis, path analysis, and latent growth modeling.[ The term "structural equation model" most commonly refers to a combination of two things: a "measurement model" that defines latent variables using one or more observed variables, and a "structural regression model" that links latent variables together. The parts of a structural equation model are linked to one another using a system of simultaneous regression equations.

    SEM is widely used in the social sciences because of its ability to isolate observational error from measurement of latent variables. To provide a simple example, the concept of human intelligence cannot be measured directly as one could measure height or weight. Instead, psychologists develop theories of intelligence and write measurement instruments with items (questions) designed to measure intelligence according to their theory. They would then use SEM to test their theory using data gathered from people who took their intelligence test. With SEM, "intelligence" would be the latent variable and the test items would be the observed variables.

  • Introduction to Path Modeling and SEM (slides, part 2)9:30

    In statistics, path analysis is used to describe the directed dependencies among a set of variables. This includes models equivalent to any form of multiple regression analysis, factor analysis, canonical correlation analysis, discriminant analysis, as well as more general families of models in the multivariate analysis of variance and covariance analyses (MANOVA, ANOVA, ANCOVA).

    In addition to being thought of as a form of multiple regression focusing on causality, path analysis can be viewed as a special case of structural equation modeling (SEM) – one in which only single indicators are employed for each of the variables in the causal model. That is, path analysis is SEM with a structural model, but no measurement model. Other terms used to refer to path analysis include causal modeling, analysis of covariance structures, and latent variable models.

  • Input and Output into R11:08
  • Useful Data Summary Statistics13:28
  • JSS Reading and Exercise #12:27
  • What is lavaan (up to syntax) ?14:09
  • Estimate an Example Confirmatory Factor Analysis (CFA)8:43

    In statistics, confirmatory factor analysis (CFA) is a special form of factor analysis, most commonly used in social research.[ It is used to test whether measures of a construct are consistent with a researcher's understanding of the nature of that construct (or factor). As such, the objective of confirmatory factor analysis is to test whether the data fit a hypothesized measurement model. This hypothesized model is based on theory and/or previous analytic research. CFA was first developed by Jöreskog and has built upon and replaced older methods of analyzing construct validity such as the MTMM Matrix as described in Campbell & Fiske (1959).

    In confirmatory factor analysis, the researcher first develops a hypothesis about what factors s/he believes are underlying the measures s/he has used (e.g., "Depression" being the factor underlying the Beck Depression Inventory and the Hamilton Rating Scale for Depression) and may impose constraints on the model based on these a priori hypotheses. By imposing these constraints, the researcher is forcing the model to be consistent with his/her theory. For example, if it is posited that there are two factors accounting for the covariance in the measures, and that these factors are unrelated to one another, the researcher can create a model where the correlation between factor A and factor B is constrained to zero. Model fit measures could then be obtained to assess how well the proposed model captured the covariance between all the items or measures in the model. If the constraints the researcher has imposed on the model are inconsistent with the sample data, then the results of statistical tests of model fit will indicate a poor fit, and the model will be rejected. If the fit is poor, it may be due to some items measuring multiple factors. It might also be that some items within a factor are more related to each other than others.

  • Other Useful lavaan Fitted Results Functions6:38

Requirements

  • Students will be required to install no-cost R and RStudio software (instructions are provided).
  • Students who are new to R software will need to need to use and practice with the "introduction to R" scripts and exercises that are provided with the course's videos and materials.

Description

This "hands-on" course teaches one how to use the R software lavaan package to specify, estimate the parameters of, and interpret covariance-based structural equation (SEM) models that use latent variables. "lavaan" (note the purposeful use of lowercase "L" in 'lavaan') is an acronym for latent variable analysis, and the name suggests the long-term goal of the developer, Yves Rosseel: "to provide a collection of tools that can be used to explore, estimate, and understand a wide family of latent variable models, including factor analysis, structural equation, longitudinal, multilevel, latent class, item response, and missing data models." The course uses and executes many "live" examples (with included R scripts and datasets) using no-cost R and RStudio software to demonstrate and teach how to: (1) specify a SEM model in lavaan syntax; (2) fit and then evaluate your model; (3) perform a CFA; (4) impute and replace missing data; (5) estimate mediating and other indirect effects; (6) estimate and evaluate multigroup models, simultaneously establishing measurement invariance; and (7) specifying and estimating latent (growth) curve models, including the use of random (and latent) intercepts and slopes. The R lavaan package is world-class 'professional-grade' SEM software, used by thousands of SEM experts, graduate students, and college and university faculty around the world.

Who this course is for:

  • Course participants may be "brand-new" (inexperienced) to using both R software and/or SEM model estimation, or they may be more experienced in one or both techniques.
  • This course is very useful for graduate students, quantitative-analysis professionals, and/or for college and university faculty who analyze research data using path models characterized by latent variables.
  • This course is appropriate for anyone wishing to learn more about specifying, estimating and intrepreting covariance-based SEM models using the no-cost professional-grade SEM modeling features in the lavaan (and other) packages in R software.
  • The course is appropriate for anyone who wishes to learn how to use a no-cost, professional SEM software suite regarded as an alternative to MPlus.