Chapter 13 Exploratory Factor Analysis

LEARNING OUTCOMES

  • Identify and compare the key features of formative and reflective models.
  • Evaluate the differences between Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA).
  • Apply R to tidy and organize the data set before estimating the scale's correlation matrix and generating data visualizations.
  • Apply factor extraction and rotation techniques to generate the scale's simplest factorial structure.
  • Appraise the outputs of the EFA model, interpret the results, and generate plots showing the factorial structure of a scale.

Multivariate statistics are a set of statistical techniques aimed at analyzing data when we have many independent variables and/or many dependent variables correlated with one another to varying degrees (Tabachnick & Fidell, 2013). The variables included in our research design might be related via dependence or interdependence models (Hair et al., 2006).

Dependence methods attempt to predict the outcome (i.e., dependent variable) using one or several predictors (i.e., independent variables). Thus, some variables will be retained in the final model allowing us to explain, to a certain degree, the outcome. Dependence methods are usually classified as a function of the number of dependent variables being measured (i.e., one dependent variable versus two or more) and the nature of the measurement scales and predictors (i.e., nominal, ordinal, or quantitative data).

For instance, to estimate a predictive model when the outcome is one quantitative variable and the explanatory variables are quantitative as well, we use a multiple regression analysis technique (General Linear Model).

\[\begin{aligned} Y = \beta_{0} \ + \beta_{1}X_{1} \ + \beta_{2}X_{2} \ + \beta_{3}X_{1}X_{2} + \epsilon \\ \end{aligned}\]

On the other hand, to estimate a predictive model when we have several quantitative outcomes (i.e., several dependent variables) and the explanatory variables are quantitative as well, we will use a Structural Equation Modeling (SEM) approach.

Interdependence methods are aimed at finding the underlying structure of a set of variables, cases, or objects that are analyzed simultaneously. As Table 13.1 shows, these multivariate techniques are usually classified by the nature of the correlated variables (variables, cases, or objects) and by the type of data that we use (quantitative versus categorical data).

Table 13.1: Classification of Multivariate Analyses Used in Interdependence Models
Relationships among Data Statistical technique
Variables Quantitative Principal components
Factor analysis
Cluster analysis
Cases Quantitative Cluster analysis
Objects Quantitative Multidimensional scaling
Categorical Correspondence analysis

Interdependence methods could be used for three main purposes related to specific multivariate techniques:

  • Data reduction: We are interested in the relationship among variables by simplifying the structure of the phenomenon that we study (e.g., job burnout). To do so, we use Principal Components Analysis (PCA) and Factor Analysis (FA).

  • Grouping cases and variables: We are interested in clustering objects and variables by similarity. To accomplish this goal, we use Cluster Analysis.

  • Structure of objects: We are interested in exploring the underlying structure of objects. We use Multidimensional Scaling for quantitative data and Correspondence Analysis for categorical data.