SEM Flashcards

Question

Using the formula to calculate the maximum number of single connections between observed variables, what do you have to compare this number with to check model identification?

Answer 1

Count all of the model pathways (ignoring disturbance/error terms), then compare the two numbers. The maximum number must equal or exceed the number of paths counted in the model.

Answer 2

Over-identified model: more correlations than free paths in the model. Just-identified (saturated) model: the number of correlations equals the number of free paths in the model. Under-identified model: fewer correlations than free paths, so the model cannot be estimated. Only over-identified or just-identified models can be estimated.
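
The counting check on these cards can be sketched in a few lines. This is only an illustration: it assumes the "maximum number of single connections" is the number of unique pairs of observed variables, v(v-1)/2, and the function names are made up for the example.

```python
def max_connections(n_observed: int) -> int:
    """Unique pairs of observed variables: v * (v - 1) / 2 (assumed formula)."""
    return n_observed * (n_observed - 1) // 2


def identification_status(n_observed: int, n_free_paths: int) -> str:
    """Classify a path model by comparing possible connections with free paths."""
    df = max_connections(n_observed) - n_free_paths
    if df > 0:
        return "over-identified"
    if df == 0:
        return "just-identified"
    return "under-identified"


# 5 observed variables allow 10 connections; 8 free paths leaves DF = 2.
print(max_connections(5))           # 10
print(identification_status(5, 8))  # over-identified
```

Only the first two outcomes describe a model that can be estimated.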

Answer 3

This is a model where all causal pathways move in the same direction, i.e. effects are uni-directional. (This is the most common form of model and is always identified.)

Answer 4

This is where there are reciprocal relationships between variables. These models are more complex to analyse, identification issues can be very problematic in complex non-recursive models, and they are not as common in the psychology literature.

Answer 5

Model estimation

Answer 6

The direct and indirect effects between variables, and global model fit. In the context of regression these would be the regression coefficients for individual predictors and the test of overall regression model fit, i.e. the ANOVA for R squared.

Answer 7

Direct and indirect effects (&error)

Answer 8

The path regression coefficients reflect direct relations between one variable and another (controlling for the effect of any other variable also affecting the endogenous variable). These are the same as the beta weights in normal MR (we can obtain these by simply running separate OLS regression models).

Answer 9

They are standardised regression coefficients, i.e. the beta weights from a regression output.

Answer 10

You can test the significance of (unstandardised) direct effects. However, you should consider the magnitude of direct effects, not just the significance. (Use past research as a guide, consider the substantive real-world meaning of effects, and use Cohen's rule of thumb: .1 = small, .3 = medium and .5 = large.)
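
As a rough illustration of the rule of thumb, the helper below labels a standardised effect by size. Treating .1, .3 and .5 as lower bounds, and the "negligible" label for anything smaller, are assumptions made for this sketch rather than part of the card.

```python
def effect_size_label(standardised_effect: float) -> str:
    """Label a standardised effect using the .1 / .3 / .5 rule of thumb."""
    size = abs(standardised_effect)  # the sign shows direction, not magnitude
    if size >= 0.5:
        return "large"
    if size >= 0.3:
        return "medium"
    if size >= 0.1:
        return "small"
    return "negligible"  # assumed label for effects below .1


print(effect_size_label(0.34))   # medium
print(effect_size_label(-0.12))  # small
```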

Answer 11

These are the effects of one variable on another variable via a mediator variable. In a standard one-mediator mediated regression there is one indirect effect: the effect of the IV on the DV via the mediator.

Answer 12

By multiplying the constituent paths, and then comparing this number to the direct effect pathway. The relationship between the two variables should shrink in the presence of the mediator, so the indirect path should be lower than the direct path.

Answer 13

.34 * .5 = .17 is the indirect path from neuroticism to depression. Neuroticism has a .34 direct effect on avoid, but only .5 of this is transmitted to depression via avoid. The indirect pathway means an increase in depression of .17 SD units for every 1 SD unit increase in neuroticism, via the effects of avoid.
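
The arithmetic on this card can be reproduced with a short sketch. The function name is illustrative; the path values are the ones given in the answer.

```python
from math import prod


def indirect_effect(paths):
    """Multiply the standardised coefficients along one indirect route."""
    return prod(paths)


neuroticism_to_avoid = 0.34
avoid_to_depression = 0.5

effect = indirect_effect([neuroticism_to_avoid, avoid_to_depression])
print(round(effect, 2))  # 0.17
```

The same function works for longer chains, since every extra mediator just adds one more coefficient to the product.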

Answer 14

Total effects represent the total causal effect of one variable on another. This is calculated by summing all of the direct and indirect effects.
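
A minimal sketch of the summation follows. The direct effect of .25 is a hypothetical value chosen only for illustration; the indirect effect reuses the .34 and .5 paths from the neuroticism example.

```python
def total_effect(direct, indirect_effects):
    """Sum the direct effect and every indirect effect between two variables."""
    return direct + sum(indirect_effects)


direct = 0.25          # hypothetical direct path, not taken from the cards
indirect = 0.34 * 0.5  # one indirect path via a single mediator

total = total_effect(direct, [indirect])
print(round(total, 2))  # 0.42
```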

Answer 15

You cannot enter and exit a variable on an arrowhead. You cannot enter a variable twice on the same trace.

Answer 16

To express relationships between variables in terms of direct and indirect effects, based on a causal model assumed to be correct (to quantify degrees of causality).

Answer 17

A plausible model is constructed independently of the analysis, using non-statistical means. Model fit statistics can give some indication of model plausibility. NB correlation does not equal causation, and we cannot determine causal direction statistically.

Answer 18

1. Time precedence 2. Theory 3. Previous research 4. Logic/sound rationale

Answer 19

1. Relationship (X should be correlated with Y) 2. Temporal precedence (X must precede Y in time) 3. Non-spuriousness (the X-Y relationship should hold after controlling for other variables experimentally or statistically, e.g. the third variable issue) 4. Correct effect priority (there are no reciprocal relationships between X and Y, or reversals of this relationship)

Answer 20

The difference between the full saturated model and the reduced model, e.g. if the full model could have 10 pathways and 8 were specified, DF = 2.

Answer 21

In its most basic form it is a simple extension of multiple regression

Answer 22

Disturbance terms point towards latent factors and error terms to measured variables

Answer 23

Does the model produce an estimated population covariance matrix that is consistent with the sample (observed) covariance matrix? Basically, is the constrained model consistent with the saturated model?

Answer 24

Both will be 0

Answer 25

Bad fit of the reduced model to the data

Answer 26

Sample size: with large samples, your model is likely to be significantly worse even when differences in fit are substantively small.

Answer 27

It's a model that specifies that all of the relationships between the variables are 0, so it will always be a bad fit to the data. It is used because fit indices sometimes compare the default model to the independence model: 'how much better is it?'

Answer 28

A residual correlation is the difference between a sample correlation and the implied correlation. The SRMR is based on the average absolute values of the residual correlations. An SRMR of zero would equal perfect fit (no residual). SRMR
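
The idea of averaging absolute residual correlations can be sketched as below. This follows the simplified description on the card; the SRMR reported by SEM software is a root mean square of standardised residuals, so the number here is an approximation of the concept, not the software's statistic. Both matrices are made-up values.

```python
def mean_abs_residual(sample, implied):
    """Average absolute residual correlation over unique variable pairs."""
    n = len(sample)
    residuals = [
        abs(sample[i][j] - implied[i][j])
        for i in range(n)
        for j in range(i)  # lower triangle only, skipping the diagonal
    ]
    return sum(residuals) / len(residuals)


# Hypothetical 3-variable correlation matrices.
sample_corr = [[1.0, 0.50, 0.40],
               [0.50, 1.0, 0.30],
               [0.40, 0.30, 1.0]]
implied_corr = [[1.0, 0.45, 0.40],
                [0.45, 1.0, 0.36],
                [0.40, 0.36, 1.0]]

print(round(mean_abs_residual(sample_corr, implied_corr), 3))
```

A value of 0 is returned only when the implied correlations reproduce the sample correlations exactly.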

Answer 29

A popular fit measure, designed to assess the approximate fit of a model while rewarding parsimony. Of two models with similar explanatory power, the simpler model, with fewer paths (DF), will be favoured.

Answer 30

A different approach to model fit. Compares the researcher's model with the independence model (the independence model predicts all variables are independent, i.e. zero correlations). Analogous to R squared: it estimates the total variance accounted for by our model. GFI > .95 = good fit; GFI > .90 = adequate fit.

Answer 31

Is to rule out bad models. The limitation is that it cannot prove a good model. Bad model fit means that the model doesn't explain the data as well as others might. Good model fit fails to disconfirm your model: you may have a good model, but 'fit' is with reference to the variables in your model. Alternative models with different specification paths might be even better, so it is still worth testing alternative models, and it may be that there is a more complete model (more variables).

Answer 32

Extends observed variable path analysis by creating a latent variable measurement model, and then examining relationships between these latent variable factors

Answer 33

Specify and estimate a candidate measurement model (aka confirmatory factor analysis). Once you have a viable measurement model, you re-specify the model as a structural model and examine the relationships between latent factors.

Answer 34

They are used to test theoretically derived models of psychological measures. Often used in the development of psychological measures after having used EFA (exploratory factor analysis) to initially develop and refine the measure. Once we have an EFA-derived measure we can administer it to a new sample and see if we can confirm the original measurement model. Can tell us important information about how a measurement tool is structured and/or how latent factors relate to each other.

Answer 35

The principles underlying CFA are largely the same as those in EFA. Before undertaking a CFA we should use the same assumption checks and data screening as in EFA. The typical difference between the two is that in CFA we constrain factor loadings (usually to be 0), i.e. we do not allow all observed items/indicators to load freely on all of the factors. So the CFA model is a more constrained version of the EFA model.

Answer 36

Are measured or indicator variables (observed variables), and are represented by a square.

Answer 37

Estimate the relationship between the factor and the observed indicator. Can be thought of as the correlation between the factor and the indicator in standard CFA models. We typically like these to be > .50.

Answer 38

Estimate the relationship between latent factors. We can use this information to examine the convergent and discriminant validity of the factors.

Answer 39

These model variation in the indicator variable not accounted for by the factor, e.g. anything else that accounts for variance in the indicator variable (other influences and error). These error terms are usually uncorrelated with each other, but you could model error correlations if you expected that responses across indicators would be caused by something other than the factors, e.g. method effects.

Answer 40

1. Refer to theory/previous research to ascertain the appropriate level 2. Specify the model 3. Model identification 4. Model estimation 5. Testing model fit 6. Interpret model effects 7. Modifying models 8. Reporting results

Answer 41

As you cannot know the variance of unmeasured variables, fix the error variances to 1 in model specification, or fix raw error loadings to 1 (AMOS default), which sets error variance based on indicator variance. This is important for model identification.

Answer 42

Factors are unmeasured, so their variance is unknown. Fix the factor variance to 1 or set raw factor loadings to 1 (only one factor loading needs to be set to 1 per factor). This is important for identification of the model.

Answer 43

Known values. Number of knowns = v*(v+1)/2, where v equals the number of variables.

Answer 44

Calculate the knowns, v*(v+1)/2, and the unknowns (count up the number of free paths and variances). Subtract the unknowns from the knowns to get DF. If model DF is greater than or equal to 0 then proceed, i.e. the model is identified. If not, you need to re-specify your model.
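
The knowns-minus-unknowns check can be sketched as follows. The function names are illustrative, and the two example models are hypothetical: they assume factor variances are fixed to 1, so the free parameters are the loadings, the error variances and any factor covariances.

```python
def n_knowns(n_indicators: int) -> int:
    """Unique variances and covariances among the indicators: v * (v + 1) / 2."""
    return n_indicators * (n_indicators + 1) // 2


def model_df(n_indicators: int, n_free_parameters: int) -> int:
    """Degrees of freedom: knowns minus unknowns (free parameters)."""
    return n_knowns(n_indicators) - n_free_parameters


# One factor, 3 indicators: 3 loadings + 3 error variances = 6 unknowns.
print(model_df(3, 6))   # 0

# Two factors, 3 indicators each: 6 loadings + 6 error variances
# + 1 factor covariance = 13 unknowns.
print(model_df(6, 13))  # 8
```

A negative result would mean the model needs to be re-specified before estimation.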

Answer 45

If a model with a single factor has 3 or more indicators it will be identified. If a model with 2 or more factors has 2 or more indicators per factor it will be identified.

Answer 46

Estimate model parameters, e.g. factor loadings and factor covariances. Test global model fit. (We can also then compare the fit of competing measurement models, specify alternative models etc.)

Answer 47

If the factor correlations are > .75-.80 then this may suggest that the model is 'over-factored' or that one of the factors is redundant. A more plausible model might involve collapsing the factors into one and re-estimating (this is where you would also need to rely on what theory and previous research suggest).

Answer 48

That you should possibly remove this indicator from your measure in the future (i.e. if it is a questionnaire and your factor does not load highly on to item 6, maybe this item is not really tapping into the factor that you want, so remove it as it just adds noise to your data).

Answer 49

The same as in path analysis. Residual correlations (sample correlations minus implied correlations; sample correlations are observed correlations and implied correlations are calculated from the model loadings; smaller residual correlations = better fitting model; larger specific residual correlations may indicate that part of the model is misspecified). Chi-square (examine the fit of an individual model by comparing the model with the observed data, so we want a non-significant chi-square value, i.e. no significant difference between model and data; can also directly test differences between nested (hierarchical) models, using the difference between the models' DF to find the critical chi-square value). RMSEA and GFI as well! SRMR (average absolute value of the residual correlations, so 0 means perfect fit; SRMR

Answer 50

Check the default model in Amos (non significant = good)

Answer 51

Very close to 0

Answer 52

Start with a bare-bones model and then add path(s). If extra paths significantly improve fit, these are added to the model.

Answer 53

Typically start with a saturated model and simplify it by eliminating paths. If the model fit does not significantly deteriorate then paths can be removed (the model is no worse but simpler, i.e. more parsimonious).

Answer 54

Calculate chi-square for the first model and then the second. Calculate the difference between the chi-square statistics for the two models. If the chi-square difference is significant then the model is significantly improved by adding paths, and these can be retained in your refined model. NB when checking whether the chi-square difference is significant, you use the difference between the models' DF to look up the critical value in the table.

Answer 55

MIs can be used to add individual paths to the model. These are an output from Amos. The larger the MI, the greater the improvement in model fit. The usual convention is that MI > 4 suggests an improvement in model fit and the path should be added.

Answer 56

Yes: as 4.03 is smaller than 5.99 there is not a significant difference, so the new model does not have a significantly worse fit than the saturated model. As it is more parsimonious, it is accepted.
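
The comparison on this card can be sketched with a small lookup table. The card does not state the DF difference; 5.99 is the standard critical chi-square value for 2 DF at alpha = .05, so a DF difference of 2 is assumed here. The table and function name are illustrative.

```python
# Standard critical chi-square values at alpha = .05, keyed by DF.
CRITICAL_CHI_SQUARE_05 = {1: 3.84, 2: 5.99, 3: 7.81, 4: 9.49, 5: 11.07}


def fit_significantly_different(chi_square_difference: float, df_difference: int) -> bool:
    """True when the chi-square difference exceeds the critical value."""
    return chi_square_difference > CRITICAL_CHI_SQUARE_05[df_difference]


# Difference of 4.03 with an assumed DF difference of 2.
print(fit_significantly_different(4.03, 2))  # False
```

A result of False means the simpler model fits no worse, so it can be kept on grounds of parsimony.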

Answer 57

AIC and BIC

Answer 58

Theoretical approach

Answer 59

Paths are added to or deleted from the model purely on the basis of statistical criteria. In model building, MIs for all paths are examined to see which ones significantly improve the model. This can capitalise on chance correlations. This type of SEM is more exploratory. The model improvement is more credible if the model structure is replicated in another sample.

Answer 60

Test an SEM across a categorical variable, e.g. gender. We might want to look at model estimates in different groups, or see whether a particular model holds across groups, i.e. it is invariant across groups. This can be done for a CFA model or a full SEM. It uses the principle of iteratively constraining parameters in the model to equality across the groups (implying they are the same in each group), and then looking to see if this produces a significant decrement in model fit. If a significant decrease in model fit occurs, you then have to identify which parameters have caused this problem, i.e. you can iteratively free parameters to identify the source of the misfit.

Answer 61

Estimate the model simultaneously in the groups, freely estimating all of the model parameters. This is often referred to as a test of configural invariance. If the above model shows good fit, you could then test a further model that constrains the factor loadings and/or path coefficients to equality across the groups. If this model shows good fit, you can then assume the model parameters are consistent across groups. If not, you can iteratively free paths to diagnose the ill-fit and establish what is referred to as partial invariance.

Answer 62

Those that apply to correlation/regression: Linearity (the dependent (endogenous) variable should be linearly related to the IVs (exogenous)). SEM programmes can handle continuous and categorical variables, but check the coding of categorical variables and make sure the programme knows what codes are being used. Normality (residuals should be normally distributed and homoscedastic). Disturbances uncorrelated with endogenous variables. No multicollinearity. Exogenous variables are reliably measured. Additional: Identification (models cannot be under-identified). Adequate sample size (Kline recommends at least 10 times as many cases as parameters (paths), ideally 20). Proper model specification (specification errors occur when common causal variables are left out of the model).
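
The sample-size guideline is simple enough to check by hand, but a small helper makes the two thresholds explicit. The function name and the example of 12 parameters are hypothetical; the ratios of 10 and 20 come from the card.

```python
def minimum_cases(n_parameters: int, cases_per_parameter: int = 10) -> int:
    """Cases suggested by a cases-per-parameter rule of thumb."""
    return n_parameters * cases_per_parameter


n_parameters = 12  # hypothetical model with 12 free parameters
print(minimum_cases(n_parameters))      # 120, the minimum
print(minimum_cases(n_parameters, 20))  # 240, the ideal
```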


(86 cards)