PCA Flashcards

(32 cards)

1
Q

PCA

A

Principle Component Analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does PCA allow you to do

A

Do all of the questions in a questionnaire tap into the same underlying construct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

PCA - correlate

A

Looks at which items correlate (R-Matrix) with each other and calls them the component/factor

e.g. component 1 - BEING AWFUL
items = saying “im not racist but” and disliking dogs

e.g. component 2 - BEING SOUND
items = smily idk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Kaiser’s rule

A

Eigenvalues greater than 1 mean the component IS VALID

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Eigenvalues

A

You use the Eigenvalues to judge whether the components are worth keeping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Proportion variance

A

How important each component is - the percentage of variance that the component explains

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Cumulative variance

A

Adds the percentages of variance up

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Component loadings

A

The PCA gives us a number of components, but this doesn’t tell us which of our individual measures make up each component

Component loading tell us this - they tell us what the association is between each item and each component

Essentially they are a Pearson’s correlation between the item and the factor/component

They give a component matrix

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What component loading is taken as a strong enough loading

A

.4

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

PCA - simplifying data

A

A PCA allows us to take many items and reduce the dimensionality of a construct

All these different measures and reduce it down to the core factors/components

If 3 items are all highly associated with each other, awe can combine them into a single measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does reducing the dimensionality of a construct reduce the likelihood of

A

Reduces the likelihood of false positives - because we test the one construct not all three separate measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why do we need to collect a lot of participants with PCA

A

PCA is based around an r matrix Pearson’s correlation

Questionnaires are Likert scales - non parametric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

SS Loadings

A

Eigenvalues

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Scree plot

A

Scree plots simply graph Eigenvalues and are another way to judge the number of factors/components

When the graph flattens there are no more relevant factors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Joliffes rule

A

Eigenvalues over .7 are valid in a PCA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What figure in the component matrix is a significant loading

A

Correlation coefficient of .4 and above

17
Q

Can component loading be negative

A

Yes - just means the items are negatively associated with the component

18
Q

Can the measure load onto more than one component

19
Q

Rotation

A

Simplifies the data and allows us to make inferences from it easier

Changes what these component loadings are to make it clearer for us

Rotation shifts the factors in two-dimensional space to make factor loadings clearer and easy to
interpret

20
Q

Orthogonal rotation methods

A

Assume that the factors in the analysis are uncorrelated, most commonly used is the varimax rotation

21
Q

Oblique rotation methods

A

Assume that the factors are correlated, most commonly used is the oblimin rotation

e.g. questionnaire that uses anxiety and depression

22
Q

Where do the factor names come from

A

You just name them based on what you think the questions seem to measure

23
Q

PCA assumptions

A

Data should NOT be nominal

Need to consider sampling adequacy

Sufficient correlations between individual variables is needed to run a PCA

24
Q

Sampling adequacy

A

This assesses whether or not PCA is appropriate for your data. This is measured by something called KMO

It assesses how much variance among all your variables might be common variance (i.e. explained by an underlying component or factor)

25
What is an acceptable sampling adequacy
KMO measure of above .5 (Hutcherson and Sofroniou) If less collect more data
26
Assumption - sufficient correlation between individual variables
We have to run a test to make sure the R matrix is not an identity matrix Identity matrix - none of the variables are correlated with each other at all Use Bartlett's test of sphericity
27
Bartlett's test of sphericity
Tests the null hypothesis that the correlations represent an identity matrix - we want this to be significant
28
How would you write up Bartlett's test of sphericity
Bartlett's test of sphericity demonstrated that correlations between items were large enough for PCA
29
How to write up sampling adequacy
The sampling adequacy was acceptable (KMO=...)
30
Given the overlap between factor one and two, what could be done in a reanalysis of the data to obtain clearer factors and why?
Oblimin rotation as it allows the factors to be correlated with each other
31
Sampling adequacy
Kaiser-Meyser-Olkin measure of sampling adequacy should be above 0.5
32
Sufficient correlations between individual variables to run a PCA
Bartlett's test of specificity should be significant