analysing categorical data (chi-squared) Flashcards

1
Q

How do we analyse categorical data?

A

-Predict category that someone falls into
-Create contingency table and perform chi-squared test of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a contingency table?

A

-Table of frequencies for how often an observation occurs in a category
-Categories have to be mutually exclusive and exhaustive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Describe the use of a chi-squared test

A

-Calculates how often an observation falls into a category based on how many were expected by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the use of the null hypothesis in this situation?

A

-Means that the frequencies observed were expected by chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the use of the experimental hypothesis in this situation?

A

-Means that the frequencies observed reflect real differences in categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What 2 assumptions can be made?

A

1: Independence
-Each person can only contribute to one cell of a contingency table
2: Expected frequencies
-All expected counts should be greater than 1 and no more than 20% of expected counts should be less than 5-If violated, power can be lost

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What can we do to prevent loss of power?

A

-Use an exact test instead e.g. fisher’s or MLR
-Collapse/remove data across one variable
-Collapse levels of one variable
-Collect more data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the 3 steps for a chi-squared formula?

A

1: Calculate expected frequencies
2: Calculate chi-squared value based on observed and expected
3: Compare chi-squared value against critical value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What differs if you have 2 IV’s?

A

-Difference will be calculating the expected values in each case
-Have to calculate expected frequencies of specific cells

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a binomial test?

A

-Compares observed and expected frequencies for variable with only 2 levels
-E.g. Are there more people in the sample from USA than what we would expect by chance?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly