9 - Contingency Tables Flashcards

(16 cards)

1
Q

What is a contingency table?

A

A contingency table displays frequencies for combinations of two variables categorized into groups. It organizes outcomes for one variable in rows and the other variable in columns, with the intersection values representing frequencies for each combination.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the chi-squared (πœ’Β²) distribution?

A

The chi-squared distribution is a special case of the gamma distribution, widely used in inferential statistics to test for associations, including independence and goodness of fit. It is particularly useful for large sample sizes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are degrees of freedom in statistics?

A

Degrees of freedom refer to the number of independent pieces of information used to calculate a statistic. (n-1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How is the expected frequency calculated in a contingency table?

A

The expected frequency for any cell in a contingency table is calculated using the formula: (row total Γ— column total) / overall total. Expected frequencies are the theoretical frequencies assuming independence between the variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does the chi-squared test statistic (𝑋²) represent?

A

The chi-squared test statistic measures the difference between observed frequencies (𝑂) and expected frequencies (𝑬) in a contingency table, calculated as 𝑋² = βˆ‘((𝑂 βˆ’ 𝑬)Β² / 𝑬). It helps determine if there’s a significant association between the two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the null hypothesis in the context of contingency tables?

A

The null hypothesis posits that there is no association between the two variables being studied, implying that the observed sample proportions are due to chance. Testing this hypothesis helps determine if the variables are independent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why might small expected frequencies be a problem in a chi-squared test?

A

If the expected frequencies are small (less than 5), it is invalid to obtain a critical value from the chi-squared distribution. This can often be remedied by combining categories to create larger expected frequencies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the significance level in hypothesis testing?

A

The significance level is the threshold used to determine whether to reject the null hypothesis. Common levels are 1%, 5%, and 10%, indicating the probability of wrongly rejecting the null hypothesis if it is true.

These flashcards cover essential definitions, calculations, and concepts related to contingency tables and the chi-squared test, aiding in understanding and memorization for students studying statistics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What role do contingency tables play in hypothesis testing?

A

Contingency tables help in analyzing and testing the null hypothesis that two variables are independent. By examining observed and expected frequencies, researchers can determine if there is a statistically significant relationship between the variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When is the chi-squared test considered valid?

A

The chi-squared test is valid when the observed frequencies (𝑂) used for the test are counts (not percentages or other forms), and the expected frequencies (𝑬) are sufficiently large, typically greater than 5 in all cells of the contingency table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What should be done if expected frequencies are less than 5?

A

If any expected frequency is less than 5, groups should be combined to form larger expected values. This ensures that the chi-squared test remains valid and meaningful in evaluating associations between variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does the term β€œnull hypothesis” refer to in the context of chi-squared tests?

A

The null hypothesis (H0) shows that there is no association between the two variables being examined, meaning any observed differences in frequencies are due to chance rather than a systematic relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do researchers interpret the result of a chi-squared test?

A

Researchers interpret the chi-squared test result by comparing the calculated chi-squared statistic to a critical value determined by the degrees of freedom and significance level. If the statistic exceeds the critical value, the null hypothesis is rejected, indicating some association between the variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is an example of a practical application for a chi-squared test?

A

A practical application includes analyzing survey data to see if the preference for public transport versus road improvement is independent of the area of residence. By using a chi-squared test, researchers can determine if preferences differ significantly across neighborhoods.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Describe the formula for the test statistic 𝑋².

A

The test statistic 𝑋² is calculated using the formula: 𝑋² = βˆ‘((𝑂 βˆ’ 𝑬)Β² / 𝑬), where 𝑂 is the observed frequency for a cell and 𝑬 is the expected frequency for that cell. This formula assesses the degree of difference between observed and expected outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly