lecture 10 - comparing groups (categorical data) Flashcards
(15 cards)
how do we know to use the multidimensional Chi-Square (x^2)?
- Three sports groups (Field sports, rowing & netball)
- Three sets of measurements (the weight category of each subject in that sporting group)
- No subjects/participants appears in more than one group
- This research design is independent
what is the multidimensional chi-square (x^2)?
- test of association. it is also known as chi-square (x^2) test of independence
how is the chi-square carried out?
like other inferential statistics
what’s the formula for the Chi-square?
(x^2)= sum of ((O-E)^2/E))
where x^2 is the symbol for chi-square
O is the observed count
E is the expected count
what does the chi-square compare?
observed frequencies in our sample with the frequencies we would expect if there was no relationship at all between the two variables in the population that the sample was drawn from.
what are the assumptions of the Chi-square?
- measures must be independent
- variables must be categorical
- randomly sampled
- counts, not %
- calculated expected values must not be less than 1
- no more than 20% of expected values should be less than 5
- solution= collect more data, collapse categories, or use an exact test (Fishers exact)
what are the null and alternative hypotheses?
null: the variables do not differ
alternative: the variables do differ
how do we calculate expected cell count?
expected cell count = (col marginal*row marginal)/grand total
what do we do for each cell in the table?
X^2 = sum of (O-E)^2/E
we deduct the expected count from the observed (O-E)
O-E is squared to remove the minus sign
O-E squared is divided by the expected count
all of these values are summed, giving us a chi-square statistic. How different our sample is from what we would expect if H0 is true
what are the degrees of freedom?
The more categories there are in the Independent Variable and Dependent Variable, the more chance there is of the analysis being affected by sampling error
how to calculate degrees of freedom?
the number of categories in the row variable minus one multiplied by
the number of categories in the column variable minus one
DON’T COUNT THE MARGINAL CELLS
what is count equivalent to In SPSS output?
observed counts
when do we use Fishers exact test?
More than 20% of cells have an expected count of less than 5
and if the exact sig. (2-tailed) is less than 0.05 we reject the null hypothesis
what is the hypothetical paired research design
e.g. an athlete’s weight is measured at beginning and end of season
time is the IV (beginning or end)
each athlete has a measurement in each category of the IV
- this is a paired research design, also called within subjects design or repeated measures design
what is the McNemar test for repeated measures?
- if we had a repeated measures design with two dichotomous variables, we would use a McNemar test
- used to test for differences where samples are paired
- dependent and independent variables must be dichotomous (only two categories each)