unit 8/9 things to remember Flashcards

1
Q

x2 assumptions

A
  1. SRS
  2. 10% rule where n<10% of population
  3. large number count - all expected counts are greater >5
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

when to use GOF chi square?

A

when there is one categorical variable with on population, comparing one population to a baseline

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

how to calculate expected value and degree of freedom for GOF chi square test

A

expected value - (percent of outcomes)x(total population)

degree of freedom - # of subcategories - 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

mechanics to know for GOF

A

P(x2>5/5) –> always greater than

use second vars #8 to find p value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

when to use homogeneity x2 test

A

1 categorical variable with multiple populations - comparing multiple groups of samples

again, MULTIPLE populations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

how to calculate expected value and degree of freedom for homogeneity chi square test

A

expected value - (marginal value of column)x(marginal value of row) / total population

df - (number of rows - 1)(number of columns -1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

mechanics to know for homogeneity + hypothesis

A

again, p( x2> jrngo) – GREATER THAN

normal x2 test

hypotheses: the number of (subject) will be spread evenly between the groups.

use a matrix :)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

when to use an independence x2 test

A

when there are 2 categorical variables and 1 population and the word association or something like that is used

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

how to calculate expected value and degrees freedom for an independence test

A

expected value - (marginal value of column)x(marginal value of row) / total population

df - (number of rows - 1)(number of columns -1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

r

A

correlation coefficient - describes the strength and direction of the linear association of the scatter plot equation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

LSRL - least squares residual line

A

y hat = a + bx
y hat = predicted y given x
a = y intercept
b = slope

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

se

A

standard deviation of residual

“the actual y variable is typically about s away from the predicted y variable”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Sb

A

standard error of slope. average distance that observed values deviate from the regression line. the smaller the closer to the line

“the slope of the sample LSRL typically varies from the slope of the. population LSRL by about Sb”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

one sample t interval for slope conditions

A
  1. SRS
  2. 10% condition - independence
  3. Nearly normal - “The distribution of sample residuals is free of strong skewness and outliers” –> when n > 30
  4. nearly linear:
    a) make a scatterplot to identify data/ look at it
    b) check residuals
    “The scatter plot appears to be fairly linear and residual plot appears to not have a pattern”
  5. Equal SD: “Scatter plots show similar variability for each x value.”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

one sample t test for slope mechanics

A

everything same as the interval except mechanics:

t= b-beta(0) / SEb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Hypotheses and df for one sample t interval for slope

A

null: beta = O
alternative: beta doesn’t equal 0/ is greater/less than 0

df : n-2