M2 - Logistic Regression Flashcards

(39 cards)

1
Q

What differentiates LR from MR? pick best answer.

  1. LR is used for predicting group membership.
  2. LR only uses binary independent variables.
  3. LR has a dependent variable that is binary.
  4. LR output are graphs that have a straight line for the line of best fit.
A
  1. LR has a dependent variable that is binary.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Part B - Question 2: The Independent variables in a Logistic Regression should be:

  1. Binary.
  2. Continuous or binary.
  3. Ordinal.
  4. Continuous or metric.
  5. Metric.
A
  1. Continuous or Binary
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Part B - Question 3: What are key differences between ANOVA and logistic regression?

  1. The DV is binary for LR and not for ANOVA.
  2. The DV is binary for both LR and ANOVA.
  3. The DV is continuous for ANOVA and not for LR.
  4. The IV and DVs are binary for both LR and ANOVA
A
  1. The DV is binary for LR and not for ANOVA.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Part B - Question 4: Is it possible to use a continuous variable for use in Logistic regression?

  1. Continuous dependent variables can be used in a regression analyses if the scale includes a 1 and zero in the metric scale.
  2. No, a dependent variable that is continuous can only be used in a multiple regression analyses.
  3. Only if the continuous variable has more than 2 values.
  4. A cut-point can be identified on a continuous variable, and this can be used to form a binary variable.
A
  1. A cut-point can be identified on a continuous variable, and this can be used to form a binary variable.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Part C - Question 1: What is the general shape of a plotted logistic regression formula?

  1. A straight line.
  2. A parabola.
  3. An S shape.
  4. A U shape
A
  1. An S shape.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Part C - Question 2: What is the scale of the DV for the logistic regression model?

A parabolic scale.
A decimal scale.
A hexadecimal scale.
A logarithmic scale

A

A logarithmic scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Part C - Question 3: Is binary logistic regression a linear model?

Yes, it has the function of y=c+mx.
Yes, it has the formula of y=bx+c
Both a and b.
No, it is non-linear function.

A

No, it is non-linear function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Part D - Question 1: Can the approach used for MR model building be used for LR?

No, it can only use, standard model building.
No, it must use no- linear strategies.
Yes, it can use standard, sequential and statistical model building.
Yes, it must use ordinal strategies

A

Yes, it can use standard, sequential and statistical model building.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Part D - Question 2: How many ordinal outcomes can a multinomial logistic regression predict?

One continuous category.
Three or more categories.
One multivariate variable.
Two categories

A

Three or more categories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Part D - Question 3: Which type of LR is the 4th year lecture covering?

Multinomial.
Ordinal.
Binary.

A

Binary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Part D - Question 4: Sequential logistic regression is what?

Binary variables are randomly entered in blocks.
Binary and continuous variables are entered in blocks and pre-specified by the researcher.
Binary variables are entered in blocks and pre-specified by the researcher.
Binary and continuous variables are randomly entered in blocks

A

Binary and continuous (predictor) variables are entered in blocks and pre-specified by the researcher.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Part E - Question 1: What does target variable mean?

This is the independent variable with the most number of categories.
This is the outcome category of the dependent variable that is the focus of the research question.
This is the dependent variable with the least number of categories.
This is the variable that can be used as a independent or dependent variable

A

This is the outcome category of the dependent variable that is the focus of the research question.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Part E - Question 2: What is the reference category in logistic regression analyses?

It is the reference category used to interpret the categories of a categorical variable.
It is the reference for logistic regression.
It is the same as the Target category.
The variable that the research question is focussed on

A

It is the reference category used to interpret the categories of a categorical variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Part E - Question 3: How does SPSS choose the target category for an outcome variable in a logistic regression?

It chooses the highest numeric value.
It chooses the lowest numeric value.
SPSS does not choose the DV target category, the analyst always needs to select this to run the analysis

A

It chooses the highest numeric value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Part E - Question 4: Can a categorical DV have more than one category in logistic regression?

Categorical DVs in a logistic regression must be continuous.
Categorical DVs in logistic regression can only have two values.
Categorical DVS in a logistic regression must be ordinal.
Categorical DVs in a logistic regression can have two or more values.

A

Categorical DVs in logistic regression can only have two values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Part E - Question 5: Can a categorical IV have more than one category?

Categorical IVs in a logistic regression can have two or more values.
Categorical IVS in a logistic regression must be ordinal.
Categorical IVs in a logistic regression must be continuous.
Categorical IV variables in logistic regression can only have two values.

A

Categorical IVs in a logistic regression can have two or more values.

17
Q

Part E - Question 6: For an IV with more than one category, can any category be the reference category?

Yes, any category can be the reference variable.
No, SPSS will force a reference category, and this cannot be changed.
No, the lowest category should be the reference category.
No, the highest category should be the reference category

A

Yes, any category can be the reference variable.

18
Q

Part G - Question 1: When checking the linearity assumption in logistic regression we are doing the following:

There is linear relationship between the independent variables.
There is log linear relationship between continuous independent variables and the dependent variable.
There is a log linear relationship between categorical variables.
There is a linear relationship between all independent variables and the dependent variable

A

There is log linear relationship between continuous independent variables and the dependent variable.

19
Q

Part G - Question 2: What statistic is recommended to interpret model fit and can also be used as pseudo measure of R2?

The Cox and Snell statistic.
The Chi-square classification.
The Wald statistic.
The -2 Log Likelihood

A

The -2 Log Likelihood

20
Q

Part G - Question 3: An odds ratio > 1 means what?

Outcome is more likely for target level of IV.
No difference between groups.
Outcome is less likely for target level of IV

A

Outcome is more likely for target level of IV.

21
Q

Part G - Question 4: An odds ratio < 1 means what?

Outcome is more likely for target level of IV.
Outcome is less likely for target level of IV.
No difference between groups

A

Outcome is less likely for target level of IV.

22
Q

Part G - Question 5: An odds ratio = 1 means what?

No difference between groups.
Outcome is more likely for target level of IV.
Outcome is less likely for target level of IV

A

No difference between groups.

23
Q

Part G - Question 6: The odds ratio is also referred to as what?

Tetrachoric correlation.
Exp(B).
The standard error.
Wald statistic

24
Q

Part G - Question 7: Odds ratios should always be interpreted in the context of what?

Sample size.
Number of correct classifications.
Number of false classifications.
Prevalence

25
In what circumstances should logistic regression be used?
- predicting group membership - DV is binary / categorical - continuous variables can be converted to binary using cut offs - Multiple IVs can be categorical or continuous
26
Outline the differences between logistic regression and multiple regression in terms of - types and number of IVs and DVs
LR IVs - multiple continuous or categorical DVs - single binary or categorical (binomial, multinomial or ordinal) MR IVs - multiple continuous or categorical DVs - single continuous
27
Outline the differences between logistic regression and multiple regression in terms of - approach for entering variables
LR Standard (entered altogether) Sequential (entered in blocks) - theory directed Statistical (forward to backward) MR Standard (forced entry) Hierarchical (entered in blocks) - theory directed Stepwise (statistics based, forward or backward) - only user for exploration
28
Outline the differences between logistic regression and multiple regression in terms of - assumptions
LR - Independence of errors - errors should not be correlated (clustered data) - linearity - IV should have linear relationship with log of the DV - distribution normality - distribution should be normal and outliers should be dealt with (transformed or removed) using Standardised residuals Cook Distance - samples size --> 5 cases per possible combination required - singularity and multicollinearity
29
How does the logistic function differ compared to other functions
``` log function = log(Y/1-Y) = b0 + b1x1 + b2x2 + e log function is an exponential function that is shaped liked an S curve +ve coefficients will increase Y -ve coefficients will decrease Y interpret using Odds Ratio ``` linear function = Y = bx + c linear function is a straight line as DV increase by 1 unit IV increases by b units ``` quadratic function = Y = ax2 + bx + c quadratic function is a parabola +ve a is happy face -ve a is sad face ```
30
When is categorical coding useful?
Useful to deal with categorical variables with 2 or more outcomes
31
What is binomial, multinomial and ordinal categorical coding?
Binomial is 2 (Yes/No) Multinomial is for nominal groups eg brown = 1, blue = 2, green = 3 Ordinal is multinomial moving in a progressive way eg education level achieved 1 = high school, 2 = grad school, 3 = postgrad
32
Which group of DV category should be the referent in binomial logistic regression and how should it be coded?
Referent group should be coded lower and should be the group you that is not the target of interest ie control group
33
For IVs with 2 and 3 categories, how should they be coded?
Binomial categorical IV -SPSS will assign automatically - target should be the group of interest - referent should be the group not of interest Multinomial categorical IVs - can be anyway - needs to make sense - set the referent as the variable you are most interested in so other groups can be compared directly to that one
34
What does the choice of referent group in categorical coding impact?
interpretation of the DV | interpretation of the coefficient o the IVs
35
Name the model overall approaches for logistic regression interpretation
Model Improvement - 2LL change - 2LL proportion (% improvement of model fit) Classification Accuracy % correct % improvement relative to baseline
36
Describe model improvement methods for interpreting LR
``` -2LL change = -2LLbase - -2LLnew = -2LL change (Omnibus test) used for nested models only significant at p =.05 if 1 df > 3.84 ``` -2LL proportion = x2 model / -2LLbase then transform to improvement of model fit
37
Describe classification accuracy methods for interpreting LR
Classification accuracy 1. % correct = the # of correctly predicted to be in one group and not in another group 2. % improvement = the # of correctly predicted relative to if everyone was predicted to be in the category with the most outcomes = hits + correct rejection -nmax / sample n - nmax then transform to % for model improvement over baseline
38
Name and explain the individual predictors of LR interpretation
b weight = change in log odds of Y =1 for 1 unit change in the IV Odd Ratio = change in likelihood of Y = 1 for 1 unit change in IV <1 = less likely, > 1 is more likely, 1 = no difference Significance test - Wald's test
39
Where do you find the log odds and odd ratio in SPSS output?
log odds - b weight = B For every unit increase level of group, the log odds of being in the outcome increase by B units Odd ratio - Exp(B) likelihood of increase relative to referent group