Week Eleven Flashcards
(16 cards)
What is Cook’s distance measure?
It shows the effect of each case on the fitted values for all cases in the model. If Cook’s distance is greater than 1 it must be investigated further.
How can we identify outliers?
Via bloxplots, scatter graphs and residual plots after regression.
What is a logarithm?
It explains how many of one number we need to multiply to get another number. It is the opposite of a power.
What do we get if we take a log of y?
We log x - depending on the value of b (base).
What value can base (b) take?
Any value that is positive, but not 1.
What are the three particularly useful values of base?
- Base 2
- Base 10 - also known as the common log, denoted as log(x)
- Base e - an irrational, number denoted as ln(x)
What rule do some researchers use to keep zeros in when conducting a log?
Add one then take a log - ln(1) = 0
What are logarithms useful for?
Compressing skewed variables and to interpret the coefficients in terms of percentage changes in X and Y rather than in units.
What is the Ecological fallacy?
Where we make conclusions about individuals based on analyses of group data.
What is the exceptional/individualistic fallacy?
Where we make conclusions about the group based on exceptional or individual cases.
What is the log-log model?
LogY = log.a ➕ b.logX
Interpret the coefficients as the effect of proportional change in X on a proportionate change in Y.
What is the Lin-log model?
Y = a ➕ b.logX
Effect of proportionate change in X on units of Y.
What is the log-lin model?
LogY = a ➕ bX
The effect of an absolute change in X (in units) on Y in proportionate terms.
How do we overcome the issue of interpreting a dummy variable when the dependent variable has been log-transformed?
We can take an antilog of dummy coefficients, subtract 1 from the variables and mutiny the difference by 100 to get the percentage change in y when a dummy goes from 0 to 1.
What can interaction terms be?
A mixture of continuous and categorical, or just categorical data.
What is hierarchical regression?
The practice of building successive, related linear regression models, each time adding more predictors.