Statistics 2 - Regression Flashcards Preview

Statistics > Statistics 2 - Regression > Flashcards

Flashcards in Statistics 2 - Regression Deck (27):

What is the basic linear regression equation?

y = mx + c


What does each variable in the basic linear regression equation stand for?

y = outcome variable
m = slope
x = predictor variable
c = y intercept when x = 0


What is the basic linear regression transformation taught in this course?

y = B0 + B1.x1


What does each element in the transformed linear regression stand for?

y = outcome variable
B0 = intercept
B1 = slope
x1 = predictor variable


How might the slope be defined narratively?

The change in y, for a one unit change in x.


How might the y intercept be defined narratively?

The value of y when x = 0.


When x is a continuous variable, what interpretation does B1 take on?

B1 takes on a mean difference interpretation.

That is, for a mean difference between two subjects of x = 1 unit, the mean difference in y between the subjects is the slope.


What are you calculating when you calculate the confidence interval for slope (B1)?

Confidence interval for mean differences.


What are you calculating when you calculate the confidence interval for the intercept (B0)?

The CI for the mean of a single group.


For what type of data is the CI for the y intercept (B0) useful and for what data is it less useful?

Useful for binary and categorical data.

Less useful for continuous data ie B0 may be a placeholder eg cannot have height of 0cm.


What is the formal title for R^2?

Coefficient of determination.


What does R^2 measure? (Give short and long definitions)

The strength of linear association between x and y.

Amount of variability of data points around the regression line explained by y-values.


What is r and what does it measure?

r = correlation coefficient

Measures strength and direction of relationship between x and y.


What must you do to a log odds result for logistic regression to get an understandable actionable figure?

Exponentiate the log odds result to gain an odds ratio for outcome y using predictor x.


What is effect modification (interaction)?

Effect modification occurs when the relationship between two variables (x and y), depends on the level of a third variable z.


Can you test for Effect Modification through adjustment?

No. Adjustment is used to test for Confounding.


How do you test for Effect Modification?

By comparing separate x-y estimates for different groups or levels of z eg Male vs Female, low vs med vs high


What’s the mnemonic for calculating Risk from logistic regression results?

LETS LOPP branches


What does each letter of the mnemonic for calculating Risk from Logistic Regression results stand for?

LETS LOPP branches
L = log y (OR) calculation for given x-value
E = exponentiate from log to normal
T = transform OR into P (= p/1+p)
S = switch P to %



What does a Spline summarise?

A non-linear relationship (a dog leg relationship).

Spline is ‘split-line’


What is a Lowess plot?

A “LOcally WEighted Scatter plot Smoothing” line through a scatter plot to allow trends and relationships to be visualised.


What is a Hazard Ratio?

A measure reported in TTE analysis summarising the chance of an outcome in a treatment group, compared to the control group.


What is a Propensity Score?

PROBABILITY of being in the EXPOSED group, given certain CONFOUNDER values


What must the predictor be in a PROPENSITY SCORE calculation?

Binary eg Exposed/Unexposed


What can you do with a PROPENSITY SCORE?

Create quartiles of B1.

ie Probability of being Exposed, given population characteristics

Eg Māori ethnicity could be associated with damp home, and hospital admission. But all you’d see is hospital admission and think that ethnicity was the driver.


Subtraction of results of comparison results on the log scale is the same as what on the normal scale?

i.e log odds (A) - log odds (B)



Addition of variables on the log odds scale is the same as what on the normal scale?