week 5 & 6 Regression Flashcards
What is the primary difference between ANOVA and regression in terms of the types of studies they are used for?
ANOVA is for experimental studies while regression is for observational studies.
What does regression fundamentally begin with?
Regression begins with correlation.
What is the relationship between correlation and causation?
Correlation does not imply causation.
What is the common measure of effect used in both correlation and regression?
The correlation coefficient (r²) which represents the proportion of variance explained.
What can restriction of range on the independent variable lead to regarding the relationship?
It can underestimate the relationship.
How can extreme cases or outliers affect linear models in regression?
Outliers can skew the correlation, inflating or deflating results.
What is a potential issue with using poor or proxy measures in correlation?
It may underestimate the correlation.
What does the strength of effect in regression correspond to in terms of r² values?
r² values indicate the proportion of variance explained, with higher values indicating stronger relationships
A statistical method for predicting the value of one variable from another, using one or more predictors.
Regression
A measure that quantifies the direction and strength of a linear relationship between two variables.
Correlation Coefficient (r)
The portion of variability in a dependent variable that can be attributed to the independent variable(s) in a regression model.
Variance Explained
What is the difference between simple regression and multiple regression?
Simple regression uses one predictor (independent variable), while multiple regression uses two or more predictors.
A data point that differs significantly from other observations and can substantially affect the results of statistical analysis.
Outlier
The variable that is manipulated or varied in an experiment or regression analysis to assess its impact on the dependent variable.
Independent Variable (IV)
Why is it beneficial to use regression over simple correlation?
Regression allows for prediction of the outcome variable while accounting for multiple predictors, enhancing the understanding of variable relationships.
What does a linear relationship express in the context of regression?
A linear relationship is expressed as a straight line.
What is the relationship between a line and a model in regression?
Your line is your model.
What are the two fundamental features that all lines possess in regression analysis?
All lines have a slope and an intercept.
What does the term ‘error’ refer to in the context of regression?
Error refers to the difference between your modeled line and the actual data points.
What does b1 represent in regression analysis?
b1 is the regression coefficient for the predictor and represents the gradient (slope) of the regression line, indicating the direction and strength of the relationship.
What is represented by b0 in a regression equation?
b0 is the intercept, which is the value of Y when X = 0, marking the point where the regression line crosses the Y-axis.
How can one estimate the outcome using multiple predictors in regression?
By entering the value of the predictor, multiplied by the coefficient, and adding the intercept, one can estimate the outcome.
A statistical process for estimating the relationships among variables, allowing for the prediction of one variable based on the values of others.
Regression
The slope of a regression line, represented by b1, indicates the direction and strength of the relationship between the independent and dependent variables.
Slope