PSCH 443 - Midterm Flashcards
(46 cards)
What is covariance?
the average product of deviation scores; how much two sets of scores vary together. This can be understood as the extent of the variability the two variables share.
What is correlation?
the average product of z-scores; a standardized measure of covariance whose value falls b/w -1 and +1.
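A minimal sketch of both definitions in NumPy, using made-up scores; dividing by N (NumPy's default) keeps the "average product" definitions exact:

```python
import numpy as np

x = np.array([2.0, 4.0, 6.0, 8.0])
y = np.array([1.0, 3.0, 2.0, 5.0])

# Covariance: the average product of deviation scores.
cov = np.mean((x - x.mean()) * (y - y.mean()))

# Correlation: the average product of z-scores.
zx = (x - x.mean()) / x.std()   # np.std defaults to dividing by N
zy = (y - y.mean()) / y.std()
r = np.mean(zx * zy)

print(cov, r)   # r matches np.corrcoef(x, y)[0, 1]
```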
What is slope?
the beta weight; the predicted change in Y-hat for every one-unit increase in X; “steepness of the line.”
What is the intercept?
the constant; the value of Y when X = 0; “elevation of the line.”
What is error variance?
the average of the squared differences b/w the actual values of Y and the values predicted by the model (Y-hat).
What is SSM?
the variation in Y that our model captures.
What is SSRes?
the sum of the squared errors; error/Residual variation.
What is SST?
the total variance in Y; all the variance in Y that could possibly be accounted for, whether by the predictor variable(s) or by anything outside the model.
What is R^2?
the proportion of the variance in Y that is accounted for by the model.
What is F?
a test for the significance of a group of variables (the model as a whole); an index of how probable it is that a result from the model is due to sampling error.
What is MSm?
the variance in Y our model accounts for.
What is MSRes?
error (Residual) variance in the model.
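A minimal sketch tying the last several cards together on made-up data: the sums of squares, R^2, the mean squares, and F for a one-predictor model (SciPy supplies the p-value for F):

```python
import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
n, k = len(y), 1                        # k = number of predictors

b = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)   # slope
a = y.mean() - b * x.mean()                          # intercept
y_hat = a + b * x

ss_t = np.sum((y - y.mean()) ** 2)      # SST: total variation in Y
ss_m = np.sum((y_hat - y.mean()) ** 2)  # SSM: variation the model captures
ss_res = np.sum((y - y_hat) ** 2)       # SSRes: leftover (error) variation

r2 = ss_m / ss_t                        # R^2: proportion of variance explained
ms_m = ss_m / k                         # MSM: model variance per df
ms_res = ss_res / (n - k - 1)           # MSRes: error variance per df
F = ms_m / ms_res
p = stats.f.sf(F, k, n - k - 1)         # probability of F under the null
print(r2, F, p)
```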
What are brute-force methods of parameter estimation?
Gradient descent/brute force:
- Start with a viable parameter value.
- Calculate the error using a slightly different value.
- Continue moving the best-guess parameter value in the direction of the smallest error.
- Repeat this process until the error is as small as it can be.
Effectively, this amounts to plugging in values until the smallest error is located.
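A minimal sketch of the idea on made-up data; the learning rate and iteration count are arbitrary illustrative choices, not values from the course:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.0, 5.0, 4.0, 6.0])

a, b = 0.0, 0.0   # start with viable parameter values
lr = 0.01         # how far to move each step

for _ in range(5000):
    err = y - (a + b * x)   # error at the current guess
    # Nudge each parameter in the direction that shrinks squared error.
    a += lr * 2 * err.mean()
    b += lr * 2 * (err * x).mean()

print(a, b)   # converges toward the least-squares line
```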
What is least-squares estimation?
Uses beta values to create a line of best fit from the sample data; seeks to minimize the difference b/w the predicted and actual values of Y. In linear least-squares estimation, this line takes the form:
Ŷ = a + bX, where Ŷ is the predicted value of Y, a is the constant (intercept), and b is the slope.
- the goal is to minimize error variance.
- uses the correlation b/w X and Y, the standard deviations of X and Y, and the means of X and Y to calculate the least-squares estimates.
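A minimal sketch of that recipe on made-up data: the slope comes from r and the two SDs, and the intercept from the means:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.0, 5.0, 4.0, 6.0])

r = np.corrcoef(x, y)[0, 1]
b = r * (y.std(ddof=1) / x.std(ddof=1))  # slope
a = y.mean() - b * x.mean()              # intercept (constant)
print(a, b)                              # the line Y-hat = a + b*X
```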
What is partial correlation?
Looks at the relationship between X and Y while holding a third variable (Z) constant; partials the covariance of Z out of both X and Y, setting aside that shared variability to examine the specific relationship b/w those two variables.
- serves as a means of controlling for the variance that Z shares with X and Y
- does not provide as clear a picture of how the model does as a whole
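A minimal sketch of a first-order partial correlation computed from the three pairwise correlations; the r values are made up:

```python
import numpy as np

rxy, rxz, ryz = 0.50, 0.40, 0.30   # made-up pairwise correlations

# Partial out Z's covariance from BOTH X and Y.
r_xy_z = (rxy - rxz * ryz) / np.sqrt((1 - rxz**2) * (1 - ryz**2))
print(r_xy_z)
```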
What is semipartial correlation?
Indicates the relationship between X and Y after removing the covariation of X with Z from X only (Y is left whole). This is the foundation of multiple regression.
- allows us to examine the unique effect of X on the whole of Y, while holding a third variable (Z) constant
- we can assess the unique contribution of X relative to the whole of Y (i.e., the unique percentage of total variance in Y accounted for by X, holding the effects of any other variable(s) constant)
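A minimal sketch of the matching semipartial correlation, reusing the same made-up r values; only the denominator changes, since Z is removed from X alone:

```python
import numpy as np

rxy, rxz, ryz = 0.50, 0.40, 0.30   # same made-up values as above

# Remove Z from X only; Y's total variance stays intact.
sr = (rxy - rxz * ryz) / np.sqrt(1 - rxz**2)
print(sr, sr**2)   # sr^2: unique share of Y's TOTAL variance due to X
```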
What is multiple regression?
The ultimate goal is to model the influence of several predictors on an outcome variable. This model should account for:
- unique overlap of each predictor with the outcome
- degree of overlap between predictors
- extent to which the overlap between predictors overlaps with the outcome
- the overall degree to which the predictors explain the variability in the outcome (see the sketch after the next card).
What is multiple correlation?
A measure of how well a given variable can be predicted using a linear function of a set of other variables; the correlation between the variable’s values and the best predictions that can be computed linearly from the predictive variables.
- takes on a value b/w 0 and 1 (unlike simple correlation, it cannot be negative)
- the correlation of Y and Ŷ
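A minimal sketch covering this card and the multiple regression card above, on made-up data: fit a two-predictor model by least squares, then take multiple R as the correlation of Y with Ŷ:

```python
import numpy as np

x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
y  = np.array([3.0, 2.5, 6.0, 4.5, 9.0, 8.5])

X = np.column_stack([np.ones_like(x1), x1, x2])   # constant + predictors
coefs, *_ = np.linalg.lstsq(X, y, rcond=None)     # least-squares b weights
y_hat = X @ coefs                                 # model's best predictions

R = np.corrcoef(y, y_hat)[0, 1]   # multiple R: correlation of Y and Y-hat
print(coefs, R, R**2)
```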
What are the basics of ANOVA?
Statistical models and their associated procedures used to analyze the differences among/between group means (i.e., the “variation” among and between groups).
An F statistic with p < .05 is considered significant; if not, we fail to reject the null. ANOVA partitions the variance in the outcome across the different group means in the model.
Explain the interpretation of regression coefficients (both b and beta).
b = Unstandardized Coefficients; the weight expressed in the raw units from the study itself.
Ex. Interpretation: “For every point gained on the GRE, one shaves .002 years (less than a day) off of their completion time.”
Beta = Standardized Coefficients; the weight expressed in SD units from the mean.
Ex. Interpretation: “For every standard deviation increase in GRE score, we predict a .201 standard deviation reduction in years to complete the MS.”
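A minimal sketch of how the two coefficient types relate, on made-up data: beta is b rescaled into SD units (for a single predictor, beta equals r):

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.0, 5.0, 4.0, 6.0])

b = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)  # unstandardized b
beta = b * (x.std(ddof=1) / y.std(ddof=1))          # rescaled to SD units
print(b, beta)   # beta equals r for a one-predictor model
```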
SE (standard error) in regression
The measure of sampling error associated with each coefficient or predictor variable.
- average amount we would expect each parameter to vary if we were to take repeated samples
- affected by sample size; larger sample size means less error
- ideally, standard error would be small (see the sketch after the next card)
Significance testing (t-tests) for regression coefficients
The logic is based on whether the regression line has any slope at all: if the slope is zero in the population, no effect exists. It uses two steps:
- Tests each regression parameter against an expected value of zero for b in the population
- If p < .05, we reject the null and conclude the slope (and thus the effect) differs reliably from zero
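A minimal sketch covering this card and the SE card above, on made-up data: SE(b) from the standard simple-regression formula, then t = b / SE(b) and its two-tailed p:

```python
import numpy as np
from scipy import stats

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
n = len(y)

b = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)   # slope
a = y.mean() - b * x.mean()                          # intercept
ms_res = np.sum((y - (a + b * x)) ** 2) / (n - 2)    # error variance

se_b = np.sqrt(ms_res / np.sum((x - x.mean()) ** 2)) # shrinks as n grows
t = (b - 0) / se_b                                   # step 1: test b against zero
p = 2 * stats.t.sf(abs(t), df=n - 2)                 # step 2: two-tailed p-value
print(se_b, t, p)
```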
What is multicollinearity?
Exists when there is a strong correlation b/w two or more predictors. Problems include:
- increased error in our parameters
- limits the size of R (and by extension R^2) we can observe
- increased difficulty in assessing the importance of predictors
If multicollinearity occurs, we can drop the variable, combine it w/ the variable it correlates w/ to test whether they measure the same thing, or leave it in the model provided it poses no huge issues w/ the data.
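One common diagnostic the card does not name is the variance inflation factor (VIF); a minimal sketch on made-up predictors, where VIF = 1 / (1 − R^2) from regressing one predictor on the other(s):

```python
import numpy as np

x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([1.2, 2.1, 2.9, 4.2, 5.1, 5.8])   # nearly a copy of x1

X = np.column_stack([np.ones_like(x2), x2])
coefs, *_ = np.linalg.lstsq(X, x1, rcond=None)   # regress x1 on x2
x1_hat = X @ coefs
r2 = 1 - np.sum((x1 - x1_hat) ** 2) / np.sum((x1 - x1.mean()) ** 2)

vif = 1 / (1 - r2)
print(vif)   # large VIFs (often > 10) flag strong predictor overlap
```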
Explain outliers.
Outliers are extreme cases on one or more of our variables; can have too much of an influence on parameter estimates and regression solution.
- univariate outliers are extreme on one variable
- multivariate outliers are extreme on combinations of variables
Outliers have the following negative impacts on the regression model:
- less normality
- skewed distributions
- results less likely to generalize to population
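A minimal sketch of flagging a univariate outlier with z-scores on made-up data; the |z| > 2 cutoff is a common rule of thumb, not a value from the card:

```python
import numpy as np

y = np.array([2.0, 3.0, 2.5, 3.1, 2.8, 14.0])   # one extreme case

z = (y - y.mean()) / y.std(ddof=1)
print(np.abs(z) > 2)   # flags the extreme case
```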