Midterm Flashcards

Question

How will the inter-item correlation matrix differ for unidimensional and multidimensional pools of items?

Answer 1

- In a multidimensional pool of items, the inter-item correlations may vary considerably, whereas in a unidimensional pool they cluster around the average inter-item correlation.

Answer 2

- examines whether the relationships between the new measure and important test and non-test criteria are congruent with one’s theoretical understanding of the target construct and its position with respect to other similar and dissimilar constructs (called the nomological net). - the structural validity phase involves analyses of items WITHIN the new measure.

Answer 3

the extent to which a measure correlates with other measures of the same construct

Answer 4

the extent that a measure does not correlate with measures of other constructs.

Answer 5

The extent to which a measure can predict a criterion occurring in the future.

Answer 6

Relating a measure to criterion evidence collected at the same time as the measure itself.

Answer 7

- Measurement = the process of building models that represent phenomena of interest, typically in quantitative form - Error will always occur - This error occurs not just in psychology: in the amniotic fluid example, two doctors could reach different decisions - Almost all measures struggle with validity and reliability, as no measure will be perfectly reliable and perfectly valid

Answer 8

- All measurement models will eventually be proven wrong because other models will be developed that supersede them - Measurement models must be specified explicitly so that they can be evaluated, disconfirmed, and improved - Comparative model testing is one of the best ways to determine which model is the “least wrong.”

Answer 9

1) Observed Score = True Score + Error - The true score is the score each person would obtain if there were no error. It also represents the population parameter. - Error is all the variation in the circumstances of measurement that is not related to what is being measured. 2) Reliability = the consistency of a measurement procedure and the extent to which scores produced by the measure are replicable. It is the ratio of the true score variance to the observed score variance. A reliability of 0.7 or higher is usually considered sufficient, but the required level depends on the circumstances of the study (e.g., a study of suicide).
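
The two definitions on this card can be checked with a small simulation. This is a minimal sketch, not part of the original card: the function names, the normal distributions, and the standard deviations are illustrative assumptions, and error is assumed to be independent of the true score.

```python
import random
import statistics


def theoretical_reliability(true_sd, error_sd):
    """Reliability as true-score variance over observed-score variance.

    Assumes error is independent of the true score, so the observed
    variance is the sum of the two variances.
    """
    return true_sd ** 2 / (true_sd ** 2 + error_sd ** 2)


def simulated_reliability(n_people=5000, true_sd=1.0, error_sd=0.5, seed=0):
    """Simulate Observed = True + Error and return var(true) / var(observed)."""
    rng = random.Random(seed)
    true_scores = [rng.gauss(0.0, true_sd) for _ in range(n_people)]
    observed = [t + rng.gauss(0.0, error_sd) for t in true_scores]
    return statistics.pvariance(true_scores) / statistics.pvariance(observed)
```

With these assumed values the theoretical ratio is 1 / 1.25 = 0.8, and the simulated ratio should land close to it.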

Answer 10

- Constructs are hard to operationalize if the “meter stick” isn’t measuring the construct consistently - The true correlation between a measure and the construct it measures may be underestimated (attenuated) - Small sample sizes combined with low reliability make it difficult to detect an effect - Small true correlations combined with low reliability also make it difficult to detect a difference - Effect sizes can be severely underestimated

Answer 11

- Correction for attenuation is a statistical procedure in which reliability indices are used to correct observed correlations that are underestimated because of unreliability. - It should be performed so that researchers do not underestimate the strength of the relationship between two variables, which would otherwise result in lower effect sizes and possibly non-significant findings when an effect does exist.
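
As a worked illustration, the classic correction divides the observed correlation by the square root of the product of the two reliabilities. The sketch below assumes that standard (Spearman) formula; the function name and the example values are hypothetical and not taken from the card.

```python
import math


def disattenuate(r_observed, reliability_x, reliability_y):
    """Estimate the correlation between true scores.

    r_observed: observed correlation between measures X and Y
    reliability_x, reliability_y: reliability coefficients in (0, 1]
    """
    if not (0 < reliability_x <= 1 and 0 < reliability_y <= 1):
        raise ValueError("reliabilities must be in (0, 1]")
    return r_observed / math.sqrt(reliability_x * reliability_y)
```

For example, an observed correlation of .30 with reliabilities of .70 and .80 corresponds to a corrected correlation of about .40. With perfect reliability on both measures the correlation is left unchanged.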

Answer 12

- Permits a decision maker to pinpoint the sources of measurement error and quantify them - The idea is to obtain a certain level of generalizability, which is the extent to which a score is interchangeable with other scores, e.g., similar numbers are interpreted the same way (amniotic fluid example) - G-theory disentangles multiple sources of error rather than relying on the single broad error term that CTT provides - G-theory allows researchers to see exactly where the error is coming from and possibly reduce these problems - It taps into every facet that may influence reliability

Answer 13

1) Internal Consistency - How strongly the items in a measure correlate with one another, e.g., split-half reliability 2) Interrater - How much scores vary across different raters or judges 3) Test-Retest - How much responses vary across time, e.g., IQ test results remain relatively stable across multiple sessions

Answer 14

- the average of all possible split-half reliabilities (split-half = randomly split the item pool into two groups and compute the correlation between the two halves) - it is not a pure index of internal consistency because alpha levels are not a good reflection of the data. Internal consistency is problematic, and it is best to look at homogeneity.
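
In practice alpha is usually computed from item variances rather than by enumerating every split. The sketch below assumes the standard variance-based formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name and the data layout (one list of item scores per respondent) are illustrative choices.

```python
import statistics


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(item_scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item across respondents.
    item_variances = [
        statistics.variance([row[i] for row in item_scores]) for i in range(k)
    ]
    # Variance of the total (summed) score across respondents.
    total_variance = statistics.variance([sum(row) for row in item_scores])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)
```

When every item ranks respondents identically, as in `[[1, 1, 1], [2, 2, 2], [3, 3, 3]]`, the function returns 1.0; less consistent items pull the value down.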

Answer 15

A trait that underlies and directly influences an individual’s behaviors and responses.

Answer 16

A mathematical function describing the relationship between where an individual falls on the continuum of a given construct and the probability that they will give a particular response to a scale item designed to measure that construct.

Answer 17

A parameter indicating how well an item can discriminate between people high on a trait and people low on a trait. (The steepness of the line)

Answer 18

The latent level that corresponds to a 50% chance of getting the item correct or endorsing the item. (Where the inflection point occurs)
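
The item characteristic curve, discrimination, and difficulty described on the last three cards can be sketched with a two-parameter logistic (2PL) model. The choice of the 2PL form is an assumption here, since the cards do not name a specific model; `a` stands for discrimination and `b` for difficulty.

```python
import math


def icc_2pl(theta, a, b):
    """Probability of a correct or endorsed response under a 2PL model.

    theta: person's level on the latent trait
    a: discrimination (steepness of the curve)
    b: difficulty (trait level where the probability is 50%)
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))
```

At `theta == b` the probability is exactly .50, which is the inflection point; a larger `a` makes the curve rise more steeply around that point.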

Answer 19

- should have a steep slope and have items with inflection points that hit every level of the latent trait continuum (meaning that the scale tests every part of the latent trait continuum, from easy questions to middle questions to difficult questions, i.e., all levels of difficulty) - every item would effectively discriminate individuals who score high or low on the latent trait with fairly good accuracy. - If graphed using item information functions, the items should have high peaks and hit every level of the latent trait continuum. High peaks indicate that an item provides a lot of information, thus making it easier to differentiate between individuals high and low on a trait.

Answer 20

A graphical curve that displays the amount of discrimination between respondents that an item provides across the latent trait continuum.

Answer 21

The sum of all item information functions. Shows the precision of the scale across all levels of the latent trait continuum. - It is useful because it allows the linking of different scales. This method allows researchers to link different measures across ages, cultures, genders, etc.

Answer 22

How informative a test item is across all levels of the latent trait continuum. - This is useful because it can expose items that are redundant or are not operating properly and thus are not needed. It can also identify problems with item response options (i.e., dichotomous vs polytomous)

Answer 23

- Response option curves can be generated to assess how well each response option is discriminated from the others. If the curves overlap, this can mean that the labelling of the response options was confusing or that individuals couldn’t tell the difference between, say, a 2 and a 3 on a 5-point scale. - Test information and reliability functions can be generated to test for redundancy in a scale. If two items have similar curves or overlap each other, then the researcher will know that those two items are providing similar information and thus one of them can be eliminated to make the scale more efficient.

Answer 24

occurs when individuals from the different groups, who are at the same points on the latent trait continuum, obtain different scores on the item.

Answer 25

occurs when individuals from the different groups, who are at the same points on the latent trait continuum, obtain different scores on the overall test. -It is important to note that item biases often cancel each other out, so researchers aren’t as concerned by them as test biases that affect the overall score.

Answer 26

a form of computer-based test that adapts to the examinee's ability level. - When an individual begins a test, the computer starts with a question of average difficulty; whether the individual gets the question right or wrong determines which question the computer chooses next. A wrong answer leads the computer to choose an easier question, and a right answer leads it to choose a harder question.

Answer 27

- enables the computer to home in on the individual’s ability level quite accurately - Once the individual has reached a level at which they can consistently answer questions, and they can’t answer questions of higher difficulty, the computer stops the test and provides an estimated ability level. - Due to anxiety or other influences, some individuals may jump along the latent trait continuum quite frequently - the computer must then administer more questions until the individual’s responses fall into a more consistent pattern and an ability level can be estimated - CAT allows researchers to obtain accurate estimates of an individual’s place on the latent trait continuum using relatively few questions

Answer 28

- Regression is the process of fitting a model (a line) to a set of data and using it to predict an outcome variable (DV) from one predictor variable (IV) in simple regression or from multiple predictor variables (IV’s) in multiple regression - Outcome = Model + Error - The regression line is computed using the least squares method. Of all possible lines, this method selects the one for which the sum of the squared differences between the observed points and the line (the residuals) is at its minimum. In other words, the line of best fit is the one with the lowest error term. - The slope of the regression line is the change in the outcome associated with a unit change in the predictor variable. - The intercept of the regression line is the predicted value of the outcome variable when the predictor is at zero.
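
For simple regression the least squares line has a closed-form solution, sketched below. The function name and the toy data in the usage note are illustrative; the formulas are the standard ones (slope = sum of cross products / sum of squares of the predictor, intercept = mean of Y minus slope times mean of X).

```python
def fit_line(x, y):
    """Return (intercept, slope) of the least squares line predicting y from x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross products and sum of squares of the predictor.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope
```

For the points (0, 1), (1, 3), (2, 5), (3, 7) this gives an intercept of 1 (the predicted outcome when the predictor is zero) and a slope of 2 (the change in the outcome per unit change in the predictor).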

Answer 29

- assessed using the least squares method, in which the sum of the squared differences between each point and the line (the residuals) is at a minimum - done by first calculating the differences between the observed values and the mean, which serves as a baseline model. This model (sum of squares total) shows how good the mean is as a model of the observed data - Next, we fit a line of best fit to the data (a regression line) and compare that line to the observed values. - This model (sum of squares residual) represents the degree of inaccuracy (the residuals) when the best model is fitted to the data - To see how much the regression line improves on the mean, the difference between the two is calculated (sum of squares total - sum of squares residual = sum of squares model) - a large sum of squares model relative to the sum of squares total means the regression line substantially reduces the inaccuracy of the mean model and predicts the outcome variable from the predictors well.
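
The three sums of squares can be computed directly. This is a minimal sketch for simple regression; the function name is hypothetical, and the returned ratio of model to total sum of squares is the usual R2, which the card does not name explicitly.

```python
def sums_of_squares(x, y):
    """Return (ss_total, ss_residual, ss_model, r_squared) for simple regression."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    slope = (
        sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
        / sum((xi - mean_x) ** 2 for xi in x)
    )
    intercept = mean_y - slope * mean_x
    # Error when the mean is used as the model.
    ss_total = sum((yi - mean_y) ** 2 for yi in y)
    # Error left over when the regression line is used as the model.
    ss_residual = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    # Improvement from using the line instead of the mean.
    ss_model = ss_total - ss_residual
    return ss_total, ss_residual, ss_model, ss_model / ss_total
```

For x = 1, 2, 3, 4 and y = 2, 4, 5, 8 the total is 18.75, the residual is 0.70, and the model sum of squares is 18.05, so the line removes most of the error of the mean model.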

Answer 30

- Typically a straight line; raw slopes express the variables in their raw units - Relatively simple and easy to use; however, raw slopes cannot be meaningfully compared across predictors because they are in different metrics - For example, education may be measured in years and income in $1000 units; the slope of the regression line would then describe something like this: for each additional raw unit of education (one year), projected earnings would increase by $1000.

Answer 31

- Typically represented by a normal curve; standardized slopes express the variables in common units, z-scores. - They allow researchers to compare residuals across different models, assess significance, and make meaningful comparisons, so researchers can determine whether one predictor had more of an effect than another predictor in a multiple regression analysis. - For example, for each one standard deviation increase in education, projected earnings would increase by .40 standard deviations of income.
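
A raw slope can be converted to a standardized slope by rescaling it with the two standard deviations (beta = b * SD of predictor / SD of outcome). The sketch below assumes simple regression, where the standardized slope equals the Pearson correlation; the function name is illustrative.

```python
import statistics


def standardized_slope(x, y):
    """Return (raw_slope, standardized_slope) for predicting y from x."""
    mean_x = statistics.mean(x)
    mean_y = statistics.mean(y)
    raw = (
        sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
        / sum((xi - mean_x) ** 2 for xi in x)
    )
    # Rescale from raw units to standard deviation units.
    beta = raw * statistics.stdev(x) / statistics.stdev(y)
    return raw, beta
```

The raw slope changes if either variable is rescaled (years to months, dollars to thousands), while the standardized slope stays the same, which is what makes it comparable across predictors.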

Answer 32

a method in which all predictors are forced into the model simultaneously. No prior decisions are made about the order in which they are entered; there is only theoretical reasoning for including the predictors. It is the simplest procedure; however, as the number of predictor variables increases, the results can become hard to interpret (especially since overlapping predictors are common in psychology)

Answer 33

a method in which predictors are selected based on past work and are entered into the model by order of importance in predicting an outcome. This method is used most often.

Answer 34

a stepwise method in which the computer searches for the best predictor of the outcome variable out of all possible predictors and adds it to the model - Then the computer selects the next best predictor, and so on, until the addition of a new predictor doesn’t improve the model and the computer stops - R does this based on the Akaike information criterion (AIC). A lower AIC indicates a better model; therefore, predictors are kept as long as they lower the AIC.

Answer 35

you will trim your predictors down to fit that specific sample, which makes the results hard to replicate and generalize. Generalization is key if you want to conclude anything significant from your study; stepwise methods tailor the model to the sample at hand, so it may not replicate in other samples - If results cannot generalize, they do not add anything to research or to the growth of psychological theories.

Answer 36

standardized residuals are the differences between each point and the regression line converted into z-scores (the residual divided by its standard error). - This is important in interpreting outliers and error because it provides researchers with a universal cut-off for what constitutes an acceptable value. If more than 5% of the points fall beyond 1.96 or -1.96, the model is a poor representation of the data, assuming normality.
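
A minimal sketch of the 5% check for simple regression. It divides each residual by the residual standard deviation, sqrt(SS residual / (n - 2)), which is a simplifying assumption: statistical packages often also adjust for each case's leverage. Function names are hypothetical.

```python
import math


def standardized_residuals(x, y):
    """Residuals from a simple regression, divided by the residual standard deviation."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    slope = (
        sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
        / sum((a - mean_x) ** 2 for a in x)
    )
    intercept = mean_y - slope * mean_x
    residuals = [b - (intercept + slope * a) for a, b in zip(x, y)]
    residual_sd = math.sqrt(sum(r ** 2 for r in residuals) / (n - 2))
    return [r / residual_sd for r in residuals]


def share_beyond(z_scores, cutoff=1.96):
    """Proportion of standardized residuals whose absolute value exceeds the cutoff."""
    return sum(abs(z) > cutoff for z in z_scores) / len(z_scores)
```

If `share_beyond(...)` returns more than 0.05, the rule of thumb on this card would flag the model as a poor representation of the data.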

Answer 37

DFFIT is the difference between the predicted value for a case when the model is calculated including that case and when it is calculated excluding that case. This requires an adjusted predicted value, which is computed for each case by refitting the model without that case. If a case is not influential, its DFFIT value will be zero. We don’t want any case to have a large influence; all DFFIT values should be relatively small.

Answer 38

DFBETA is the difference between a parameter estimated using all cases and the same parameter estimated when one case is excluded. This technique allows researchers to identify cases that have a particularly large influence on the parameters of the regression model (influential cases, which are often outliers).
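
Both DFFIT and DFBETA come from the same leave-one-out idea, sketched here for simple regression with unstandardized values. The helper names are hypothetical, and `x` and `y` are assumed to be plain Python lists.

```python
def fit_line(x, y):
    """Return (intercept, slope) of the least squares line."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    slope = (
        sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
        / sum((a - mean_x) ** 2 for a in x)
    )
    return mean_y - slope * mean_x, slope


def leave_one_out_diagnostics(x, y):
    """Return a list of (dfbeta_slope, dffit) pairs, one per case."""
    b0, b1 = fit_line(x, y)
    results = []
    for i in range(len(x)):
        # Refit the model with case i removed.
        b0_i, b1_i = fit_line(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        dfbeta_slope = b1 - b1_i
        dffit = (b0 + b1 * x[i]) - (b0_i + b1_i * x[i])
        results.append((dfbeta_slope, dffit))
    return results
```

With x = 0, 1, 2, 3, 4 and y = 0, 1, 2, 3, 10, the last case has by far the largest values (a slope change of 1.2 and a DFFIT of 3.6), marking it as the influential case.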

Answer 39

- Regression can be used with continuous or categorical (e.g., dichotomous) predictors, whereas ANOVA works only with categorical predictors.

Answer 40

- If predictors are correlated with external variables, this will be problematic when interpreting your results. An external variable could act as a third variable that significantly influenced your results. You therefore won't be confident that changes in your outcome variable were due to your predictors, because an uncontrolled third variable could have been influencing the results.

Answer 41

- indicates the loss of predictive power: how much variance in Y would be accounted for if the model had been derived from the population from which the sample was taken. - It is never higher than the R2 value. - A small sample size and lots of predictors will cause the adjusted R2 value to decrease. If this happens, the line of best fit may be overfitting the sample values, thus overestimating the effect size
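
One common way to compute the adjustment is shown below. It assumes the widely used Wherry-type formula, 1 - (1 - R2)(n - 1)/(n - k - 1); the card does not say which formula is intended, and other versions (such as Stein's) exist.

```python
def adjusted_r2(r2, n_cases, n_predictors):
    """Adjusted R2 using the common (n - 1) / (n - k - 1) correction."""
    df_residual = n_cases - n_predictors - 1
    if df_residual <= 0:
        raise ValueError("need more cases than predictors plus one")
    return 1 - (1 - r2) * (n_cases - 1) / df_residual
```

With R2 = .50 and 21 cases, two predictors give an adjusted value of about .44, while ten predictors give 0, showing how small samples with many predictors shrink the estimate.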

Answer 42

- Randomly splitting a data set, computing a regression equation on both halves of the data and then comparing the resulting models. - The regression line should fit both halves similarly.

Answer 43

The general rule is about 15 cases per predictor, but this depends on the size of the effect we are trying to detect and how much power we need to detect it. To figure this out, a power analysis must be run to determine an appropriate sample size. So the main factors are the number of predictors, the size of the effect expected, and the power needed to detect an effect.

Answer 44

As collinearity increases, so do the standard errors of the b coefficients. Big standard errors for b coefficients mean that these b’s are more variable across samples and less likely to represent the population.

Answer 45

When two predictors are correlated, they may account for the same shared variance in an outcome. Thus having both may account for little more variance than having just one. If predictors are uncorrelated, then we can be sure they are accounting for different portions of the total variance.

Answer 46

Multicollinearity makes it difficult to assess the individual importance of a predictor. If predictors are highly correlated and account for similar variance we can’t be sure which predictor is the important one in predicting the outcome.

Answer 47

-To assess instances of collinearity or multicollinearity. These correlations can bias our regression line and produce results that may not be entirely correct (imprecise b values, conservative significance tests); therefore, these problems should be dealt with before the results are evaluated.

Answer 48

- R2 change is the change in R2 from model one to model two. In model two, several predictors are added; the R2 change value gives us the difference between the models, so we know whether more of the variance is accounted for in the second model through the addition of new predictors. This change can then be tested for significance using an F test (ANOVA).
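
The significance test for R2 change can be sketched as an F ratio. This assumes the standard formula for nested models, F = (R2 change / number of added predictors) / ((1 - R2 of the larger model) / (n - k - 1)); the function name and argument names are hypothetical.

```python
def r2_change_f(r2_small, r2_large, n_cases, k_small, k_large):
    """Return (F, df1, df2) for the change in R2 between two nested models.

    k_small and k_large are the numbers of predictors in each model.
    """
    df1 = k_large - k_small
    df2 = n_cases - k_large - 1
    f_ratio = ((r2_large - r2_small) / df1) / ((1 - r2_large) / df2)
    return f_ratio, df1, df2
```

For example, going from R2 = .20 with one predictor to R2 = .30 with three predictors in a sample of 103 gives F(2, 99) of roughly 7.07, which would then be compared against the F distribution.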

Answer 49

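- If we were to replicate this study many times on different random samples and compute a 95% confidence interval each time, about 95% of those intervals would contain the true (population) slope of the regression line. The lower and upper limits from our sample give a plausible range for that slope.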

Answer 50

If two cases in our sample are correlated (such as two sisters taking part in a study), it can lead to imprecise b coefficients and an overestimated effect size. This means that your results are not very meaningful, as they are biased and may not generalize to the larger population.

Answer 51

An outlier in a regression analysis can change the slope of the regression line such that it is no longer accurate or meaningful. It may also increase residual variance, thus increasing the chance of making a type II error. The best approach is to compute the regression line again without the outlier and see how much the slope changes; if the change is substantial, it is best to drop the outlier and mention it in your results section.

Answer 52

if a data set doesn’t follow a normal distribution then the slope of the regression line may be biased and affected by outliers. Thus results will not be an accurate depiction of the larger population. To fix this a robust regression method should be used, such as bootstrapping.

Answer 53

- Regression analyses are often reported using tables. These tables will include the b coefficients, the standard errors of the b coefficients, the beta weights, the change in R2, and the p values of the significance tests for all predictors in each model. It is also good practice to report the confidence intervals for each model. Some researchers also include correlation coefficients to indicate the absence of collinearity.

Answer 54

the correlation between two variables in which the effects of the other variables are held constant. In other words, the amount of variance two variables share when the variance they share with a third variable is controlled for. - Can be used for dichotomous or continuous variables. - The third variable shares variance with BOTH of the variables. - Best used when you want to look at the relationship between two variables when a third variable or possibly confounding variable is controlled for.

Answer 55

the correlation between two variables when the effect of the third variable is controlled for in only ONE of them. - The third variable is pulled out of the IV, not the DV - Best used when trying to explain the variance in one particular predictor from a set of predictors.
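
The difference between the partial and semi-partial correlations on the last two cards shows up in their denominators. The sketch below uses the standard first-order formulas computed from the three pairwise correlations; `x` plays the role of the IV, `y` the DV, and `z` the third variable, and all names are illustrative.

```python
import math


def partial_correlation(r_xy, r_xz, r_yz):
    """Correlation of x and y with z removed from BOTH variables."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


def semipartial_correlation(r_xy, r_xz, r_yz):
    """Correlation of x and y with z removed from x (the IV) only."""
    return (r_xy - r_xz * r_yz) / math.sqrt(1 - r_xz ** 2)
```

With r_xy = .50, r_xz = .30, and r_yz = .40, the partial correlation is about .43 and the semi-partial about .40. If z is unrelated to both variables, both formulas simply return r_xy.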

Answer 56

- The problem with using categorical predictors in regression is that in most cases a predictor will have more than two categories (e.g., religion) - Dummy coding = a way of representing groups of people using only zeros and ones. The number of dummy variables needed is the number of groups minus one, as the baseline group takes a zero on all dummy variables. The dummy variables are set up as contrasts, with the placement of the 1 indicating which group is which (e.g., 0100 or 1000) - Once the contrasts are set up, a multiple regression analysis can be computed.
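
A minimal sketch of dummy coding for one categorical predictor. The function name and the example group labels are hypothetical; the baseline group is the one that receives zeros on every dummy variable.

```python
def dummy_code(values, baseline):
    """Return (dummy_names, rows) with one 0/1 column per non-baseline group."""
    if baseline not in values:
        raise ValueError("baseline must be one of the observed groups")
    # One dummy variable per group, excluding the baseline.
    dummy_names = sorted(set(values) - {baseline})
    rows = [[1 if value == name else 0 for name in dummy_names] for value in values]
    return dummy_names, rows
```

For example, `dummy_code(["none", "rock", "pop", "none"], baseline="none")` yields two dummy variables (`pop` and `rock`) for three groups, and the two `none` cases are coded `[0, 0]`.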

Answer 57

- the beta value represents the change in the outcome score as a person moves from the baseline group to the group coded 1. Ex. Hygiene scores go down (a person becomes smellier) as a person changes from listening to no music to listening to rock music

Answer 58

- a variable that alters the direction or strength of the relationship between a predictor and an outcome. Essentially an interaction effect, in which the effect of one variable depends on the level of another variable - Represents “for whom” a variable most strongly predicts or causes an outcome variable, affecting the magnitude and the direction of the relationship. - Can be categorical or continuous. - Often sought after the fact: when a weak effect is found, researchers often search for interactions that could explain the result

Answer 59

- In regression, moderator variables are often treated as another IV. To represent the interaction between the moderator variable and the predictor, product terms are formed by multiplying the moderator term by the predictor term, using the newly coded dummy variables. A product term is created for each level of the moderator variable. Once all of the product terms have been created, a hierarchical multiple regression approach can be used to get an F statistic that determines whether a significant amount of the variance in the outcome was accounted for by the product terms. A significant result means that an interaction has occurred, and a graph must be generated to see what the interaction looks like. - Enhancing interactions = more effect in one group than the other - Buffering interactions = as one group goes up the other goes down, zero effect size

Answer 60

The variable that explains the relationship between a predictor and an outcome. The mechanisms through which a predictor influences an outcome variable. - Causal mechanism - IV1 predicts DV because of IV2; therefore, IV1 and DV may not be related at all if IV2 accounts for the total variance in IV1. - Often sought after when we find a relationship and want to explain why it occurs. - Represents, “Why and how” one variable predicts or causes an outcome.

Answer 61

- ANOVA’s can be used to test for significant moderator relationships, but only when the IV and the moderator are categorical. However, if they are continuous, many researchers perform a median split and separate the continuum into portions so that they can use an ANOVA instead of regression methods.

Answer 62

One or more categorical IV’s and 2 or more continuous DV’s

Answer 63

MANOVA’s protect against familywise error: the chance of making a type I error inflates as more ANOVA’s are run on the data. In addition, MANOVA’s provide more information about the relationship between the dependent variables; therefore, a MANOVA can inform us whether groups can be distinguished by a combination of scores on several dependent measures.
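
How quickly the error rate inflates can be illustrated with the usual formula 1 - (1 - alpha)^m. This sketch assumes the m tests are independent, which is a simplification; the function name is hypothetical.

```python
def familywise_error(alpha, n_tests):
    """Probability of at least one type I error across independent tests."""
    return 1.0 - (1.0 - alpha) ** n_tests
```

With alpha = .05, a single test keeps the error rate at .05, but three separate ANOVA's raise it to about .14 under the independence assumption.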

Answer 64

- F ratio compares systematic variance to unsystematic variance - Both are interested in how much variance can be explained by the experimental manipulation - The product of the F ratio is a value representing the effect of systematic over unsystematic variance. - Both use the sum of squares method, the total squared difference between the observed values and the mean value, telling us how much variation can be accounted for by the model.

Answer 65

- ANOVA is univariate, MANOVA is multivariate. - ANOVA uses single values for systematic and unsystematic variance, whereas MANOVA deals with multiple DV’s and must use a matrix representing all of the systematic variance over a matrix representing all of the unsystematic variance in all of the DV’s. - The result of the F ratio is a single value in ANOVA, representing the effect of systematic over unsystematic variance, whereas the result for a MANOVA is a matrix, representing the effect of the systematic variance in all the DV’s over the unsystematic variance in all the DV’s. - MANOVA calculates the sums of squares but also calculates the cross products of the DV’s, thus allowing researchers to look at the correlation between DV’s. These cross products represent the total value for the combined error between two variables.

Answer 66

An eigen analysis produces statistics for the discriminant functions. The two main products of an eigen analysis are a matrix of eigen vectors and a list of eigen values. Eigen vectors are the perpendicular lines intersecting a data set along its diagonal dimensions; all the values off the diagonal are zero, and only the values along the diagonal are used for further analyses. Thus the matrices produced (those representing the systematic over unsystematic variance) will be much smaller and easier to interpret. Eigen values (the values along the diagonal of the HE-1 matrix) are conceptually equivalent to the F ratio in ANOVA, thus indicating whether or not the groups are significantly different from each other along each DV.

Answer 67

- linear variates used to predict which group a person belongs to (to discriminate between groups). Linear variates are linear combinations of the dependent variables, allowing us to investigate the relationship between combinations of DV’s. - The number of discriminant function variates is the smaller of the number of DV’s and the number of groups minus one. Therefore, to avoid confusion and a lot of headaches from trying to interpret so many different dimensions, it is best to stick with about 5-7 DV’s.

Answer 68

- Product of analyses is a set of weights for variables, indicating how much variance can be accounted for by each predictor in the total outcome variance. - Both are interested in predicting an outcome variable. - Both fit linear models to data sets in order to predict an outcome variable.

Answer 69

- In regression the weights produced for the variables are for the IV’s, not the DV’s as in discriminant functions. - In discriminant function analysis we are interested in predicting an IV from multiple DV’s, not a DV from multiple IV’s as in multiple regression. - Several discriminant functions can be produced from a set of DV’s; however, in multiple regression all independent variables are included in a single model.

Answer 70

Multivariate significance tests involve eigen values - magnitude of the eigen values indicates if there is a significant difference between groups somewhere in the multivariate space - like an ANOVA, the multivariate significance test compares systematic variance to unsystematic variance - unlike ANOVA, the F test is for the linear combinations not the individual linear models.

Answer 71

Roy’s test tends to be more powerful because the computer searches for the discriminant function with the largest separation among the groups and runs analyses on that discriminant function, increasing the likelihood of finding a significant result, whereas the other tests simply test all the discriminant functions.

Answer 72

Observations should be statistically independent. Not correlated with each other.

Answer 73

- the assumption that the variances of each DV are equal across groups and that the correlation between any two DV’s is the same in all groups - tested by assessing whether the population variance-covariance matrices of the different groups in the analysis are equal - the test is called Box’s test, and the results should be non-significant, meaning the matrices are the same.

Answer 74

find the linear combinations of the DV’s that best separate or discriminate the groups. This method is more in keeping with the analytic methods of MANOVA because it focuses on the relationships that exist between the DV’s and the underlying dimensions.

Answer 75

It is thought that the previously run MANOVA will protect the follow-up ANOVA’s from inflated type I error. This is flawed because the MANOVA only protects against type I errors for the DV’s on which group differences genuinely exist. Therefore, Bonferroni adjustments are needed to correct the subsequent ANOVA’s.
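
The Bonferroni adjustment itself is simple: divide the overall alpha by the number of follow-up tests. The sketch below uses hypothetical function names and made-up p values purely for illustration.

```python
def bonferroni_alpha(alpha, n_tests):
    """Per-test significance level after a Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


def significant_after_bonferroni(p_values, alpha=0.05):
    """Flag which follow-up tests remain significant at the corrected level."""
    cutoff = bonferroni_alpha(alpha, len(p_values))
    return [p < cutoff for p in p_values]
```

With three follow-up ANOVA's the per-test cutoff is about .0167, so illustrative p values of .01, .02, and .04 would leave only the first test significant.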
