Exam revision questions Flashcards

Question

If the assumption of normality was violated, what is a non-parametric test we can use instead of a t-test?

Answer 1

A Wilcoxon test. The test stat is W. Essentially it measures how many times values from one group are larger than the other.

Answer 2

Wilcoxon effect size r, which has a similar interpretation as Cohen's d.

Answer 3

1. How different the means are (obviously). 2. The degrees of freedom/ sample size. 3. The variance of the two sample. Increased variance decreases t.

Answer 4

Because this does not tell us whether there are multiple populations or not. We are interested in the relationshiop between the variability within groups and variability between groups. This will give us an indication as to whether we are observing multiple populations. Hence, the test stat for ANOVA takes into account the ratio between SSb and SSw.

Answer 5

G-1, where G is number of groups. N-G, where N is combined sample size and G is number of groups.

Answer 6

F=MSb/MSw, where MSb=SSb/G-1, MSw=SSw/N-G.

Answer 7

That the means of the different groups are likely to be significantly different. F is larger when the between groups variation is larger than the within groups variation.

Answer 8

Small. If the null were true then the means of the different groups would be similar and the between groups sums of squares would be low. Even if SSw was low the F stat would still be low.

Answer 9

This means that the between groups variance explains the total variance, i.e. they are the same, and therefore know which group something in is all you need to know to know its value. That the total variance arises completely from within group variance and therefore it is unlikely we are looking at different groups and knowing which group something is in tells us nothing of the value of that entity.

Answer 10

The proportion of the total variance explained by the grouping variable. e.g. an eta-squared of 0.5 suggests that 50% of the variance in the dependent or outcome variable is explained by the predictor variable or grouping variable.

Answer 11

The type I error rate associated with multiple tests, such as multiple t-test done when doing an ANOVA. In other words, it is the probability of obtaining at least one type I error across multiple tests. We want the family-wise type I error rate to be 5%.

Answer 12

Bonferonni correction. Holm correction.

Answer 13

Bonferonni correction is done by multiply the number of tests by the p-values of each test. This is a very conservative approach and leads to a large loss of power and potentially interesting and important information, i.e. high type II error rate.

Answer 14

The Holm correction has the same type I error rate, but lower type II error rate. It works by multiplying the lowest p-value by the number of tests then the next lowest p-value by number of tests-1 and so on until it gets to a p-value it cannot reject.

Answer 15

Yes. Need to also include the correction that was done.

Answer 16

1. That the residuals are normally distribured. 2. There is equal variance across groups.

Answer 17

The residuals are the within groups variance. In other words they are the difference between the individual data points and the their group mean. These need to be normally distributed. It does not matter whether variable itself is normally distributed or not.

Answer 18

Kruskall-Wallis test. This does ANOVA on ranked data as opposed to actual data, similar to Wilcoxon.

Answer 19

It is just a Kruskall-Wallis effect size and it is interpreted like eta-squared, such that 0.23 says that the grouping variabe accounts for 23% of the variance in the outcome variable.

Answer 20

By using Levene's test. This tests checks whether the standard deviations of each group are equal. It yield an F-stat and a p-value. If p-value is <.05 then there is not equal variance across groups. We can then use a Welch one-way ANOVA.

Answer 21

The residuals are different. A two-way ANOVA takes into account variation of two grouping variables

Answer 22

Yes. You use a two-way ANOVA.

Answer 23

When we wanted to see how the mean of a quantitative dependent variable varied according to the levels of two categorical variables. e.g if you wanted to see how the amount of food harvested varied depending on both what type of land the food was grown on and the type of food that was being grown.

Answer 24

Yes. We get an F stat for each factor or grouping variable and for an interaction term, if we include this. This tells us whether a factor has a significant effect on the dependent or outcome variable when we take into account the other factor's influence on the outcome variable. We can either have two significant main main effects (from both grouping variables) or one or none.

Answer 25

Each factor or grouping variable has an F stat calcualted for it. F=MSb/MSr, where MSb = sums of squares between for the factor/(G-1) MSr=sums of squares of residuals/(N-R-C+1). Residuals are the variance that is not accounted for by the factor after both factors have been taken into account. In other words, the residuals reflect how much variation there is in the outcome variable after taking into account the variation associated with the two factors.

Answer 26

An interaction term tells us whether the effect of one grouping variable is dependent on the other grouping variable.

Answer 27

The residuals when we include an interaction term will tend to be smaller because they are reflective of the variance that is not accounted for by the two factors AND the interaction of these factors.

Answer 28

It tells us the effect of one factor if we assume all other factors are zero. Not very useful. If you wanted to know this then do a one-way ANOVA.

Answer 29

Pearson's correlation if the relationship is linear. Spearman's correlation if the relationship is non-linear.

Answer 30

It converts all data to ranks and then does a correlation on the ranked data.

Answer 31

On the basis of the least squares principle, where the summed deviations between the predicted Y values and the actual Y values are the smallest. A regression line aims to minimise residual sums of squares (analogous to within groups sums of squares in ANOVA).

Answer 32

Multiple regression. The model is a plane of best fit.

Answer 33

A multiple regression is done taking into account the interaction term. The model is a curved plane of best fit. Interactions tell us that we cannot understand the relationship between one predictor variable and the outcome variable unless we know the value of the other predictor variable.

Answer 34

The sign of the interaction term. If the sign of the interaction term is positive, then when both predictors are negative or positive (i.e. they have the same sign) then the outcome variable will be high. When the two predictor variables have opposite signs then the outcome variable is low.

Answer 35

The outcome variable is low. It will be high if the two variables have opposite signs.

Answer 36

F stat. Analysis is analogous to ANOVA. The two values that are used for a regression F stat are: 1. the model sum of squares, or SSm 2. the residual sum of squares, SSr

Answer 37

Model sum of squares. The model sum of squares looks at how different the regression line/plane predictions are compared to the mean of the outcome variable. The degrees of freedom are the number of predictors. The steeper the slope the stronger the relationship between predictors and outcome variable and the larger the model sum of squares will be.

Answer 38

The residual sum of squares, which is the difference between the data and the regression line/plane/curve predictions. The larger the summed difference between the actual data and the predictions of the regression model the larger SSr will be and the less likely F will be significant.

Answer 39

We convert data to z-scores and then run the analysis. The coefficients we get are called standardised coefficients. They allow us to compare the impact of each predictor on the outcome variable to each other.

Answer 40

We do t-tests on the slope of the regression model for the predictor and the null hypothesis, which states that the slope should be zero.

Answer 41

1. Linearity. 2. Normality of residuals. 3. High influence points. 4.Collinearity

Answer 42

We can plot the residuals and the predicted values of our test. If there is equal range of residuals across values predicted by model then we can assume linearity. We can look at the model alone or we can also look at residuals and values for each predictor. A Tukey test will have a p-value that indicates whether the model is linear or not. p <.05 indicates that the model is not linear.

Answer 43

An outlier has a large residual, but does not influence the model much. A high leverage point is quite far away from the rest of the data, but does not have a large residual, but does influence or leverage the predicted model somewhat. A high influence point is one that has a large residual and has high leverage. Including a high infuence point significantly alters the regression model.

Answer 44

Using Cook's distance. Cook's distance takes into account both a given data points residual as well as its leverage on the model (measured by hat values). 2k/N, where k is the number of coefficients - don't forget the intercept!

Answer 45

Collinearity refers to whether predictor variables are correlated with each other. The more correlated they are the more uncertainty there is around the coefficients in a regression model.

Answer 46

Using a Variance Inflation Factor (VIF). Square root of the VIF tells us roughly much bigger the confidence interval becomes when we add this predictor, given its collinearity with the other predictor variables.

Answer 47

VIF values greater than 2 or 3.

Answer 48

No. This is because, in general, the model that explains the most variance would be the model that has the most predictors.

Answer 49

AIC and BIC. Lower values indicate better model choice.

Answer 50

A research paper that predefines the exclusion/inclusion criteria for papers, search strategies, and how information from the papers will be coded. These are a response to narrative reviews of research papers that are too vague and subjective with their inclusion or exclusion of certain research papers. There is still subjectivity to these reviews.

Answer 51

Yes. This is because they are a statistical evaluation based on the statistical evaluations done by lots of other research papers.

Answer 52

They effectively have much larger sample sizes, because they are taking the sample sizes of multiple studies and putting them together. Larger sample sizes increase likelihood of generatign significant results.

Answer 53

1. Vote counting takes non-significant results as evidence for no effect. This is not what non-significant results show.

Answer 54

Yes. They allow for research to be collated and interesting, even life-saving, conclusions to be arrived at, e.g. streptokinase as a treatment for heart attack example.

Answer 55

There are many studies that do not get published because they do not show significant results. This means that when significant results are published they will be received with more weight than they may represent, as there has been prior evidence that would diminish this confidence. This is especially pertinent in meta-analyses, as these analyses generally can only work with published works and so cannot take other statistical analysis and research findings into account when doing a meta-analysis.

Answer 56

No. But not all tests that reliable are valid. A test is valid if it is reliable AND is measuring the construct it purports to measure. Reliability refers to a given test generating the same result when re-administered. That is, it consistently generates the same results.

Answer 57

Proposed by Spearman in mid 20th century. Theory of reliability?

Answer 58

All observed values, say for psych assessment, are made up of the true score and an error (endogenous and/or exogenous). Classical Test Theory assumes the follwoing: 1. The expected error is zero. 2. Errors do not correlate with each other. 3. Errors do not correlate with true scores. 4. Expected value of the test is equal to the true score.

Answer 59

Signal/(Signal + Noise) We want to signal to be strong enough to almost cancel out or at least significantly reduce the effect of the noise or error.

Answer 60

Yes. This is because true reliability relies on population data that we do not have. We therefore have to estimate the reliability.

Answer 61

1. Test-retest reliability. 2. Alternate forms reliability. 3. Split-half reliability 4. Cronbach's alpha.

Answer 62

Cronbach's alpha. Generates a very conservative estimate for reliability.

Answer 63

We calculate the standard error of estimation and then multiply it by 1.96. The predicted true score plus or minus this value is the 95% confidence interval for our predicted true score. Essentially this is saying that we are 95% confidence that the clients true score falls within this range.

Answer 64

1. Increase the relationship between the psychological construct and the test. 2. Remove sources of inconsistency in test administration and interpretation. 3. Increase the number of items on the test.

Answer 65

It tells us how much the reliability of our test will change if we change the number of questions on our test. Reliability will increase if we increase the number of questions on our test and decrease if we decrease the number of questions on our test.

Answer 66

"the degree to which evidence and theory support the interpretations of test scores entailed by the proposed uses of the test."

Answer 67

1. Criterion validity. e.g does the test correlate to other gold standard measures of the construct? 2. Content validity-does the test cover the whole domain of the construct? 3. Construct validity - does the test actually assess the construct of interest? Can be evaluated by looking at whether correlation between the test and other related constructs, such as a new anxiety test and its correlation with a well-established test for stress.

Answer 68

Construct validity.

Answer 69

A test's ability to correctly identify positive cases, that is to correctly identify those with a certain disease. True positives/ (true positives + false negatives)

Answer 70

The ability of a test to correctly identify negative cases. True negatives/(true negatives + false positives)

Answer 71

No. But Bayesianists can.

Answer 72

The Bayes factor tells us the relative probability of seeing the data given our two hypotheses. Probability of seeing data given alternative hypothesis is true/Probability of observing data given null hypothesis is true.

Answer 73

1. Construct Validity. 2. External validity. 3. Internal validity. 4. Statistical conclusion validity.

Exam revision questions Flashcards

(106 cards)