Research methods - Review Flashcards

Question

Which of the following is not a potential threat to the internal validity of a quasi-experiement? A. Instrumentation B. Generalisability C. History D. Selection E. Maturation

Answer 1

B. Generalisability (because this concerns external validity!)

Answer 2

A. selection Group 1 X O Group 2 O (control) why not the others?: B. Instrumentation - no reason to think this C. Testing - no repeated measures D. Regression toward the mean - repeated measures only E. response bias - no idea about this and why would it differ

Answer 3

C. Construct

Answer 4

B. Indicates that anger explains 9% of the variance in aggressive behaviour

Answer 5

C. Random samples

Answer 6

D. Q1 contains an unwarranted assumption; Q2 requires information that the participant might not remember; Q3 is double-barrelled.

Answer 7

Q1: B. It provides a measure of the dependent variable Q2. E. a 2 x 2 between-subjects factorial design

Answer 8

C. - 80 Graph goes right to left = negative

Answer 9

D. the relationship between Crimes and Churches is a spurious on.

Answer 10

A. Figure 1 shows heteroscedasticity; Figure 2 shows a strong negative correlation, Figure 3 shows an interaction.

Answer 11

DOES: tell us if a relationship exists Doesn't: tell us effect size or give any conceptual understanding, or how compelling the evidence is

Answer 12

Effect size Clinical significance Error bars

Answer 13

Participants (N)

Answer 14

Formal effect size indices.

Answer 15

Proportion of total variance in the DV that is explained by the effect (IV)

Answer 16

Eta2 (n2) R2 r2 (regression coefficient squared)

Answer 17

Eta2 - for ANOVA R2 - for regression

Answer 18

SS (i.e., sum of sqaures) effect / SS corrected total (NOT TOTAL!!)

Answer 19

Both are indices of proportion explained....but R2 is is total variance explained and R2change is the change for a particular step (i.e. additional variance explained) in hierachical regression.

Answer 20

Partial eta2 ignores any other factors or effects in an analysis and only considers SSerror (unexplained variability) and any extra variability form the effect of IV (SSeffect). thus it can't be used to compare effect sizes (it is not standardised). Only MATTERS when there is more than one factor. (if only one factor eta2 = partial eta2) Eta2 is standardised and can be used to compare and judge effect sizes.

Answer 21

effect size; means; standard deviation

Answer 22

(m1-M2)/ ___________________ (SD1 +SD2)/2 (i.e., average SD for the means)

Answer 23

small - .010 -.04 (1%) medium - .06 - .130 (6%) large - .14 + (14%)

Answer 24

Small- .02 (2%) medium - .13 (13%) large - .26 (26%)

Answer 25

small - .1 (1%) medium - .3 (9%) large - .5 (25%)

Answer 26

small - .20 medium - .50 large - .80

Answer 27

* Never intened them to be indiscriminately applied * Mainly intended for new areas of research and deciding on the necessary sample size (power analysis) when study being planned. * were based on 'typically' observed effect sizes for various analyses across numerous behavioural size areas, not on what is important/useful * they are non-equivalent for different indices (e.g., eta 2 and R2 - even though they are effectively the SAME!)

Answer 28

important.

Answer 29

* standard deviations (indicated variability in scores- least common) * standard erros (allows statistical inference) * 95% confidenc eintervals (allows statistical inference)

Answer 30

Participants' rights outweigh other considerations. Participation must be voluntary Consent must be informed Participants may withdraw at any time Participants response and performance must remain confidential - they also must remain anonymous Harm to participants should be minimised and provisions to adress any stress etc should be done Must have the approval of ethics committee Participants should be debriefed

Answer 31

questions that provide respondents ith a fixed set of alternatives from which to choose

Answer 32

Rating scales include Likert scales. Usually 4-7 response alternatives e.g., 5-point rating scale from 1 "strongly disagree" to 5" strongly agree. Strictly speaking, Likert scales assess the extent of the agreement with a statement and so, have anchors such as "Agree" and "Disagree". However, there are many variations on this theme, such as "approve- disapprove" or "Satisfied- dissatisfied", which are simply referred to as 'rating scales' (or Likert-type scales)

Answer 33

Unipolar - a rating (e.g., 1-6) towards either ends of one construct e.g. satisfaction (1 not at all satisfied -----6 very satisfied Bipolar - a rating (e.g.., 1-6) towards either ends of two constructs e.g., 1 very unsatisfied to 6 very satisfied. Semantic Differential Scales - anchors are polar opposite adjectives, e.g., healthy-sickly, generous-selfish etc

Answer 34

Ensure that all possible response alternatives are available Use clear verbal anchors for rating scales Use more, rather than fewer, question to assess a construct (such as self-esteem). (Reliability tends to be greater with more questions because, if there's a poor question it has less effect._ NOTE: response alternatives must match the question. Use existing, validated, questionnaires where possible If designing your own questions, pilot or pre-test them (begin asking friends to read the questions: what appears to be a model of clarity to the author can seem exceedingly muddy to the naive respondent) Determining and reporting reliabilities for questionnaire data is a standard procedure (where relevant or appropriate, e.g., it usually makes no sense to determine reliabilities for basic demographic information)

Answer 35

Response biases - social desirability, response sets (acquiescent response set, deviation response set) Other problems Unclear question or instructions (instructions are important) Appropriate response not available Imprecision of many ordinal scales (differences in interpretation) Question is double-barrelled (or, much worse, quadruple-barrelled!) Respondent doesn't know the answer (e.g., can't remember) Suggestibility: the respondent did not really hold a view on an issue until the questionnaire suggested it. A question might be based on an inappropriate and unwarranted assumptions about respondents (e.g., not applicable) Leading questions Very general questions Sensitivity of an issue.

Answer 36

Double negative; unbalanced scale

Answer 37

Leading question

Answer 38

Double barrelled

Answer 39

Sensitive question

Answer 40

What does 'Australian" mean?

Answer 41

No other categories? Vague categories. Respondent might not work.

Answer 42

Will participants remember?

Answer 43

Response options don't match the question & is this an exhaustive list? student? retired? parent?

Answer 44

'usually' is too vague, what does 'healthy' mean?

Answer 45

The SD can be thought of as the average amount by which any score (in a set of scores) differs from the mean. (It's actually a bit bigger than the strict average). It tells us how typical or atypical the mean is of scores within a group, and how much, on average, scores deviate from the mean (above and below). It uses the same unit of measurement as that which was used to measure the variable. SD is the square root of the variance

Answer 46

Effect size simply refers to the strength (often called a magnitude) of an effect (or of a relationship or difference), i.e., how big the effect is. It usually tells us something about the importance of a relationship (or of a difference between groups, which is also a relationship)

Answer 47

Yes, if we are comparing means for different conditions (or times), we can assess effect size just by looking at the degree of difference between means. (can also use cohen's d) For example, if we reduce the number of times that autistic children engage in head-banging from a baseline mean of 26 times per day to a post-treatment mean of 1 time per day, this is much more impressive (larger effect of treatment) and useful outcome than if we'd reduced it from 26-20 times a day.

Answer 48

Yes, the size of the correlation indicates the strength of the relationship but its significance level does not. (because very small correlations can attain significance with sufficient participants!)

Answer 49

Consistency or stability of a measure. Logically, if a measure is valid (a good/accurate measure), it should be reliable.

Answer 50

The degree of correspondence/agreement between individual items that make up a measure of a single characteristic. Logically, if a number of items are meant to measuring the same construct, there should be a high level of agreement between them. Cronbach's Alpha: the average correlation questionnaires (easily calculated [by a computer] and reported) Cronbach's alpha can range from 0-1. SPSS will calculate it and can also indicate the change in Cronbach's if any particular item were omitter. Good (i.e., acceptable, though not outstanting reliability = .80*. the higher the better)

Answer 51

The correlations between each item and the total score (of a measure).

Answer 52

The correlation between total scores for the two halves of a test of measure.

Answer 53

the correlation between scores for 2 administrations of the measure.

Answer 54

The correspondence or agreement between the scores of two independent observers or judges. Used when judgement is used by the score or assessor, or where assessment requires skill or training.

Answer 55

Where 2 or mor ecategorical IVs are manipulated (or measured if it is a quasi-experiment) AND all combinations of the IVs are tested. allows us to see both the separate and combined effects of the IVs.

Answer 56

Categorical information (e.g.., employer status, sex)

Answer 57

when the 'quantification' of th econstruct is imprecise, but we do assume that the scores indicate something about the relative amounts of a particular characteristis, that is, that we could rank scores in a meaningful order (e.g., 0-5 stars on a movie)

Answer 58

A more precise measurment, where the difference (or distance or interval) between any two adjacent scores in the same and the unit of measurment does have a universal maning and indicates the same amount of a property across different participants or situation. However the scale does not have an absolute zero. e.g., IQ score, temperatur

Answer 59

Precise measurement/quantification whereby 0 means NONE/abscense of the characteristic e.g., heartrate, number of children, number of words recalled

Answer 60

Continuous DVS that use an ORDINAL DEPENDANT VARIABLES (i.e., they can be put in rank order but there is not equal distance between each value and no true 0) + data that does not meet the assumptions for patametric tests (i.e., skewed data)

Answer 61

Categorical DV

Answer 62

chi-square - intepretted like a correlation.

Answer 63

ranks; raw scores

Answer 64

Median (middle score)

Research methods - Review Flashcards

(88 cards)