Week 1 Flashcards

(36 cards)

1
Q

Reliability

A

the consistency or repeatability of measures

Reducing random measurement error improves reliability
Validity rides on the back of reliability
Reliability does not guarantee validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Validity

A

are we measuring what we are trying to measure?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Test-retest reliability

A

Used to assess the consistency of a measure from one time to another
The correlation between scores across two administrations of the measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Parallel forms reliability

A

Used to assess the consistency of the results of two tests constructed in the same way from the same content domain

  • Split-half reliability
  • Item-total reliability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Internal consistency reliability

A

Used to assess the consistency of results across items within a test
Cronbach’s alpha – the average correlation among all possible pairs of items (.80 or above)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Types of validity

A
  • measurement (construct) validity
  • external validity
  • internal validity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Construct validity

A

A ”construct” refers to a behaviour or process that we are interested in studying

  • > e.g., Depression, Short term memory, Social prejudices
  • Do our measures and manipulations (of both IV’s and DV’s) reflect the theoretical constructs of interest
  • > Operationalisation of measurement – how we measure a construct
  • > Operationalisation of experimental manipulation – how we manipulate an IV
  • Both our measures and our manipulations must be valid
  • For example - how well does the Beck Depression Inventory measure Depression?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Measurement validity - convergent

A
  • Do scores on the measure correlate with scores on other similar measures related to the construct?
  • Relates to the degree to which the measure converges on (is similar to) other constructs that is theoretically should be similar to
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Measurement validity - discriminate (divergent)

A
  • Do scores on the measure have low correlations with scores on other different measures that are unrelated to the construct?
  • Relates to the degree to which the measure diverges from (is dissimilar to) other constructs that it should be not similar to
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Measurement validity - face

A
  • On its face value, does the measure seem to be a good translation of the construct?
  • If you ask participants to do some sums – will they understand this will measure their arithmetic ability?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Measurement validity - content

A
  • Does the measure assess the entire range of characteristics that are representative of the construct it is intending to measure?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Measurement validity - criterion

A

Concurrent
- Do scores on the measure align with scores on other measures (recognised and reliable) as they should

Predictive
-Are scores on the measure able to predict future outcomes (i.e., attitudes, behaviours, performance)?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Manipulations can be

A

Instructional:
Experimental conditions are defined by what you tell participants

Environmental:
Stage an event, present a stimulus, induce a state

Stooges:
Use fake participants to alter experimental conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When doing it right, manipulations will

A
  • Reduce random error (replicate procedure)
  • Reduce experimenter bias
  • Reduce participant bias
  • Ensure manipulation has construct validity
  • Do a manipulation check – ask participants about various aspects, beliefs, attitudes etc.,
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

External validity

A
  • Extent to which the results can be generalised to other relevant populations, settings or times
  • Studies have good external validity when results can be replicated:
  • > Using alternative operationalisation of variables
  • > Measuring a different sample of participants
  • > Conducting the research in another setting
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

External validity - ecological

A

The extent to which the results can be generalised to real-life settings

17
Q

External validity - population generalisation

A

Applying the results from an experiment to a group of participants that is different and more encompassing than those used in the original experiment

18
Q

External validity - environmental generalisation

A

Applying the results from an experiment to a situation or environment that differs from that of the original experiment

19
Q

External validity - temporal generalisation

A

Applying the results from an experiment to a time that is different from the time when the original experiment was conducted

20
Q

The replication crisis in Psych - why does this happen

A
  • Replications are uncommon
  • In the top 100 psych Journals between 1900-2012, only 1.6% were replications (Makel et al., 2012)
  • Substantial bias towards publishing significant findings (and not null findings)
  • Times change – especially in terms of areas like social psych
  • Alpha cutoffs are arbitrary
21
Q

The replication crisis in Psych - what to do about it

A
  • Increase in replications
  • Pre-registration of studies (if methods are good they will be published regardless of significance)
  • Suggestion that we might consider probability based analyses rather than null hypothesis significance testing
22
Q

Internal validity

A
  • Able to conclude causal relationships from results
  • Extent that any effects on the DV were caused by the IV
  • Elimination of alternative explanations for observed relationships
  • Inferences of cause-and-effect require three elements
  • > Co-variation
  • > Temporal precedence
  • > Elimination of alternative explanations
  • Strong internal validity requires an analysis of these three elements
23
Q

Selection bias

A
  • A threat to internal validity that can occur if participants are chosen in such a way that the groups are not equal before the experiment
  • Differences after the experiment may reflect differences that existed before the experiment began
  • Differences after the experiment may reflect differences that existed before the experiment began plus a treatment effect
24
Q

Maturation

A
  • Changes in participants during the course of an experiment or between measurements of the DV due to the passage of time
  • > Permanent – e.g., age, biological growth, cognitive development
  • > Temporary – e.g., fatigue, boredom, hunger
  • Most common is naturally occurring developmental processes (i.e., children)
25
Statistical regression - regression towards the mean
- Extreme scores on first measurement tend to have scores closer to the mean on second measurement - Subsequent scores are still likely to be extreme in the same direction – but not as extreme - With extreme scores, it is difficult to maintain degree of extremity over repeated measures - If participants are selected on the basis of extreme scores – regression to the mean is always going to be a possible explanation for higher or lower scores on a repeated (or similar) test
26
Mortality - attrition
- Relates to premature dropouts and differential dropout across experimental conditions - If a differential dropout rate occurs, then it is likely that the groups of participants are not as equal at the end of the experiment as they were before - If the intervention is unpleasant or very demanding or is not working – dropout could be higher in this group than in the control group - Or if the participants in the control group see no benefit or no improvement – dropout could be higher in this group than the experimental group
27
History
- Outside events that may influence participants in the course of the experiment or between the DV measurements in a repeated-measures design - History can include major historical events like terrorist attacks, political changes, or smaller personal changes like joining a gym or changing jobs or having a baby - If they are relevant to the study in some way, these events can influence the second DV score obtained – independently of any intervention
28
Testing
- Prior measurement of the DV may influence the results obtained for subsequent measurements of the DV - Measuring the DV can cause a change in the DV E.g., participant become aware of study aims
29
Practice effect
- When a beneficial effect on a DV measurement is caused by previous experience with the DV measurement itself E.g., making free-throws
30
Instrumentation
- Changes in measurement of the DV that are due to the measuring device (equipment or human) - The equipment or human measuring the DV changes the measuring criterion over time
31
Observee reactivity - Hawthorne effect
Participants change their behaviour when they know they are being observed ("reactivity")
32
Social desirability
Reporting inaccurately on sensitive topics in order to present in the best possible light
33
Demand effects
Relate to an aspect of the research that allows participants to guess what the research is about
34
Placebo effect
Results from participant’s own expectations about experiments or expectations about what will happen or what is meant to happen
35
Experimenter bias
Errors in a research study due to the predisposed notions or beliefs of the experimenter e.g., Observer bias (selective viewing or interpretation of behaviours, NLP)
36
Controlling threats to internal validity
- Randomly allocate participants to levels of the IV(‘s) - Treat all conditions equally except for intended IV manipulations - Use appropriate control conditions where relevant - Use double-blind studies where possible