Assessing Improving Reliability Validity Flashcards
(32 cards)
Validity Overview
• Research should have validity - measuring what you think you are measuring.
Internal Validity:
Internal = inside the study. Is the research measuring what it intends to measure? Is it measuring the effect of just the IV on the DV?
Affected by extraneous variables e.g. demand characteristics, researcher bias, investigator effects, social desirability, order effects.
External Validity:
External = outside the study. Whether findings can be generalised outside the study.
Ecological Validity
A form of external validity. Extent findings can be generalised beyond the setting of the study to other real life settings.
Population Validity:
A form of external validity. Extent findings can be generalised beyond the sample studied to the target population.
Temporal Validity:
A form of external validity. Extent findings remain true over time and can be generalised to other time periods.
Face Validity:
Assessing validity
An independent psychologist in the same field looks at the experimental conditions (questions or behaviour categories) to see if they look like they measure what they intend to measure (CONTEXT). If yes, research has face validity.
Concurrent Validity:
Assessing validity
Compare results of new test (CONTEXT) with results from a similar established test using stats test. Correlation should exceed +0.8. If similar, test is valid.
Improving Validity
• Experimental Research:
Use control group to check IV affects DV (cause & effect).
Use standardised procedures/instructions to reduce investigator effects.
Use single blind to reduce demand characteristics & double blind to reduce demand characteristics and investigator effects.
Validity in Observations
Face Validity:
Independent psychologist checks if behaviour category (CONTEXT) looks like it measures what it claims to measure at first sight.
Concurrent Validity:
Validity of observation
Compare new observation with similar established observation, correlation should exceed +0.8.
• Improving Observations:Validity
Behaviour categories operationalised.
Observers trained to use categories.
Use covert observations for natural behaviour.
Validity in Self-Reports
Face Validity:
Independent psychologist checks if questions in questionnaire/interview (CONTEXT) look like they measure what they intend.
Concurrent Validity:self report
Compare new questionnaire/interview with established one, correlation should exceed +0.8.
Improving Self-Reports: validity
Lie tests (nearly identical questions for consistency).
Use standardised procedures.
Allow anonymity.
Avoid leading questions.
What is meant by the term reliability?
Reliability refers to the ability to repeat a study in similar conditions to gain consistent results.
Is reliability high or low in a lab experiment and why?
High reliability – control environment – control over extraneous variables.
Is reliability high or low in a field experiment and why?
Low reliability – real life environment – low control over extraneous variables.
Is reliability high or low in a quasi experiment and why?
Low reliability – naturally occurring IV – low control over extraneous variables
: Is reliability high or low in a natural experiment and why?
Low reliability – naturally occurring IV – low control over extraneous variables.
How is test-retest used to assess reliability?
• Participants complete a task or measure (CONTEXT).
• After a time delay (e.g. 2 weeks), same task is repeated.
• Results from both tests are correlated.
• A strong positive correlation above +0.8 shows high reliability.
What does operationalising mean?
Being specific and clear when defining the IV and DV in an experiment so they can be easily measured.
Why is operationalising variables important for reliability?
: If variables are clear and specific, another researcher can repeat the study to check for consistent results (replicability). If consistent, the research is reliable.
What is inter-observer reliability?
Two observers are trained on behaviour categories (CONTEXT).
• They watch the same behaviour for the same time, but record independently.
• Tallies are compared and correlated using stats test.
• A strong positive correlation of +0.8 shows high reliability.