Week 2 - Lecture 2 Flashcards
Hypotheses Testing
define conceptual hypothesis
- outlines the predicted relationship between the independent varible (IV)/predictor variable and the dependent variable (DV)*/outcome variable *
- eg. Intoxicated participants will be more creative than sober participants
define statistical hypothesis
- represents the mathematical relationship **presumed to exist **between **two or more population parameters **
- eg. creativity scores will be higher among intoxicated participants than among sober participants
what is the Neyman-Pearson Hypothesis Testing Model?
-
statistical analysis in psychology is guided by the Neyman-Pearson hypothesis testing model
In this model there are **two statistical hypotheses offered: ** - The null hypothesis (H0)
- The alternative (research) hypothesis (H1)
what does the null hypothesis (H0) represent?
- there is no relationship between the two variables that we are investigating
- assertion that researcher is trying to reject (something researchers want to say is unlikely to be true)
what does the alternative (research) hypothesis (H1) represent?
- there is a relationship between the** two variables**
- what the researcher believes to be true
how does the null hypothesis vs alternative hypothesis look graphically
- yellow could be drawn anywhere
- statistically you’re testing the** likelihood** that the sample came from the population as specified in the null
What is meant by “rejecting the null”?
the question we are always considering is:
- what is the likelihood my **sample was drawn **from a population with the mean specified under the null hypothesis?
- this is why we must **specify **the **H0 mean **up front
- we then aim to reject it on the basis that the sample mean is so extreme that it is unlikely to have come from the population **specified under the null **
How “extreme” does a sample mean need to be for us to “reject the null”?
By convention, we** reject the null** when the sample mean is so extreme that it would occur when the** null is true** less than 5% of the** time**
- Referred to as** p < .05** (critical region, alpha greek symbol)
- specifies acceptable risk of **rejecting H0 when it is true**
- ie. only 5 out of 100 experiments would have produced the obtained result (or an even more extreme one) by chance (under H0)
what does the critical region look like mapped into the normal standard distribution curve?
when you get a Z score in the middle region, we retain H0
- It’s not extreme enough to feel confident that it’s unlikely the sample came from the population *
Once it gets into the rejection region of the two tails, it’s unlikely *we would** get a value this extreme** based on chance alone, if H0 was correct
- spread our error into two tails, so that** 2.5% of the time we’re going to make an error in either direction **
what is the significance/meaning behind rejecting/retaining the null hypothesis (H0)?
- rejecting H0 simply means that it is unlikely given the current data and we accept the alternative hypothesis for the time being
-
retaining H0 does not mean that it is true - we might just not have found sufficient evidence to reject it at the moment
**We cannot prove a statement ** - the best we can do is evaluate the evidence to see whether H0 is unlikely to be true
Four types of hypotheses
define causal-directional hypotheses
- more A causes more (or less) B
- eg. consuming alchohol increases creativity
- experimental data
Four types of hypotheses
define causal non-directional hypotheses
A causes B
- eg. consuming alchohol affects creativity
- experimental data
Four types of hypothses
Define non-causal directional hypotheses
- more A is related to more (or less) B
- Greater alchohol consumption is associated with greater creativity
Four types of hypotheses
Define non-causal non-directional hypotheses
A is related to B
- alcohol consumption is associated with creativity
which type of hypothesis is the following scenario?
- Giving a gift to someone impacts your happiness
Causal, non-directional
which type of hypothesis is the following scenario?
- setting specific, measurable goals leads to better performance
causal, directional
which type of hypothesis is the following scenario?
- stress is related to workplace attitudes
non-causal, non-directional
what is a non-directional hypotheses?
Predict that a **difference will exist without specifying the nature or direction of that difference**
- eg. alchohol consumption is associated with creativity
- eg. relatable examples affect enjoyment learning statistics
However, we’re always interested in the direction when it comes to **interpreting any significant effects **
define one-tailed vs two-tailed tests
When Alpha is 0.05, acceptable chance of making a** Type 1 error **(rejecting H0 when H0 is actually true)
- One-tailed test: Large rejection region just on one side
- Two-tailed test: Smaller rejection regions on both sides
- **allows for differences in either direction (more room for error) **
why do we compare a** sample mean** to a distrubution of means in psychology?
Discussed** individual scores** of their l**ikelihood of being observed **in the **general population **
- compared one individual score (yours) to a distribution of individual scores (the population)
- But in psychology we’re interested in sample scores, *not *a single *individual’s score *
Statistically, we can’t compare a sample mean with a comparision distribution of individual scores
- instead we use a comparison distribution of means of samples of scores: we compare a sample mean to a *distribution of means *
what are sampling distributions?
- sampling distributions are how things would look if you repeatedly took samples (of a given size) from a **population of interest **
- It’s not representing the raw scores but some **statistic from the sample **
- Can have a sampling distribution for any statistic, but the most common is the** sampling distribution of the mean **
what is natural variability?
Gives you the **distribution of means **we can expect when we do this repeated process of drawing samples of the same size and calculating their mean
- mean of the sampling distribution will be the population mean, but the SD shows you there is **variability around that **
When will the sampling distribution of the mean be normally distributed?
- the sampling distribution of the mean will be normally distributed if the population of raw scores is normally distributed or if the** sample size is at least 30 (central limit theorem)**
- don’t need to understand central limit theorem, just know that sample size matters
- **bigger sample size **= sampling distribution of the mean will be normal
- *critical assumption for our statistical tests *
- the mean of the sampling distribution is equal to the mean of the actual population (refer to photo)
what is the standard error of the mean?
- Standard error of the mean is the** standard deviation **of the **distribution of sample means **
- It represents the typical or average distance between a **sample mean **and the mean of the population
- It is used to* define *and accurately measure sampling error
- Tells you how much you would expect the sample mean to vary just **based on chance alone **