Week 2 - Lecture 2 Flashcards by Phoebe McDonnell

Hypotheses Testing

define conceptual hypothesis

outlines the predicted relationship between the independent varible (IV)/predictor variable and the dependent variable (DV)*/outcome variable *
eg. Intoxicated participants will be more creative than sober participants

How well did you know this?

Not at all

Perfectly

define statistical hypothesis

represents the mathematical relationship **presumed to exist **between **two or more population parameters **
eg. creativity scores will be higher among intoxicated participants than among sober participants

How well did you know this?

Not at all

Perfectly

what is the Neyman-Pearson Hypothesis Testing Model?

statistical analysis in psychology is guided by the Neyman-Pearson hypothesis testing model
In this model there are **two statistical hypotheses offered: **
The null hypothesis (H0)
The alternative (research) hypothesis (H1)

How well did you know this?

Not at all

Perfectly

what does the null hypothesis (H0) represent?

there is no relationship between the two variables that we are investigating
assertion that researcher is trying to reject (something researchers want to say is unlikely to be true)

How well did you know this?

Not at all

Perfectly

what does the alternative (research) hypothesis (H1) represent?

there is a relationship between the** two variables**
what the researcher believes to be true

How well did you know this?

Not at all

Perfectly

how does the null hypothesis vs alternative hypothesis look graphically

yellow could be drawn anywhere
statistically you’re testing the** likelihood** that the sample came from the population as specified in the null

How well did you know this?

Not at all

Perfectly

What is meant by “rejecting the null”?

the question we are always considering is:
- what is the likelihood my **sample was drawn **from a population with the mean specified under the null hypothesis?
- this is why we must **specify **the **H0 mean **up front
- we then aim to reject it on the basis that the sample mean is so extreme that it is unlikely to have come from the population **specified under the null **

How well did you know this?

Not at all

Perfectly

How “extreme” does a sample mean need to be for us to “reject the null”?

By convention, we** reject the null** when the sample mean is so extreme that it would occur when the** null is true** less than 5% of the** time**
- Referred to as** p < .05** (critical region, alpha greek symbol)
- specifies acceptable risk of **rejecting H0 when it is true**
- ie. only 5 out of 100 experiments would have produced the obtained result (or an even more extreme one) by chance (under H0)

How well did you know this?

Not at all

Perfectly

what does the critical region look like mapped into the normal standard distribution curve?

when you get a Z score in the middle region, we retain H0
- It’s not extreme enough to feel confident that it’s unlikely the sample came from the population *
Once it gets into the rejection region of the two tails, it’s unlikely *we would** get a value this extreme** based on chance alone, if H0 was correct
- spread our error into two tails, so that** 2.5% of the time we’re going to make an error in either direction **

How well did you know this?

Not at all

Perfectly

what is the significance/meaning behind rejecting/retaining the null hypothesis (H0)?

rejecting H0 simply means that it is unlikely given the current data and we accept the alternative hypothesis for the time being
retaining H0 does not mean that it is true - we might just not have found sufficient evidence to reject it at the moment
**We cannot prove a statement **
the best we can do is evaluate the evidence to see whether H0 is unlikely to be true

How well did you know this?

Not at all

Perfectly

Four types of hypotheses

define causal-directional hypotheses

more A causes more (or less) B
eg. consuming alchohol increases creativity
experimental data

How well did you know this?

Not at all

Perfectly

Four types of hypotheses

define causal non-directional hypotheses

A causes B
- eg. consuming alchohol affects creativity
- experimental data

How well did you know this?

Not at all

Perfectly

Four types of hypothses

Define non-causal directional hypotheses

more A is related to more (or less) B
Greater alchohol consumption is associated with greater creativity

How well did you know this?

Not at all

Perfectly

Four types of hypotheses

Define non-causal non-directional hypotheses

A is related to B
- alcohol consumption is associated with creativity

How well did you know this?

Not at all

Perfectly

which type of hypothesis is the following scenario?
- Giving a gift to someone impacts your happiness

Causal, non-directional

How well did you know this?

Not at all

Perfectly

which type of hypothesis is the following scenario?
- setting specific, measurable goals leads to better performance

causal, directional

which type of hypothesis is the following scenario?
- stress is related to workplace attitudes

non-causal, non-directional

what is a non-directional hypotheses?

Predict that a **difference will exist without specifying the nature or direction of that difference**
- eg. alchohol consumption is associated with creativity
- eg. relatable examples affect enjoyment learning statistics
However, we’re always interested in the direction when it comes to **interpreting any significant effects **

define one-tailed vs two-tailed tests

When Alpha is 0.05, acceptable chance of making a** Type 1 error **(rejecting H0 when H0 is actually true)
- One-tailed test: Large rejection region just on one side
- Two-tailed test: Smaller rejection regions on both sides
- **allows for differences in either direction (more room for error) **

why do we compare a** sample mean** to a distrubution of means in psychology?

Discussed** individual scores** of their l**ikelihood of being observed **in the **general population **
- compared one individual score (yours) to a distribution of individual scores (the population)
- But in psychology we’re interested in sample scores, *not *a single *individual’s score *
Statistically, we can’t compare a sample mean with a comparision distribution of individual scores
- instead we use a comparison distribution of means of samples of scores: we compare a sample mean to a *distribution of means *

what are sampling distributions?

sampling distributions are how things would look if you repeatedly took samples (of a given size) from a **population of interest **
It’s not representing the raw scores but some **statistic from the sample **
Can have a sampling distribution for any statistic, but the most common is the** sampling distribution of the mean **

what is natural variability?

Gives you the **distribution of means **we can expect when we do this repeated process of drawing samples of the same size and calculating their mean
- mean of the sampling distribution will be the population mean, but the SD shows you there is **variability around that **

When will the sampling distribution of the mean be normally distributed?

the sampling distribution of the mean will be normally distributed if the population of raw scores is normally distributed or if the** sample size is at least 30 (central limit theorem)**
don’t need to understand central limit theorem, just know that sample size matters
- **bigger sample size **= sampling distribution of the mean will be normal
- *critical assumption for our statistical tests *
the mean of the sampling distribution is equal to the mean of the actual population (refer to photo)

what is the standard error of the mean?

Standard error of the mean is the** standard deviation **of the **distribution of sample means **
It represents the typical or average distance between a **sample mean **and the mean of the population
It is used to* define *and accurately measure sampling error
Tells you how much you would expect the sample mean to vary just **based on chance alone **

define standard error (SE)

how much would you expect the sample mean to differ from the population mean, just based on chance alone, for a sample of this size

# True or False We have a standard error of 3. This means we would expect the **mean IQ **of our sample to **differ** from the **population mean** on average by *3 IQ points*. So a **sample mean IQ** of *97 (3 points below the mean) or 103 (3 points above the mean)* is **within the margin of error we would expect** based on **sampling error**

True

what does the Z score tell you?

tells you in SE units how far the sample mean is from the population mean

shortcut for rejecting the null

**critical z score** (when doing a two-tailed test, and alpa = 0.05): - **1.96** is the **critical value** on either side you **need to exceed** to **reject** the **null **hypothesis - any z-score falling **above 1.96** = you can **reject the null ** - any z-score falling **below -1.96 **= you can **reject the null** - any **other z-score **= you need to **retain** the **null**

When we know the mean of a population but do not know the standard deviation for the population the best we can do is...

- Use an **approximation** of *that* **standard deviation** from the **standard deviation** for the *sample* - This means we end up using a **one-sample t-test**

steps to testing hypotheses about means - sigma unknown

- conduct a one-sample t-test - to calculate SE need to substitute an estimate of sigma

what is the formula for sigma unknown hypotheses testing?

refer to photo

When sigma is unknown, the nature of the test changes how?

- then we obtain a *t value* and **compare** to a* t distribution * WHY? - the **sampling distribution** of the **sample variance** is** positively skewed** -** S^2** will **underestimate sigma^2** (esp. for small samples) - We'd be more** likely to make a Type 1 error **if we used **critical z scores**, so we use **critical t instead **

explain Student's t distribution

- Gosset showed that if s^2 is used in place of sigma^2 it would lead to a particular distribution, called a t-distribution

What are the similarities in both z and t distributions

Both distributions (z & t): - have a mean of zero - are symmetrical - are unimodal

what are the differences between a z and t distribution?

The t distribution: - has a flatter distribution - Has a larger standard deviation - Depends on df, so t-distribution is a family of distributions

To find the correct 'critical' t value you need...

- df (degrees of freedom) and alpha

Implications for statistical inference

- larger values of t required to reject null hypotheses - depends on df - as df approaches infinity, the distribution will approach normality and be equivalent to z

Degrees of freedom - what are they?

- **number of scores that are free to vary ** - df = N = number of parameters estimated - For a one sample t-test,** df = N - 1 ** - because in a one-sample t-test we use s^2 to estimate alpha^2 **EXAMPLE:** - take the numbers 6,8 & 10 - sample mean (x-bar) = 8 - **Now you may change any of these numbers as long as the mean stays as 8 ** - How many numbers can you change? - only 2 are "free" to vary and keep the mean of 8 - 6,8,10 (sample mean = 8) - 7,?,13 (sample mean = 8) - ? has to be 4 - so for a **one sample t-test**, **df = N - 1 **