Null Hypothesis significance testing Flashcards by Miayla Marcus

What is the scenario we are going to be focusing on here?

Imagine you are interested in anxiety levels in younger and older adults. You decide to class <45 as younger and >45 as older
You collect data from a sample of 25 younger and 25 older people on an anxiety questionnaire which produces normal scores, has been thoroughly validated in the UK and for which the population parameters are known

How well did you know this?

Not at all

Perfectly

Why can’t we assess whether our sample and population means are different from the UK average?

due to sampling errors

How well did you know this?

Not at all

Perfectly

What are the questions we could ask of this data?

• Does the sample mean look very different from the UK population average?
- Hypothesis: The sample came from a population with higher/ different population mean
- Same as asking ‘does the sample mean look inconsistent with a sample of the appropriate size from this population?’
• Do the older vs. younger sample means look different from each other?

How well did you know this?

Not at all

Perfectly

What is our null hypothesis significance testing example we are going to be using?

Imagine there is a carefully validated test which measures statistics ability in 1st year UK undergraduates
Assume that the population test scores follow N(u, sigma)
You have good reason to hypothesise that your cohort is better at stats than the general UK 1st year undergrad population

How well did you know this?

Not at all

Perfectly

What happens in step one of NHST and apply this to our example?

generate hypotheses:
Null hypothesis (H0): My cohort is not better at stats than the UK population of 1st year UG students and my sample is typical for sample from that population
Put in another way: Performance of a sample of students from my year will be comparable to a random sample of the same size from the population of test takers – the sample from my year will not be particularly different from any other random sample of that size from the population of test takers
Put another way: the sample will not have an atypically high mean
(H0): the sample came from a population with mean uh = u
(uh = hypothetical value for population mean from which this sample is drawn)
(u = population mean for UK 1st year UG students)
Research Hypothesis (H1): My cohort is better at stats than the UK population of 1st year UG students
Put another way: performance of a sample of students from my year will not be comparable to a random sample of the same size from the population of test takers – the sample from my year will be markedly higher than other random samples of that size from the population of test takers
Put another way: the sample mean will be more consistent with that of a random sample of the same size taken from a population with a higher mean
(H1): the sample came from a population with mean uh> u)

How well did you know this?

Not at all

Perfectly

What happens in step 2 of NHST and apply this to our example?

data
Our ability to test hypotheses rests fundamentally on collecting good data
Experiment needs to be well designed with as much control as possible of extraneous variables
In our example, data collection requires us to get an appropriate random sample (N=25) from our cohort and get their stats test results
We will then be able to work out sample statistics (e.g. mean m, s.d.)
We also know about the population parameters and so about the SDM

How well did you know this?

Not at all

Perfectly

What happens is step 3 of NHST?

evaluate inconsistency with H0
how inconsistent are the data with H0
related to where our sample mean is in relation to our parent population mean (because uh = sample mean)
if our sample mean is closer to the parent population mean then you would say that it is more consistent with our null hypothesis and less consistent with our research hypothesis
you have to assume that H0 is true

How well did you know this?

Not at all

Perfectly

What happens in step 4 of NHST?

reject or fail to reject
Based on step 3 we now know how inconsistent the data are with H0
If data are sufficiently inconsistent with H0 we say ‘we reject H0 in favour of H1’
If data are not sufficiently inconsistent with H0 we say ‘we fail to reject H0’ (this is not the same as finding evidence for H0)

How well did you know this?

Not at all

Perfectly

What happens in step 5 of NHST?

interpret
We now interpret our findings in terms of our hypotheses
If we could reject the null (H0) we say that we have evidence for the research hypothesis (H1) and that our sample came from a population with mean uh>u
In our case that would mean we have evidence our cohort is better at stats than the UK population of 1st year UGs
If we failed to reject the null (H0) then we can not claim to have evidence our cohort is better than the UK population of 1st year UGs
This is not the same as finding evidence for H0 – we can not conclude that our cohort has the sample population mean as the UK population of 1st year UGs. We are looking for evidence for the research hypotheses, not the null hypotheses.

How well did you know this?

Not at all

Perfectly

In step 3 if our sample mean is not particularly high what would you say about inconsistency?

Not very inconsistent: p is not small here – i.e. it seems entirely plausible to get a sample mean here if null is true so don’t reject the null

How well did you know this?

Not at all

Perfectly

In step 3 if our sample mean is uncharacteristically high what would you say about inconsistency?

Very inconsistent: p is rather small here – i.e. it seems too implausible to get a sample mean here if null is true so reject the null

How well did you know this?

Not at all

Perfectly

When do you fail to reject the null?

when values of p>a

- suggest not inconsistent with H0

How well did you know this?

Not at all

Perfectly

When do you reject the null?

When values of p<a></a>

How well did you know this?

Not at all

Perfectly

What value is a? (how small should p be before we decide to reject H0)

we use a value of a = 0.05 (5%)

How well did you know this?

Not at all

Perfectly

What is the conditional probability associated with you sample statistic assuming that the null hypothesis is true sometimes called?

The p-value

How well did you know this?

Not at all

Perfectly

is p the probability that the null hypothesis is true?

Study These Flashcards

No. We can never know this. We are just looking for evidence of inconsistency

if p<0.05 does that mean that H1 (our research hypothesis) is correct?

Study These Flashcards

- by chance (i.e. random sampling error) you could get a sample mean that was extreme (leading to small p)

What is the z-test?

Study These Flashcards

A null hypothesis significant test in which in step 2 you find out the z-score for your data and hence the conditional probability of obtaining that value. If p < 0.05 then we can reject the null hypothesis in favour of the research hypothesis as our data is inconsistent with the null hypothesis - it is not likely to get this sample mean from the parent population.

When do you use a z-test?

Study These Flashcards

whenever we want to check a sample mean we have obtained is different from a population mean?
does it look like our sample came from that population or a population with a different mean?

Why don’t we usually use a z-test?

Study These Flashcards

Because we don’t usually have the parent population parameters

What is a one tailed hypothesis?

Study These Flashcards

only interested in one tail of the graph
It can be interested in the conditional probability of having got a sample mean as low or lower or as high or higher than our population mean
used when making a directional prediction
only formulate a one tailed hypothesis if you have good reason to predict the effect will be in a specific direction

What is a two tailed hypothesis?

Study These Flashcards

when you are interested in both tails of the graph
when we believe our sample is different from the general population but we don’t know in which direction
used when making a non-directional prediction
two tailed hypotheses are more conservative as you need a more extreme sample mean to be able to reject the hypothesis (there needs to be 2.5% on either side of the tail as opposed to 5% on one side of the tail)

How does does step 3 of z two tailed z-test work?

Study These Flashcards

Work out the z-score for our sample means
Work out the probability
Then double the probability but not the z score to work out the area either side of the tail

What is the key difference when working out the conditional probability for a two tailed z-test?

Study These Flashcards

You need to double the probability to get it on both sides.

what is a type 1 error?

when you reject the null hypothesis when it is true

What is a type 2 error?

when you fail to reject the null hypothesis when it is false

When do type 1 errors and type 2 errors occur?

- Type 1 errors occur because even if your p-value is small there is still a (small) chance that your data were unusually extreme (and so you rejected the NULL) just due to sampling error - Type II errors often arise because of a problem with your study  Perhaps your sample was biased  Perhaps there was an error in your experimental task  Perhaps your sample size was too small

Why do we use a value of a=0.05?

It balances out the probability of making type 1 vs type 2 errors (you make more type 1 errors if a is larger because it would be too easy to reject the null hypothesis and you make more type 2 errors is very large because it would be too hard to reject the null hypothesis)

Null Hypothesis significance testing Flashcards

(28 cards)