lecture 5 - inferential statistics Flashcards by Ellie Trevett

what is a statistical inference?

undertaking a statistical test to make an inference about data
two methods:
hypothesis testing (p values)
estimation (confidence intervals)

How well did you know this?

Not at all

Perfectly

why do we undertake research using statistics?

to test a hypothesis
are the results real?
do the results matter?

How well did you know this?

Not at all

Perfectly

what do we mean by estimation?

an estimator of a population parameter: a statistic (i.e. mean, t statistic)
an estimate of a population parameter: the value of the estimator for a particular sample

How well did you know this?

Not at all

Perfectly

what is a statistical hypothesis?

an assumption about a population parameter
- if sample data are not consistent with the statistical hypothesis, the hypothesis is rejected

How well did you know this?

Not at all

Perfectly

what are the two types of statistical hypotheses?

null
alternative

How well did you know this?

Not at all

Perfectly

what is the null hypothesis?

The null hypothesis, denoted by H0, is usually the hypothesis that sample observations result purely from chance.

How well did you know this?

Not at all

Perfectly

what is the alternative hypothesis?

The alternative hypothesis, denoted by H1or Ha, is the hypothesis that sample observations are influenced by some non-random cause.

How well did you know this?

Not at all

Perfectly

how to decide to accept or reject the null hypothesis?

Decision to accept or reject null hypothesis based on p value
<=0.05
P value α=0.05 (0.01 or 0.10)
Decided a priori

How well did you know this?

Not at all

Perfectly

what is the meaning of the p value?

The p-value for a hypothesis test is the probability of obtaining a value of the test statistic as or more extreme than the observed test statistic when the null hypothesis is true

How well did you know this?

Not at all

Perfectly

what does the p value give an indication of?

Reporting the p-value associated with a test gives an indication of how common or rare the computed value of the test statistic is, given that H0 is true

How well did you know this?

Not at all

Perfectly

how is the rejection region determined?

by α, the desired level of significance, or probability of committing a type I error or the probability of falsely rejecting the null

How well did you know this?

Not at all

Perfectly

why might results be uncertain?

Type 2 error- failing to reject the null hypothesis

Conclude no effect when there is one (False negative)

Small sample size can make this more likely

How well did you know this?

Not at all

Perfectly

what are the types of error?

Type I
Type II

How well did you know this?

Not at all

Perfectly

what is the probability of making a type I and II error?

The probability of making a Type I error is the significance level, or alpha (α), while the probability of making a Type II error is beta (β).

How well did you know this?

Not at all

Perfectly

what is a type I error (false positive)?

the test result says you have coronavirus, but you actually don’t.

How well did you know this?

Not at all

Perfectly

what is a type II error?

Study These Flashcards

the test result says you don’t have coronavirus, but you actually do.

is more data better?

Study These Flashcards

Collecting more data than is necessary is costly in many ways

You have the statistical power to pick up smaller and smaller differences and label them as ‘significant’

Therefore, as a researcher/scientist you have an obligation not only to look at significance but the difference between the groups in means that you are testing – e.g. is a difference of 0.1 seconds in mean running time scientifically/practically relevant even if it is statistically significant?

what does measuring difference or effect size mean?

Study These Flashcards

Allows us not only to identify significance but also the level/ size of the effect observed

The simplest one is mean differences e.g. difference in mean height = 0.1cm BUT scale dependent and disregards distribution shape!

Need a measure that can have a consistent scale e.g. standard deviation units

what is the convention of effect size?

Study These Flashcards

Brace et al. suggest calculating effect size for t-tests by looking at the difference between means and dividing it by the mean SD (z-score)
Advantage = converts mean difference into SD units
Disadvantage = works for t-tests only, but not ALL statistical tests

what are the suggested definitions by Cohen (1969) for effect size?

Study These Flashcards

For the behavioural sciences, Cohen’s 1969 work suggests the use of the following criterion for size of effect using d
Small effect = 0.2
Medium effect = 0.5
Large effect > 0.8

Note: other definitions exist

what is good power?

Study These Flashcards

Statistical convention says that 0.8 is a good value that minimizes both type I and type II errors.

Power 0.80 represents a 20% chance of making a type II error.

Before conducting your study should conduct a power calculation
University has G*Power

Involves estimating an effect size ahead of time.

what is power = to?

Study These Flashcards

1 - β

what are confidence intervals?

Study These Flashcards

Confidence intervals
Estimated range of values that is likely to contain the unknown population parameter

Conventionally 95%, also 90% or 99%

Width of these values describes uncertainty
Related to our effect size

what does the confidence level mean in context?

Study These Flashcards

If this included zero, no statistically significant difference between groups.

Very wide confidence interval may indicate a small sample.

We interpret an interval calculated at a 95% level as, we are 95% confident that the interval contains the true difference between the two population means in the wider population that the sample is drawn from.

how do we interpret a confidence interval?

a 95% confidence interval indicates that 19/20 samples (95%) from the same population will produce confidence intervals that contain the population parameter

what is the impact of sample size?

P value. does not tell you about effect size does not indicate strength of evidence Greater sample size will lead to greater precision. Does not indicate if the difference is important. Non significant P value does not mean no difference

what is the matrix for value/effect size

significant + large - IV had a strong reliable effect on DV significant + small - IV had a weak effect on FV inflated by a large sample size non-significant + large - IV had a strong reliable effect on DV but too low a sample size to detect it non-significant + small - IV had a weak effect on DV

lecture 5 - inferential statistics Flashcards

(27 cards)