resampling statistics Flashcards

1
Q

Why do we use resampling techniques?

A

-Fewer assumptions = more accurate if assumptions aren’t met
-General = basic ideas can be modified and reused, no equations/tables to look up
-Retains power
-Thinking about tests = allows us to think about our data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why aren’t they very popular?

A

-They are new (1979)
-Assumed to be more complex
-Parametric stats are typically quite simple and do a good job
-Requires a computer and programming
-People don’t like having to think about their data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 2 types of resampling techniques?

A

-Permutation tests
-Bootstrap resampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are permutation tests?

A

-Compare groups and conditions (replacing t-tests)
-Shuffle data in accordance to conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are bootstrap resampling tests?

A

-Generate confidence intervals
-Make error bars
-Resample-with-replacement the values in sample
-Can look at the variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the main points of inferential statistics?

A

-Whether the probability that the differences were caused by sampling error
-Resampling = measure sampling error by repeating sample process a lot of times

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What would the process be if it was between subjects design?

A

-Run experiment multiple times
-Check the range of values that typically occur
-Shuffle the values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What distribution is created by shuffling?

A

-Null distribution
-Distribution of expected experimental results if the null was true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Give a summary of the process of using between-subjects test

A

-Repeat experiment large num. of times
-Force nut hypothesis to be true
-Check how extreme the real value was
-No equation
-No tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do we look at generalisation within between subjects?

A

-If the hypothesis is that the groups differ in diversity (SD) rather than the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does shuffling change for within subjects design?

A

-The values are shuffled for each subject rather than the entire data set
-Randomise the sign of of difference for each pair

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do we manipulate the number of ppts?

A

-Sample size for resamples has to be the same as the original data
-Variance in mean differences = reflects the num. of subjects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe bootstrap resamples

A

-Used to calculate confidence intervals e.g. CI of mean and SE of mean
-Can determine whether a test value is inside or outside the confidence interval
-Resample with replacement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is meant by resample with replacement?

A

-The piece of data can be used once, more than once or none at all

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is SEM?

A

-Standard deviation of the means of all possible samples
-Estimated from SD of bootstrap means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do we calculate a confidence interval?

A

-95% confidence interval from bootstrap represents range of values that 95% of the means take
-Order them and cut off the highest and lowest 2.5%

17
Q

How do we link one sample t-test to bootstrapping?

A

-Count how often a mean of 100 or less occurs within our bootstrap population
-Order data and find the values that are less than or equal to 100

18
Q

What is bootstrapping with a model fit?

A

-Very simple model
-Generalises more easily

19
Q

What are the advantages of bootstrapping?

A

-General method = any model can be used and any CI can be estimated
-Used to perform hypothesis testing (using one-sample t-test)
-No assumptions
-No tables or equations

20
Q

What are the 2 other resample approaches?

A

-Jack-knife
-Monte-Carlo method

21
Q

What is the Jack-knife approach?

A

-Similar to bootstrap
-Resampling done by selecting all data except one

22
Q

What is the Monte-Carlo approach?

A

-Create data from model simulations
-Compare to real data

23
Q

What are the issues and concerns around resampling?

A

-Not an exact number of resampling that you have to generate, can be between 1000 and 10000 depending on accuracy of p