Resampling Statistics Flashcards

Question 1

Q

Why do we use resampling techniques?

Answer

A

-Fewer assumptions = more accurate if assumptions aren’t met
-General = basic ideas can be modified and reused, no equations/tables to look up
-Retains power
-Thinking about tests = allows us to think about our data

Question 2

Q

Why aren’t they very popular?

Answer

A

-They are relatively new (the bootstrap dates from 1979)
-Assumed to be more complex
-Parametric stats are typically quite simple and do a good job
-Requires a computer and programming
-People don’t like having to think about their data

Question 3

Q

What are the 2 types of resampling techniques?

Answer

A

-Permutation tests
-Bootstrap resampling

Question 4

Q

What are permutation tests?

Answer

A

-Compare groups and conditions (replacing t-tests)
-Shuffle the data in accordance with the conditions

Question 5

Q

What are bootstrap resampling tests?

Answer

A

-Generate confidence intervals
-Make error bars
-Resample-with-replacement the values in sample
-Can look at the variability

Question 6

Q

What are the main points of inferential statistics?

Answer

A

-The probability that the observed differences were caused by sampling error
-Resampling = measure sampling error by repeating the sampling process many times

Question 7

Q

What would the process be if it was between subjects design?

Answer

A

-Run experiment multiple times
-Check the range of values that typically occur
-Shuffle the values

Question 8

Q

What distribution is created by shuffling?

Answer

A

-Null distribution
-Distribution of expected experimental results if the null was true

Question 9

Q

Give a summary of the process of using between-subjects test

Answer

A

-Repeat experiment large num. of times
-Force null hypothesis to be true
-Check how extreme the real value was
-No equation
-No tables
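
The steps above can be sketched in Python using only the standard library. The function name `permutation_test`, the made-up scores and the choice of 5000 shuffles are illustrative assumptions, not part of the original cards.

```python
import random
from statistics import mean

def permutation_test(group_a, group_b, n_shuffles=5000, seed=0):
    """Two-sided permutation test on the difference between group means."""
    rng = random.Random(seed)      # fixed seed so the sketch is repeatable
    observed = mean(group_a) - mean(group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    null_diffs = []
    for _ in range(n_shuffles):
        rng.shuffle(pooled)        # shuffling forces the null hypothesis to be true
        null_diffs.append(mean(pooled[:n_a]) - mean(pooled[n_a:]))
    # p = proportion of shuffled differences at least as extreme as the real one
    extreme = sum(abs(d) >= abs(observed) for d in null_diffs)
    return observed, extreme / n_shuffles

# Made-up scores for two independent groups (assumption for illustration)
control = [12, 15, 11, 14, 13, 16, 12, 15]
treatment = [17, 19, 16, 20, 18, 17, 21, 19]
observed_diff, p_value = permutation_test(control, treatment)
print(observed_diff, p_value)
```

The list `null_diffs` is the null distribution; the p-value is read directly from it, with no equation or table.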

Question 10

Q

How does the between-subjects test generalise?

Answer

A

-Use the same shuffling procedure with a different test statistic, e.g. if the hypothesis is that the groups differ in diversity (SD) rather than the mean
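
As a rough sketch of this generalisation, the shuffling loop can take the test statistic as an argument, so the SD can be swapped in for the mean. The function name `permutation_test_stat` and the data are hypothetical, and both groups are given similar means so that only the spread differs.

```python
import random
from statistics import stdev

def permutation_test_stat(group_a, group_b, statistic, n_shuffles=2000, seed=1):
    """Permutation test on the difference in any statistic between two groups."""
    rng = random.Random(seed)
    observed = statistic(group_a) - statistic(group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    count = 0
    for _ in range(n_shuffles):
        rng.shuffle(pooled)
        diff = statistic(pooled[:n_a]) - statistic(pooled[n_a:])
        if abs(diff) >= abs(observed):
            count += 1
    return observed, count / n_shuffles

# Made-up groups with similar means but very different spread
narrow = [50, 51, 49, 50, 52, 48, 50, 51]
wide = [40, 60, 35, 65, 45, 55, 30, 70]
sd_diff, p_sd = permutation_test_stat(narrow, wide, stdev)
print(sd_diff, p_sd)
```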

Question 11

Q

How does shuffling change for within subjects design?

Answer

A

-The values are shuffled for each subject rather than the entire data set
-Randomise the sign of the difference for each pair
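
A minimal sketch of the within-subjects version, assuming paired scores from the same participants. Swapping the two values within a subject is the same as flipping the sign of that subject's difference. The name `sign_flip_test` and the data are illustrative.

```python
import random
from statistics import mean

def sign_flip_test(cond_1, cond_2, n_shuffles=5000, seed=2):
    """Paired permutation test: randomly flip the sign of each pair's difference."""
    rng = random.Random(seed)
    diffs = [a - b for a, b in zip(cond_1, cond_2)]
    observed = mean(diffs)
    count = 0
    for _ in range(n_shuffles):
        # each subject's difference keeps its size but gets a random sign
        flipped = [d * rng.choice((1, -1)) for d in diffs]
        if abs(mean(flipped)) >= abs(observed):
            count += 1
    return observed, count / n_shuffles

# Made-up scores for the same eight participants in two conditions
before = [10, 12, 9, 14, 11, 13, 10, 12]
after = [12, 15, 10, 17, 13, 14, 13, 15]
mean_diff, p_paired = sign_flip_test(before, after)
print(mean_diff, p_paired)
```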

Question 12

Q

How do we manipulate the number of participants?

Answer

A

-Sample size for resamples has to be the same as the original data
-Variance in mean differences = reflects the num. of subjects

Question 13

Q

Describe bootstrap resamples

Answer

A

-Used to calculate confidence intervals e.g. CI of mean and SE of mean
-Can determine whether a test value is inside or outside the confidence interval
-Resample with replacement

Question 14

Q

What is meant by resample with replacement?

Answer

A

-Each data point can be used once, more than once or not at all
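
For illustration, one resample with replacement can be drawn with `random.choices` from the Python standard library. The sample values are made up.

```python
import random

rng = random.Random(3)             # fixed seed so the sketch is repeatable
sample = [4, 8, 15, 16, 23, 42]

# random.choices draws WITH replacement; k keeps the resample the same size
resample = rng.choices(sample, k=len(sample))

# how many times each original value ended up in the resample (0, 1, 2, ...)
counts = {value: resample.count(value) for value in sample}
print(resample)
print(counts)
```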

Question 15

Q

What is the SEM (standard error of the mean)?

Answer

A

-Standard deviation of the means of all possible samples
-Estimated from SD of bootstrap means
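
A small sketch of estimating the SEM from bootstrap means, compared with the usual formula SD / sqrt(n). The helper `bootstrap_means`, the scores and the 2000 resamples are assumptions for illustration.

```python
import random
from statistics import mean, stdev

def bootstrap_means(data, n_boot=2000, seed=4):
    """Means of n_boot resamples, each the same size as the original data."""
    rng = random.Random(seed)
    n = len(data)
    return [mean(rng.choices(data, k=n)) for _ in range(n_boot)]

scores = [98, 105, 110, 93, 102, 99, 107, 111, 96, 104]   # made-up data
boot_means = bootstrap_means(scores)

sem_bootstrap = stdev(boot_means)                 # SD of the bootstrap means
sem_formula = stdev(scores) / len(scores) ** 0.5  # textbook SEM for comparison
print(sem_bootstrap, sem_formula)
```

The two values should be close but not identical, since the bootstrap estimate depends on the random resamples.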

Question 16

Q

How do we calculate a confidence interval?

Answer

A

-95% confidence interval from bootstrap represents range of values that 95% of the means take
-Order them and cut off the highest and lowest 2.5%
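
The ordering-and-cutting step can be sketched as a percentile interval. The function name `bootstrap_ci`, the scores and the 2000 resamples are illustrative assumptions.

```python
import random
from statistics import mean

def bootstrap_ci(data, n_boot=2000, level=0.95, seed=5):
    """Percentile bootstrap confidence interval for the mean."""
    rng = random.Random(seed)
    n = len(data)
    means = sorted(mean(rng.choices(data, k=n)) for _ in range(n_boot))
    # number of bootstrap means to cut from each tail (2.5% each for 95%)
    cut = round(n_boot * (1 - level) / 2)
    return means[cut], means[-cut - 1]

scores = [98, 105, 110, 93, 102, 99, 107, 111, 96, 104]   # made-up data
ci_low, ci_high = bootstrap_ci(scores)
print(ci_low, ci_high)
```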

Question 17

Q

How do we link one sample t-test to bootstrapping?

Answer

A

-Count how often a mean at or below the test value (e.g. 100) occurs within our bootstrap population
-Order the bootstrap means and find those that are less than or equal to the test value
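
A minimal sketch of this counting step, assuming a test value of 100 and made-up scores. The proportion of bootstrap means at or below the test value plays the role of a one-tailed p-value.

```python
import random
from statistics import mean

rng = random.Random(6)
scores = [104, 109, 98, 112, 107, 101, 115, 103, 110, 106]   # made-up data
test_value = 100
n_boot = 2000

# bootstrap population of means, each resample the same size as the data
boot_means = [mean(rng.choices(scores, k=len(scores))) for _ in range(n_boot)]

# proportion of bootstrap means that are less than or equal to the test value
p_one_tailed = sum(m <= test_value for m in boot_means) / n_boot
print(p_one_tailed)
```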

Question 18

Q

What is bootstrapping with a model fit?

Answer

A

-Very simple model
-Generalises more easily
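
The card does not say which model is meant, so the following is only an assumed example: a straight-line fit, where the (x, y) pairs are resampled with replacement and the line is refitted each time to get a confidence interval for the slope. The helper `fit_slope` and the data are hypothetical.

```python
import random

def fit_slope(pairs):
    """Least-squares slope of y on x for a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    return sxy / sxx

rng = random.Random(7)
data = [(1, 2.1), (2, 3.9), (3, 6.2), (4, 7.8),
        (5, 10.1), (6, 12.2), (7, 13.8), (8, 16.1)]   # made-up data

slopes = []
while len(slopes) < 1000:
    resample = rng.choices(data, k=len(data))   # resample whole (x, y) pairs
    if len({x for x, _ in resample}) > 1:       # a line needs two distinct x values
        slopes.append(fit_slope(resample))

slopes.sort()
slope_ci = (slopes[25], slopes[-26])            # cut 2.5% of 1000 from each tail
print(fit_slope(data), slope_ci)
```

The same loop would work for any other fitted parameter by replacing `fit_slope`.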

Question 19

Q

What are the advantages of bootstrapping?

Answer

A

-General method = any model can be used and any CI can be estimated
-Can be used for hypothesis testing (in place of a one-sample t-test)
-Few distributional assumptions (e.g. no normality requirement)
-No tables or equations

Question 20

Q

What are the 2 other resample approaches?

Answer

A

-Jack-knife
-Monte-Carlo method

Question 21

Q

What is the Jack-knife approach?

Answer

A

-Similar to bootstrap
-Resampling done by selecting all data except one
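
A short sketch of the leave-one-out idea, here used to estimate the standard error of the mean. The function name `jackknife_se_of_mean` and the scores are illustrative. For the mean, the jackknife standard error matches SD / sqrt(n).

```python
from statistics import mean, stdev

def jackknife_se_of_mean(data):
    """Leave-one-out means and the jackknife standard error of the mean."""
    data = list(data)
    n = len(data)
    # each resample is the full data set with exactly one value left out
    loo_means = [mean(data[:i] + data[i + 1:]) for i in range(n)]
    centre = mean(loo_means)
    variance = (n - 1) / n * sum((m - centre) ** 2 for m in loo_means)
    return loo_means, variance ** 0.5

scores = [98, 105, 110, 93, 102, 99, 107, 111, 96, 104]   # made-up data
loo_means, se_jack = jackknife_se_of_mean(scores)
se_formula = stdev(scores) / len(scores) ** 0.5
print(se_jack, se_formula)
```

Unlike the bootstrap, there is no randomness here: a sample of n values always gives exactly n jackknife resamples.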

Question 22

Q

What is the Monte-Carlo approach?

Answer

A

-Create data from model simulations
-Compare to real data
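
As an assumed example of this idea, the model below is a fair coin and the "real" result is 16 heads in 20 flips (made-up numbers). Many data sets are simulated from the model, and the real result is compared against them.

```python
import random

rng = random.Random(8)
n_flips = 20
observed_heads = 16      # made-up "real" result
n_sims = 5000

# simulate the whole experiment many times under the model (fair coin)
sim_heads = [sum(rng.random() < 0.5 for _ in range(n_flips))
             for _ in range(n_sims)]

# how often does the model produce a result at least as extreme as the real one?
p_mc = sum(h >= observed_heads for h in sim_heads) / n_sims
print(p_mc)
```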

Question 23

Q

What are the issues and concerns around resampling?

Answer

A

-There is no exact number of resamples that you have to generate; it can be between 1000 and 10000 depending on the required accuracy of p
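
One way to see how the number of resamples affects the accuracy of p is the binomial standard error of an estimated proportion, sqrt(p(1 - p) / N). This treats each resample as an independent draw, which is an approximation added here for illustration rather than something stated on the card.

```python
from math import sqrt

def p_value_standard_error(p, n_resamples):
    """Approximate standard error of a p-value estimated from n_resamples."""
    return sqrt(p * (1 - p) / n_resamples)

for n in (1000, 10000):
    print(n, round(p_value_standard_error(0.05, n), 4))
# prints:
# 1000 0.0069
# 10000 0.0022
```

With 1000 resamples a true p of 0.05 is estimated to within roughly 0.007, so values near a decision threshold may need the larger number of resamples.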