Research methods Flashcards
The misuse of NHST (Null Hypothesis Significance Testing)
- The American Statistical Association (2016) outlined principles on the misuse of p values in significance testing
- P-values do not measure the probability that the results arose by chance, or the probability that a specific hypothesis is true
- Statistical significance is not the same as practical importance
- The p-value alone is not a good measure of evidence regarding model or hypothesis
Type 1 and Type 2 errors
- Type 1 = incorrectly rejecting the null hypothesis when it is true (a false positive)
- Type 2 = incorrectly retaining the null hypothesis when it is false (a false negative)
Power
- The probability of finding an effect assuming one exists in the population
- Calculated as 1 − β
- β is the probability of failing to find the effect, i.e. of making a Type 2 error (Cohen suggests β = 0.2, giving power of 0.8)
What affects power? 3 factors
- Effect size: an objective and standardised measure of the magnitude of an effect (larger value = bigger effect). Which measure is used depends on the test conducted – Cohen's d (t-tests), Pearson's r (correlation), partial eta squared (ANOVA)
- Number of participants: more participants = more 'signal', less 'noise'. You should choose sample size depending on the expected effect size (larger effect size = fewer participants needed, smaller effect size = more participants needed)
- Alpha level: the probability of making a Type 1 error. We compare our p-value to this criterion when testing significance
- Other factors: variability, design, test choice
Problems with alpha testing
- If we run multiple tests, this will increase the rate at which we might get a type 1 error (family wise experimental error rate)
- We can account for this by limiting the number of tests or by using corrections such as the Bonferroni correction (but this reduces statistical power)
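As a sketch of the arithmetic (with made-up p-values), the family-wise error rate for a family of five tests and the Bonferroni-corrected criterion can be computed directly:

```python
# Sketch: family-wise Type 1 error rate for m = 5 tests, and the
# Bonferroni correction that controls it (p-values are made up).
alpha = 0.05
m = 5  # number of tests in the family

# Probability of at least one Type 1 error if each test uses alpha = .05
familywise_error = 1 - (1 - alpha) ** m
print(round(familywise_error, 3))  # 0.226 - far above .05

# Bonferroni: compare each p-value against alpha / m instead
bonferroni_alpha = alpha / m  # 0.01
p_values = [0.004, 0.03, 0.012, 0.2, 0.008]  # hypothetical results
significant = [p < bonferroni_alpha for p in p_values]
print(significant)  # [True, False, False, False, True]
```

Note the cost mentioned above: 0.03 and 0.012 would have been significant at the uncorrected .05 level, illustrating the loss of power.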
What is the difference between one and two-tailed tests
- One-tailed- we hypothesise there will be a difference in scores, and we’re specific about which score will be higher (α=.05 at one end)
- Two-tailed- We hypothesise there will be a difference in scores, but this could be in either direction (α= .025 at both ends)
- For a one-tailed test, our p-value is half of the two-tailed p-value
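A minimal sketch of the halving relationship with made-up scores, using the `alternative` parameter of `scipy.stats.ttest_ind`:

```python
# Sketch: the same data tested two-tailed and one-tailed. When the
# effect lies in the predicted direction, the one-tailed p-value is
# exactly half the two-tailed p-value.
from scipy import stats

group_a = [5.1, 6.2, 5.8, 6.5, 5.9, 6.1]  # made-up scores
group_b = [4.8, 5.0, 4.6, 5.2, 4.9, 5.1]

two_tailed = stats.ttest_ind(group_a, group_b, alternative="two-sided")
one_tailed = stats.ttest_ind(group_a, group_b, alternative="greater")

print(two_tailed.pvalue, one_tailed.pvalue)
print(one_tailed.pvalue == two_tailed.pvalue / 2)  # True
```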
Which type of test do I run?
- One-tailed tests are more powerful because the whole of α sits in one tail
- However, there are several caveats and considerations, so in most cases it is recommended that you run a two-tailed test
Power and study design:
- Within-subjects studies are more powerful than between-subjects studies
- Example: to run a t-test with a two-tailed design, medium effect size, α level of 0.05 and power level of 0.8, we can:
- 1) Calculate the power we have obtained in a study post-hoc
- 2) Calculate how many participants we need to collect for a study a priori (this can be done using statistical programs such as G*Power)
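The a priori calculation described above can also be sketched in Python with statsmodels instead of G*Power; here the "medium" effect size is taken to be Cohen's d = 0.5:

```python
# Sketch of an a priori power analysis: participants per group needed
# for an independent-samples t-test with d = 0.5, two-tailed alpha = .05,
# power = .8 (the conventional Cohen specification).
import math
from statsmodels.stats.power import TTestIndPower

n_per_group = TTestIndPower().solve_power(
    effect_size=0.5, alpha=0.05, power=0.8, alternative="two-sided"
)
print(math.ceil(n_per_group))  # 64 per group, as G*Power also reports
```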
What is analysis of variance?
- Analysis of variance (ANOVA) is an extension of the t-test
- It allows us to test whether three or more population means are the same in a single test, without the loss of power that multiple corrected t-tests would cause
Assumptions of ANOVA
- the scores were sampled randomly and are independent
- roughly normal distribution
- roughly equal number of participants in the groups
- roughly equal variance for each condition
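A hypothetical sketch of checking two of these assumptions on made-up data, using Shapiro–Wilk for normality and Levene's test for equality of variances:

```python
# Sketch: assumption checks before an ANOVA (scores are made up).
from scipy import stats

groups = [
    [4.1, 5.2, 4.8, 5.5, 4.9, 5.1],   # condition A
    [5.9, 6.3, 6.1, 5.7, 6.5, 6.0],   # condition B
    [5.0, 5.4, 4.7, 5.6, 5.2, 4.9],   # condition C
]

# Shapiro-Wilk: p < .05 would suggest a non-normal distribution
for g in groups:
    print(stats.shapiro(g).pvalue)

# Levene's test: p < .05 would suggest unequal variances
print(stats.levene(*groups).pvalue)
```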
The basis of the ANOVA test
- Analysis of variance is a way to compare multiple conditions in a single, powerful test
- It was invented by Fisher (so its test statistic is F)
- It compares the amount of variance explained by our experiment with the variance that is unexplained
Between-groups ANOVA
- The aim of ANOVA is to compare the ‘amount of variance explained by our experiment with the variance that is unexplained’
- For between-group designs:
- A) the explained variance is the variance between group
- B) the unexplained is the variance within a group
- Each of these variance estimates is a 'mean square' (MS); the within-group estimate is referred to as the MS error
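A minimal between-groups example with made-up scores for three groups, using `scipy.stats.f_oneway`:

```python
# Sketch: one-way between-groups ANOVA. F is the ratio of explained
# (between-group) variance to unexplained (within-group) variance.
from scipy import stats

low = [3.1, 2.8, 3.5, 3.0, 2.9]    # made-up scores for three groups
mid = [4.2, 4.5, 3.9, 4.1, 4.4]
high = [5.6, 5.9, 5.4, 6.1, 5.8]

result = stats.f_oneway(low, mid, high)
print(result.statistic, result.pvalue)  # large F, small p: means differ
```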
Degrees of freedom
- There are degrees of freedom associated with both variance values:
- A) degrees of freedom between conditions
- B) residual degrees of freedom
- ANOVA critical values require 2 d.f. values, one for each aspect of the variance
- We must report both
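A short sketch of the two d.f. values for a hypothetical between-groups design with k = 3 groups and N = 15 participants, and the critical F value they index:

```python
# Sketch: the two degrees-of-freedom values a between-groups ANOVA
# reports, and the critical F they look up (via scipy's F distribution).
from scipy import stats

k, N = 3, 15              # made-up design: 3 groups, 15 participants
df_between = k - 1        # d.f. between conditions
df_residual = N - k       # residual d.f.

critical_f = stats.f.ppf(1 - 0.05, df_between, df_residual)
print(df_between, df_residual, round(critical_f, 2))  # 2 12 3.89
```

An observed F(2, 12) larger than 3.89 would be significant at α = .05; both d.f. values are reported, e.g. F(2, 12) = 5.21.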
Pair-wise comparisons
- ANOVA tells us whether groups differ or not
- How do we know which particular conditions differ?
- Run the multiple comparisons (those we were trying to avoid)
- Some of these are ‘planned comparisons’, some are ‘post-hoc’
- Correct for multiple comparisons
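A hypothetical sketch of the follow-up step, running Bonferroni-corrected pairwise t-tests on made-up groups:

```python
# Sketch: pairwise comparisons after a significant ANOVA, each tested
# against a Bonferroni-corrected alpha of .05 / 3 (data are made up).
from itertools import combinations
from scipy import stats

groups = {
    "low":  [3.1, 2.8, 3.5, 3.0, 2.9],
    "mid":  [4.2, 4.5, 3.9, 4.1, 4.4],
    "high": [5.6, 5.9, 5.4, 6.1, 5.8],
}

pairs = list(combinations(groups, 2))   # 3 comparisons
corrected_alpha = 0.05 / len(pairs)

for name_a, name_b in pairs:
    p = stats.ttest_ind(groups[name_a], groups[name_b]).pvalue
    print(f"{name_a} vs {name_b}: p = {p:.5f}, "
          f"significant = {p < corrected_alpha}")
```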
Versions of ANOVA
- Analysis of variance (ANOVA) – one factor ANOVA and multifactor ANOVA
- Multivariate analysis of variance (MANOVA) – extension of ANOVA for multiple dependent variables
- Analysis of covariance (ANCOVA) – extension of ANOVA to control for continuous covariates (e.g. variables correlated with the DV)
What is ANOVA based on? (for between-groups)
A) the variance explained by the experiment (the effect)
B) the residual (remaining) variance that cannot be explained (noise)
- For between-group design, the variance comes from only two sources:
A) variance between groups (explained)
B) variance within groups (unexplained)
Between group vs repeated measures for ANOVA
- For repeated-measure design, there are three possible sources of variance:
A) variance between conditions
B) variance between subjects (individual differences)
C) residual (unexplained) variance
- In the between-group study, the variance between subjects fell under the category 'unexplained'
What are the F-ratio and MS unexplained formulas?
- F = MS explained / MS unexplained
- MS explained is the variance between conditions
- MS unexplained is the remaining variance after accounting for individual differences
- MS unexplained = MS total – MS explained – MS ind diffs (strictly, this subtraction is performed on the sums of squares; each MS is its sum of squares divided by its degrees of freedom)
- MS ind diffs is the variance between subjects within a condition
- MS total is the variance of all subjects in all conditions
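This partition can be sketched by hand on made-up repeated-measures data (4 subjects × 3 conditions); the subtraction is done on sums of squares, each MS being a sum of squares over its degrees of freedom:

```python
# Sketch: repeated-measures variance partition computed by hand.
# Rows are subjects, columns are conditions (scores are made up).
import numpy as np

scores = np.array([
    [4.0, 5.0, 6.0],
    [5.0, 6.0, 7.0],
    [3.0, 4.0, 6.0],
    [4.0, 6.0, 7.0],
])
n_subj, k = scores.shape
grand_mean = scores.mean()

# SS_total = SS_conditions (explained) + SS_subjects (ind diffs) + SS_residual
ss_total = ((scores - grand_mean) ** 2).sum()
ss_cond = n_subj * ((scores.mean(axis=0) - grand_mean) ** 2).sum()
ss_subj = k * ((scores.mean(axis=1) - grand_mean) ** 2).sum()
ss_resid = ss_total - ss_cond - ss_subj

ms_cond = ss_cond / (k - 1)                      # MS explained
ms_resid = ss_resid / ((k - 1) * (n_subj - 1))   # MS unexplained
print(round(ms_cond / ms_resid, 1))              # F-ratio: 45.0
```

Because the subject variance is removed from the unexplained term, this F is larger than a between-groups analysis of the same numbers would give, which is why within-subjects designs are more powerful.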
What is multi-factorial ANOVA?
- Like repeated-measures ANOVA
- Factors can all be within-subject, all between-group or a ‘mixed’ design
- We can have 'main' effects or a variety of 'interactions'
- Main effect = one of the factors (IVs) consistently affects the dependent variable (DV) in the same way
- Interaction = the effect of one factor depends on the level of another
What is 2x2 ANOVA?
- The multifactorial ANOVA is a single test
- It returns multiple F values (one for each main effect to be checked and one for the interaction)
- With only two levels per factor, there is no need for post-hoc tests
- So 2x2 is just a single test (no family-wise error)
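A hypothetical sketch of a 2x2 between-groups ANOVA in statsmodels (variable names and scores are made up); a single call returns an F for each main effect plus one for the interaction:

```python
# Sketch: 2x2 ANOVA via statsmodels - two factors (caffeine, sleep),
# one DV (score), four participants per cell, all data invented.
import pandas as pd
import statsmodels.formula.api as smf
from statsmodels.stats.anova import anova_lm

data = pd.DataFrame({
    "caffeine": ["yes"] * 4 + ["no"] * 4 + ["yes"] * 4 + ["no"] * 4,
    "sleep":    ["full"] * 8 + ["deprived"] * 8,
    "score":    [7, 8, 7, 9, 6, 7, 6, 7, 5, 6, 5, 6, 3, 4, 3, 4],
})

model = smf.ols("score ~ C(caffeine) * C(sleep)", data=data).fit()
# One table: F values for caffeine, sleep, and caffeine x sleep
print(anova_lm(model, typ=2))
```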
What are contingency tables?
- A table of frequencies showing how often an observation occurs in each category
- Categories must be mutually exclusive and exhaustive
What is the Chi-Square test?
- Devised by Karl Pearson in 1900, also known as Pearson’s chi-square
- Compares how often a particular observation falls into a category with how often it would be expected to by chance
- Null hypothesis = the frequencies observed were expected by chance
- Alternative hypothesis = the frequencies observed reflect real differences in categories
- Assumptions:
- 1. Independence: each person can only contribute to one cell of a contingency table
- 2. Expected frequencies: all expected counts should be greater than 1, and no more than 20% of expected counts should be less than 5
Violating expected frequencies, and how to address it
- Results in a loss of power
- How to address this:
- A) use an ‘Exact’ test instead
- B) remove data across one variable
- C) collapse levels of one variable
- D) collect more data
- E) accept the loss of power
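Option A can be sketched with `scipy.stats.fisher_exact` on a made-up 2x2 table whose counts would be too small for a chi-square:

```python
# Sketch: Fisher's exact test for a small-sample 2x2 contingency table
# (counts are invented). Exact tests do not rely on the expected-
# frequency assumption.
from scipy import stats

#                  improved  not improved
table = [[8, 2],          # treatment group
         [3, 7]]          # control group

odds_ratio, p = stats.fisher_exact(table)
print(odds_ratio, p)
```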
Chi-square by hand: one IV
- Calculate expected frequencies
- Calculate Chi-Square value based on observed and expected frequencies
- Compare Chi-Square value against a critical values table
- To interpret a table, we need to know our degrees of freedom, and our desired alpha value (degrees of freedom = number of categories – 1)
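The steps above can be sketched by hand and checked against scipy, using made-up frequencies (60 people choosing between three categories, so 20 expected per category by chance):

```python
# Sketch: one-IV (goodness-of-fit) chi-square by hand, then via scipy.
from scipy import stats

observed = [30, 18, 12]   # made-up observed frequencies
expected = [20, 20, 20]   # expected by chance: 60 people / 3 categories

# Step 1-2: chi-square = sum of (observed - expected)^2 / expected
chi_sq = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1    # categories - 1
print(round(chi_sq, 1), df)  # 8.4 2

# Step 3: scipy gives the same statistic plus the p-value directly
print(stats.chisquare(observed, f_exp=expected))
```

With df = 2 and α = .05 the critical value is 5.99, so 8.4 is significant.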