2nd Stats Exam MCQ Flashcards
(137 cards)
What is the violation of the assumption of independence? (Multi Level Modelling)
When one data point is dependent on another data point (one data point gives info to another data point)
Linear regressions assume independence, so when this assumption is violated you use a multilevel model (which, like logistic regression, falls under the generalized linear model)
What does the General Linear Model change to once you start using multi-level modelling?
It becomes part of the GENERALIZED Linear Model
What is the difference in calculation between independent and paired samples t test?
The way SE/variance is calculated
Independent samples - when the SE is calculated, the variance of both groups is incorporated into the calculation = pooled SE
Paired samples - whatever variance occurs at time 1 also occurs at time 2, e.g. individual differences (hunger, mood, time…)
Therefore you do not need to count the variance at both time points. If you did, it would be double counting and would give drastically incorrect p values
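For reference (standard textbook formulas, not taken from the course materials), the two standard errors are built differently:

$$ SE_{\text{pooled}} = \sqrt{s_p^2\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}, \qquad s_p^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2} $$

$$ SE_{\text{paired}} = \frac{s_D}{\sqrt{n}}, \quad \text{where } D_i = \text{score}_{i,\,\text{time 2}} - \text{score}_{i,\,\text{time 1}} $$

The paired version works on difference scores, so stable individual differences cancel out instead of being counted twice.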
Just like with Binary logistic regression, we assess mixed effects models using what?
Hint: similar to SSE reduced
-2 Log Likelihood
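For reference, this is just the model's log-likelihood multiplied by -2; like SSE, smaller values mean the model sits closer to the data:

$$ -2LL = -2\ln\big(\mathcal{L}(\hat{\theta})\big) $$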
What do we find out from the Estimates of Covariance Parameter box? Specifically, the intercept variance box?
A p value either under .05 (signif) or over .05 (nonsignif)
If the p value is signif it tells us there is SIGNIF VARIANCE in intercepts and hence we were right to conduct a MLM
What is the new value introduced in Binary Logistic R, used in MLM? and what is it equivalent to?
Wald statistic - equivalent to a t score
Calculation:
1) estimate (b value) / SE (found in the Estimates of Covariance Parameters box)
2) then SQUARE the result
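A quick worked sketch with made-up numbers (not from the course materials): if b = 0.50 and SE = 0.20,

$$ \text{Wald} = \left(\frac{b}{SE}\right)^2 = \left(\frac{0.50}{0.20}\right)^2 = 2.5^2 = 6.25 $$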
What does the ‘Empirical Best Linear Unbiased Predictions’ box show us?
Each ppt's u0 estimate - the difference between their most ideal 'intercept' and b0 (the deviation of their ideal intercept from b0)
(Later on, in a model with random intercepts AND random slopes, it will also include each ppt's u1 estimate.)
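In symbols (just restating the card): participant j's 'ideal' intercept is b0 plus their own deviation, and the box reports that deviation:

$$ \text{intercept}_j = b_0 + u_{0j} \quad\Rightarrow\quad u_{0j} = \text{intercept}_j - b_0 $$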
Dependency between data points can also come in the form of what?
Clustering (a general problem) - when a subset of ppts are more connected with each other
Whether clustering exaggerates in favour of our hypothesis, or against it, dependency will always distort our analysis from reality
If 2 related ppts are both in the post-experiment group, what happens? (CLUSTERING)
The 1st ppt's score increases after the experiment, which causes the 2nd ppt's score to increase further through talking to the 1st ppt
So, the experimental group scores are exaggerated
If there are 2 related ppts in study and one is in the control group and one is in the experimental group, what happens? (CLUSTERING)
The 1st ppt's score will increase after the experiment > this leads to an increase in the 2nd ppt's score in the control group through talking to the 1st ppt
This makes the exp group look less effective than it is, because scores in both groups look similar (not much difference to compare)
If 2 related ppts are both in the control group, what happens? (CLUSTERING)
An event outside study could bring the 1st ppts mood up or down, thus affecting the 2nd ppts mood too
This brings both control scores up (exp looks worse) or down (exp looks better)
Either way, the scores are not based on the exp itself!
What is hierarchical clustering? An example?
When data is naturally grouped, at multiple levels
- In schools for example, children within classes talk to each other and children within schools too (but less so)
How do we deal with (hierarchical) clustering?
Modelling the dependencies - include the dependencies (both class and school) in our model as variables (a rough sketch of this follows below)
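A minimal sketch of what this can look like outside SPSS, using simulated data and statsmodels (illustrative only; the column names and numbers are invented, not from the course): a random intercept per school plus a variance component for classes nested within schools.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Simulated, purely illustrative data: 20 schools x 3 classes x 10 children.
rng = np.random.default_rng(0)
rows = []
for school in range(20):
    school_effect = rng.normal(0, 2)        # schools differ from each other
    for classroom in range(3):
        class_effect = rng.normal(0, 1)     # classes within a school differ too (but less so)
        for child in range(10):
            treatment = int(rng.integers(0, 2))
            score = 50 + 3 * treatment + school_effect + class_effect + rng.normal(0, 5)
            rows.append({"score": score, "treatment": treatment,
                         "school": school, "classroom": classroom})
df = pd.DataFrame(rows)

# Random intercept for each school, plus a variance component for classrooms
# nested within schools - i.e. the dependencies are modelled rather than ignored.
model = smf.mixedlm("score ~ treatment", data=df, groups="school",
                    vc_formula={"classroom": "0 + C(classroom)"})
result = model.fit(reml=False)
print(result.summary())
```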
When looking at if there is an effect of cosmetic surgery on quality of life, where patients are within clinics, how does clustering occur? (L7)
- Different clinics - one clinic in one area may be better than the others, so ppts from that clinic start with a higher baseline quality of life
- Different surgeons within clinics - good surgeons boost quality of life a lot more post-surgery in the exp group, compared to bad surgeons. A bad surgeon could do the surgery wrong and worsen quality of life!
Based on the cosmetic surgery example, what are the variables we need to consider? (L7)
- Quality of life
- Surgery
- Clinic
For the cosmetic surgery example, what is the formula for the first model where we do not specify random effects yet? (MLM L7)
NOTE: you can run an MLM the same way as a linear regression
Quality of life = b0 + b1*Surgery
one way of getting -2LL for Linear R
Also called a No random effects / fixed effects model
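A minimal sketch of this step outside SPSS, using simulated stand-in data (the numbers and the column names QoL, Surgery, Clinic are invented for illustration): an ordinary regression gives the baseline -2LL.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Simulated stand-in for the cosmetic surgery data: 10 clinics x 20 patients.
rng = np.random.default_rng(1)
rows = []
for clinic in range(10):
    clinic_effect = rng.normal(0, 3)        # clinics differ in baseline QoL
    for patient in range(20):
        surgery = int(rng.integers(0, 2))
        qol = 60 + 2 * surgery + clinic_effect + rng.normal(0, 5)
        rows.append({"QoL": qol, "Surgery": surgery, "Clinic": clinic})
surgery_df = pd.DataFrame(rows)

# No-random-effects (fixed effects) model: QoL = b0 + b1*Surgery.
fixed_model = smf.ols("QoL ~ Surgery", data=surgery_df).fit()
baseline_minus2ll = -2 * fixed_model.llf    # the baseline -2LL
print(round(baseline_minus2ll, 1))
```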
When does the MLM depart from Linear R?
When we start adding random intercepts
What variable is added to the original GLM formula when we add random intercepts?
u0 variable
the intercept b0 becomes (b0 + u0[variable the varying intercept is based on]), so the model is e.g. (b0 + u0) + b1*Time
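Written out in standard random-intercept notation (with i indexing people and j the grouping variable, e.g. clinic):

$$ Y_{ij} = (b_0 + u_{0j}) + b_1 X_{ij} + e_{ij} $$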
Based on the cosmetic surgery example, what variable is the varying intercept based on?
Clinic > each clinic now has its own intercept (u0) for quality of life
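Continuing the simulated sketch above (same surgery_df; a rough non-SPSS equivalent, not the course's exact SPSS steps), a random intercept per clinic is added by grouping on Clinic:

```python
import statsmodels.formula.api as smf

# Random intercept for each clinic: QoL = (b0 + u0_clinic) + b1*Surgery.
ri_model = smf.mixedlm("QoL ~ Surgery", data=surgery_df, groups="Clinic")
ri_result = ri_model.fit(reml=False)        # ML fit so the -2LL values are comparable
ri_minus2ll = -2 * ri_result.llf
print(ri_result.summary())                  # intercept variance appears as "Group Var"
print(ri_result.random_effects)             # each clinic's u0 deviation from b0
print(round(ri_minus2ll, 1))
```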
Why do we vary the intercepts based on a particular variable in MLM?
By changing the intercept, we are effectively allowing the DV (quality of life) to 'start' at a different point for each level of that variable (each clinic)
Aside from the p value given in the Estimates of Covariance Parameters box, how can we find whether DV (quality of life) varies between u0 variables (clinics)? (this can be a legitimate research q)
Once the variance is found what is the difference called?
Compare how much DV varies with random intercept to how much DV varies without random intercept (u0 variable)
-2LL of the previous model with no random intercepts (linear R model) minus -2LL of the current model with random intercepts
Difference = likelihood ratio (the change in -2LL)
How do you get a chi square p value in the chi square calculator?
The likelihood ratio and DF of 1
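The same calculation outside the online calculator, continuing the simulated sketch above (illustrative only, using the two -2LL values computed earlier): scipy turns the change in -2LL and df = 1 into a chi-square p value.

```python
from scipy.stats import chi2

# Likelihood ratio = baseline -2LL (no random intercepts) minus -2LL with random intercepts.
likelihood_ratio = baseline_minus2ll - ri_minus2ll
p_value = chi2.sf(likelihood_ratio, df=1)   # upper-tail chi-square probability, df = 1
print(round(likelihood_ratio, 2), p_value)
```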
Andy Field says that for testing random effects, the __1__ is much more accurate, so you should rely on this rather than __2__
- Likelihood ratio (diff in -2LL between the random intercepts model and the no random intercepts model (Lin R))
- Wald Z (in the covariance parameters box)
What in SPSS tells us whether there is in fact significant variance in intercepts between participants?
i.e. whether the 'best' intercepts for each participant are significantly different from each other (i.e. whether they needed to be 'random')?
'Estimates of Covariance Parameters' box
look at the p value