Final: Ch 11-20 Flashcards

(173 cards)

1
Q

Numerical Variables from a Single Sample

When is Ȳ normally distributed?

A

whenever:
- Y is normally distributed, OR
- n is large

2
Q

Numerical Variables from a Single Sample

If Ȳ is normally distributed, what can we convert its distribution to?

A

standard normal distribution

3
Q

Numerical Variables from a Single Sample

What does a standard normal distribution do?

A

gives a probability distribution of the difference between a sample mean and the population mean

4
Q

Numerical Variables from a Single Sample

What is used to calculate the confidence interval of the mean?

A

t-distribution

5
Q

What does a one-sample t-test do?

A

compares the mean of a random sample from a normal population with the population mean proposed in a null hypothesis

6
Q

What are the hypotheses for a one-sample t-test?

A

H0: mean of the population is µ0
HA: mean of the population is not µ0

7
Q

What is the degrees of freedom for a one-sample t-test?

A

df = n-1
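As a sketch of how this works in practice, the t statistic and df for a one-sample t-test can be computed by hand and checked against scipy. The sample values and null mean below are invented for illustration:

```python
import math

from scipy import stats

# hypothetical sample and null-hypothesis mean (illustrative values only)
sample = [9.2, 10.5, 8.8, 11.1, 9.9, 10.3, 9.5, 10.8]
mu0 = 10.0
n = len(sample)

# t = (Ybar - mu0) / (s / sqrt(n)), with df = n - 1
ybar = sum(sample) / n
s = math.sqrt(sum((y - ybar) ** 2 for y in sample) / (n - 1))
t_manual = (ybar - mu0) / (s / math.sqrt(n))

result = stats.ttest_1samp(sample, popmean=mu0)
print(n - 1, t_manual, result.statistic)
```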

8
Q

What are the assumptions of a one-sample t-test? (2)

A
- variable is normally distributed
- sample is a random sample

9
Q

Tests that compare means have what type of variables?

A

one categorical and one numerical variable

10
Q

Paired vs. 2-sample t-tests

A

paired comparisons: allow us to account for a lot of extraneous variation

  • ie. before and after treatment
  • ie. upstream and downstream of power plant
  • ie. identical twins – one with treatment, one without treatment
  • ie. getting an earwig out of each ear – compare tweezers to hot oil

2-sample comparisons: sometimes easier to collect data for

11
Q

What are paired comparisons?

A

data from the two groups are paired

  • each member of pair shares much in common with the other, except for the tested categorical variable
  • there is one-to-one correspondence between the individuals in the two groups
  • in each pair, there is one member that has one treatment/group and another who has another treatment/group
12
Q

What do we use to compare two groups in paired comparisons?

A

use mean of the difference between the two members of each pair

13
Q

What is a paired t-test?

A

one sample t-test on the differences
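This equivalence is easy to verify numerically – a minimal sketch, with invented before/after values:

```python
from scipy import stats

# hypothetical before/after measurements for 6 paired individuals (invented)
before = [12.1, 14.3, 11.8, 13.5, 12.9, 14.0]
after = [12.8, 14.9, 12.0, 14.2, 13.1, 14.8]

diffs = [a - b for a, b in zip(after, before)]

paired = stats.ttest_rel(after, before)       # paired t-test
one_sample = stats.ttest_1samp(diffs, 0.0)    # one-sample t-test on differences

# both approaches give the same t statistic and P-value
print(paired.statistic, one_sample.statistic)
print(paired.pvalue, one_sample.pvalue)
```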

14
Q

What does a paired t-test do?

A

compares mean of the differences to a value given in null hypothesis

for each pair, calculate the difference

15
Q

What is the number of data points in a paired t-test?

A

number of pairs – NOT number of individuals

16
Q

What is the degrees of freedom for a paired t-test?

A

df = number of pairs - 1

17
Q

What are the assumptions of a paired t-test?

A
- pairs are chosen at random
- differences (NOT individuals) have normal distribution

18
Q

What does a 2-sample t-test do?

A

compares means of a numerical variable between two populations

19
Q

What is the degrees of freedom for a 2-sample t-test?

A
df1 = n1 - 1
df2 = n2 - 1
20
Q

What are the assumptions of a 2-sample t-test? (3)

A
  • both samples are random samples
  • both populations have normal distributions
  • variance of both populations is equal
21
Q

What does Welch’s t-test do?

A

compares means of two groups without requiring the assumption of equal variance

22
Q

What is different about the degrees of freedom for Welch’s t-test compared to other tests?

A

degrees of freedom is not necessarily an integer

23
Q

Wrong Way to Make Comparison of Two Groups

A

“Group 1 is significantly different from a constant, but Group 2 is not. Therefore Group 1 and Group 2 are different from each other.”

24
Q

What does Levene’s test do?

A

compares variances of two (or more) groups

use R to calculate

25
What does the F test do?
most commonly used test to compare variances
26
Why do we usually use Levene's test instead of F test?
F test is very sensitive to its assumption that both distributions are normal
27
What are the 2 tests that compare variances?
- Levene's test | - F test
28
What 2 tests can conduct two-sample comparisons?
2-sample t-test or Welch’s t-test
30
What does 2-sample t-test and Welch’s t-test both assume?
normally distributed variables
31
What assumption differs between 2-sample t-test and Welch’s t-test?
- 2-sample t-test assumes equal variance | - Welch’s t-test does NOT assume equal variance
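The practical consequence can be sketched with scipy's `ttest_ind`, which runs either test depending on `equal_var` (group values below are invented, with unequal spread and unequal n):

```python
from scipy import stats

# hypothetical groups with unequal spread and unequal n (illustrative values)
group1 = [5.1, 5.4, 4.9, 5.2, 5.0, 5.3]
group2 = [6.0, 7.5, 4.2, 8.1, 5.5, 6.9, 7.8, 4.9]

pooled = stats.ttest_ind(group1, group2, equal_var=True)   # 2-sample t-test
welch = stats.ttest_ind(group1, group2, equal_var=False)   # Welch's t-test

# the two tests give different P-values because Welch's does not
# pool the variances (and its df need not be an integer)
print(pooled.pvalue, welch.pvalue)
```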
32
What can you compare the means of two groups using? (2)
- mean of paired differences | - mean difference between two groups
33
What are the assumptions of all t-tests? (2)
- random sample(s) | - populations are normally distributed | - (for 2-sample t-test only) populations have equal variances
34
What are methods to detect deviations from normality? (4)
- previous data / theory | - histograms | - quantile plots | - Shapiro-Wilk test
35
What does normal data look like in a quantile plot?
points form an approximately straight line
36
What is the Shapiro-Wilk Test used for?
to test statistically whether a set of data comes from a normal distribution
37
What do you do when assumptions are not true? (5)
- if sample sizes are large, sometimes parametric tests work OK anyway | - transformations | - non-parametric tests | - permutation tests | - bootstrapping
38
Why do parametric tests on large samples work relatively well even for non-normal data?
means of large samples are normally distributed – rule of thumb: if n > ~50, then normal approximations may work
39
What parametric test is ideal when assumptions are not true?
Welch’s t-test – if sample sizes are equal and large, then even a 10x difference in variance is approximately OK, but Welch’s is still better
40
What are data transformations?
change each data point by some simple mathematical formula, then carry out the test on the transformed data
41
When is log transformation useful? (3)
- variable is likely to be the result of multiplication or division of various components | - frequency distribution of data is skewed right | - variance seems to increase as mean gets larger (in comparisons across groups)
42
What are some other types of transformations? (3)
- arcsine transformation | - square-root transformation | - reciprocal transformation
43
What are characteristics of valid transformations? (3)
- require same transformation be applied to each individual | - have one-to-one correspondence to original values | - have monotonic relationship with original values (ie. larger values stay larger)
44
What should you consider when choosing transformations? (3)
- must transform each individual in the same way | - transformed values must still carry biological meaning | - you CANNOT keep trying transformations until P < 0.05
45
What do non-parametric ("distribution-free") methods assume?
assume less about underlying distributions
46
What do parametric methods assume?
assume a distribution or a parameter
47
What are some non-parametric tests? (3)
- sign test | - RANKS | - Mann-Whitney U test
48
What does the sign test do?
compares data from one sample to a constant
49
How is a sign test conducted?
- for each data point, record whether individual is above (+) or below (–) hypothesized constant | - use binomial test to compare result to ½
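The procedure can be sketched with scipy's binomial test (data values and the hypothesized constant below are invented for illustration):

```python
from scipy import stats

# hypothetical data compared to a hypothesized constant of 50 (invented)
data = [52, 48, 55, 60, 49, 57, 61, 53, 47, 58]
constant = 50

above = sum(1 for x in data if x > constant)
below = sum(1 for x in data if x < constant)  # ties would be dropped

# binomial test of the number of '+' signs against p = 1/2
result = stats.binomtest(above, n=above + below, p=0.5)
print(above, below, result.pvalue)
```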
50
Does sign test have high or low power?
has very low power – therefore it is likely to NOT reject a false null hypothesis
51
What does it mean for a test to have high power?
more power → more information → higher ability to reject false null hypothesis
52
What is RANKS?
used by most non-parametric methods – rank each data point in all samples from lowest to highest (ie. lowest data point gets rank 1, next lowest gets rank 2, …)
53
What does the Mann-Whitney U test do?
compares central tendencies of two groups using ranks (equivalent to Wilcoxon rank sum test)
54
How is a Mann-Whitney U Test conducted?
1. rank all individuals from both groups together in order (for example, smallest to largest)
2. sum the ranks for all individuals in each group → R1 and R2
3. calculate U1: number of times an individual from population 1 has lower rank than an individual from population 2, out of all pairwise comparisons
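The pairwise-comparison count can be checked against scipy's implementation – a minimal sketch with invented, tie-free samples (note scipy's reported U counts the pairs in the opposite direction, so the two counts sum to n1 × n2):

```python
from scipy import stats

# hypothetical samples with no ties (illustrative values)
pop1 = [1.1, 2.3, 2.9, 4.0]
pop2 = [3.1, 3.8, 5.2, 6.0, 7.4]

# out of all n1 x n2 pairwise comparisons, count how often a
# population-1 individual ranks below a population-2 individual
u_lower = sum(1 for a in pop1 for b in pop2 if a < b)

res = stats.mannwhitneyu(pop1, pop2, alternative="two-sided")
print(u_lower, res.statistic, res.pvalue)
```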
55
What are the assumptions of the Mann-Whitney U Test? (2)
- both samples are random samples | - both populations have the same shape of distribution – only necessary when using Mann-Whitney to compare means
56
What is a permutation test used for?
for hypothesis testing on measures of association – can be done for any test of association between two variables
57
How is a permutation test conducted?
1. variable 1 from an individual is paired with variable 2 data from a randomly chosen individual – this is done for all individuals
2. estimate is made on randomized data
3. whole process is repeated numerous times – distribution of randomized estimates is null distribution
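These steps can be sketched with numpy, here using the correlation coefficient as the measure of association (the paired measurements are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# hypothetical paired measurements (illustrative values)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0])
y = np.array([2.1, 2.9, 4.2, 4.8, 6.1, 6.8, 8.3, 8.9])

observed = np.corrcoef(x, y)[0, 1]

# shuffle y relative to x (without replacement, so each data point is
# used exactly once per permuted data set) to build the null distribution
null = np.array([np.corrcoef(x, rng.permutation(y))[0, 1]
                 for _ in range(2000)])

# two-sided P-value: fraction of permuted estimates at least as extreme
p = np.mean(np.abs(null) >= abs(observed))
print(observed, p)
```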
58
What does it mean if permutation tests are done without replacement?
all data points are used exactly once in each permuted data set
59
What are the goals of experiments? (2)
- eliminate bias | - reduce sampling error (increase precision and power)
60
What are some design features that reduce bias? (3)
- controls | - random assignment to treatments | - blinding
61
What is a control?
group which is identical to the experimental treatment in all respects aside from the treatment itself
62
What is random assignment?
individuals are randomly assigned to treatments
63
How does random assignment reduce bias?
averages out effects of confounding variables
64
What is blinding?
preventing the experimenter (or patient) from knowing which treatment is given to whom
65
How do the results of unblinded studies compare to blinded studies?
unblinded studies usually find much larger effects (sometimes 3x higher) – shows the bias that results from lack of blinding
66
How can you reduce sampling error?
increase signal-to-noise ratio – if ‘noise’ is smaller, it is easier to detect a given ‘signal’; can be achieved with smaller s or larger n
67
What are some design features that reduce the effects of sampling error? (4)
- replication | - balance | - blocking | - extreme treatments
68
What is replication?
carry out study on multiple independent objects
69
What is balance?
nearly equal sample sizes in each treatment
70
What is blocking?
grouping of experimental units – within each group, different experimental treatments are applied to different units
71
How do extreme treatments reduce effects of sampling error?
stronger treatments can increase the signal-to-noise ratio
72
How does balance reduce effects of sampling error?
increases precision – for a given total sample size (n1 + n2), standard error is smallest when n1 = n2
73
How does blocking reduce effects of sampling error?
allows extraneous variation to be accounted for – it is therefore easier to see the signal through the remaining noise
74
Blocking
75
What does ANOVA (analysis of variance) do?
compares means of more than two groups – asks whether any of the means is different from any other (is the variance among groups greater than 0?)
76
How does ANOVA compare to a t-test?
like t-test, but can compare more than two groups
78
What are the hypotheses for ANOVA?
H0: all populations have equal means (variance among groups = 0)
HA: at least one population mean is different
79
What is ANOVA with 2 groups mathematically equivalent to?
two-tailed 2-sample t-test
80
In ANOVA, under the null hypothesis, why should the sample mean of each group vary?
because of sampling error
81
In ANOVA, what is the standard error?
standard deviation of sample means (when true mean is constant)
82
In ANOVA, if null hypothesis is not true, what should variance among groups be?
variance among groups should be equal to variance due to sampling error plus real variance among population means – under the null hypothesis, variance among sample means is captured by the standard error alone, so if at least one group has a different population mean, variance among groups exceeds that expected from sampling error
83
ANOVA What is k?
number of groups
84
ANOVA What is MSgroup?
mean squares group
85
ANOVA What is MSerror?
mean squares error
86
What is the test statistic for ANOVA?
F
87
ANOVA What should F be if null hypothesis is true?
1
88
ANOVA What is F if null hypothesis is false?
F > 1 (but must take into account sampling error – F calculated from data will often be greater than one even when null is true, therefore we must compare F to null distribution)
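A minimal sketch with scipy's `f_oneway` (group values invented; the second group is deliberately shifted so F comes out well above 1). It also checks the equivalence noted earlier: with only 2 groups, ANOVA's F equals the square of the two-tailed 2-sample t statistic:

```python
from scipy import stats

# hypothetical measurements for three groups (illustrative values)
g1 = [10.2, 11.1, 9.8, 10.5, 10.9]
g2 = [12.4, 13.0, 12.1, 12.8, 13.3]
g3 = [10.0, 10.7, 9.9, 10.4, 10.8]

f_stat, p = stats.f_oneway(g1, g2, g3)
print(f_stat, p)  # F well above 1 here, since g2 is clearly shifted

# with 2 groups, ANOVA is mathematically equivalent to a
# two-tailed 2-sample t-test: F = t^2 and the P-values match
f2, p2 = stats.f_oneway(g1, g3)
t = stats.ttest_ind(g1, g3, equal_var=True)
print(f2, t.statistic ** 2, p2, t.pvalue)
```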
89
What is an ANOVA table?
convenient way to keep track of important calculations – scientific papers often report ANOVA results in ANOVA tables
90
What are the assumptions of ANOVA? (3)
- random samples | - normal distributions for each population | - equal variances for all populations
91
What is the Kruskal-Wallis Test?
non-parametric test similar to a single-factor ANOVA – uses ranks of the data points
92
What is a factor?
categorical explanatory variable
93
What is multiple-factor ANOVA?
ANOVAs can be generalized to look at more than one categorical variable at a time | - can ask whether each categorical variable affects a numerical variable | - can ask whether categorical variables interact in affecting the numerical variable
94
Multiple-factor ANOVA Graphs
95
ANOVA What are fixed effects?
treatments are chosen by experimenter – not a random subset of all possible treatments | - things we care about | - ie. specific drug treatments, specific diets, season
96
ANOVA What are random effects?
treatments are a random sample from all possible treatments | - things that can affect response variable, but we don’t care too much about | - ie. family, location
97
ANOVA What is the difference in statistics for fixed or random effects for single-factor ANOVA?
no difference
98
What is 2-factor ANOVA?
tests multiple hypotheses – ie. no difference based on North and South alone
99
Multiple Comparisons What is the equation for probability of Type I error in N tests?
1 - (1-𝛼)^N
ie. for 20 tests, probability of at least one Type I error is ~64%
- Type I error rate for each test = 𝛼
- Pr[not making a Type I error | null is true] = 1-𝛼
- Pr[not making a Type I error on N tests | null is true] = (1-𝛼)^N
- Pr[at least one Type I error] = 1 - (1-𝛼)^N
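The arithmetic is easy to check directly (and shows why dividing 𝛼 by the number of tests, as in the Bonferroni correction, keeps the family-wide error rate below 𝛼):

```python
# probability of at least one Type I error across N independent tests,
# each run at significance level alpha
alpha = 0.05

def family_error(alpha, n_tests):
    # Pr[at least one Type I error] = 1 - (1 - alpha)^N
    return 1 - (1 - alpha) ** n_tests

p20 = family_error(alpha, 20)
print(p20)  # ~0.64 for 20 tests at alpha = 0.05

# with alpha' = alpha / N per test, the family-wide rate stays below alpha
print(family_error(alpha / 20, 20))
```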
100
Multiple Comparisons What happens to the probability of type I error every time you do a test?
probability increases | - do too many tests → probability gets too high | - do more tests → will find something that is statistically significant due to chance
101
What is the Bonferroni Correction for multiple comparisons?
uses a smaller 𝛼 value: 𝛼' = 𝛼 / (number of tests)
102
What does the Tukey-Kramer test do?
compares all group means to all other group means to find which groups are different from which others
103
When are Tukey-Kramer tests done?
after finding evidence for differences/variation among means with single-factor ANOVA
104
What are the hypotheses for Tukey-Kramer test?
H0: 𝜇1 = 𝜇2
H0: 𝜇1 = 𝜇3
H0: 𝜇2 = 𝜇3
etc.
105
What is the probability of making at least one Type I error in Tukey-Kramer test?
probability of making at least one Type I error throughout the course of testing all pairs of means is no greater than the significance level (𝛼)
106
Tukey-Kramer Graph
107
Why do we use Tukey-Kramer instead of a series of two-sample t-tests? (3)
- multiple comparisons would cause t-tests to reject too many true null hypotheses - Tukey-Kramer adjusts for the number of tests - Tukey-Kramer also uses information about variance within groups from all the data, so it has more power than t-test with Bonferroni correction
108
What is the parameter for correlation?
⍴ (rho) value is between -1 and 1
109
What is the estimate for correlation?
correlation coefficient (r): describes relationship between two numerical variables
110
What is the coefficient of determination (r^2)?
describes proportion of variation in one variable that can be predicted from the other variable
111
What is covariance in relation to variance?
variance is a special case of covariance: Var(X) = Cov(X, X)
112
What are the assumptions of correlation tests? (3)
- random sample | - X is normally distributed with equal variance for all values of Y | - Y is normally distributed with equal variance for all values of X
113
Correlation What does it mean if ⍴ = 0?
- r is normally distributed with mean = 0 | - whenever the sampling distribution is normal and the standard error is estimated, use t | - if ⍴ ≠ 0, the sampling distribution of r is asymmetric
114
What is Spearman's Rank correlation?
alternative to Pearson’s correlation that does not make so many assumptions
115
Correlation What is attenuation?
estimated correlation will be lower if X or Y are estimated with error
116
What does correlation depend on?
range
117
Are species independent data points?
NO
118
What is a similarity between correlation and regression?
both compare two numerical variables
119
What is a difference between correlation and regression?
each asks a different question: | - correlation – symmetrical | - regression – asymmetrical
120
What does regression do?
predicts Y from X (one variable from another)
121
What does linear regression assume? (3)
- random sample | - Y is normally distributed for all values of X, and the variance of Y is the same for all values of X | - relationship between X and Y can be described by a line
122
Parameters of Linear Regression – graphs
123
What is the equation for the estimated regression line?
Y = a + bX
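A minimal sketch of fitting Y = a + bX by least squares with scipy (X and Y values below are invented). One handy check: for a least-squares line with an intercept, the residuals always sum to zero:

```python
from scipy import stats

# hypothetical X and Y values (illustrative)
x = [1, 2, 3, 4, 5, 6]
y = [2.2, 4.1, 5.8, 8.3, 9.9, 12.1]

fit = stats.linregress(x, y)
a, b = fit.intercept, fit.slope  # estimated regression line: Y = a + bX

# residual = observed Y - predicted Y; least squares minimizes the
# sum of squared residuals, and with an intercept they sum to zero
residuals = [yi - (a + b * xi) for xi, yi in zip(x, y)]
print(a, b, sum(residuals))
```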
124
What is the least squares regression line?
line that minimizes the sum of squared residuals
125
What is a residual?
residual = observed Y - predicted Y | for every X value, Ŷ (the value of Y predicted by the regression line) is the value of Y right on the line
126
Regression What does the coefficient of determination (r^2) do?
gives the proportion of variance in Y explained by the regression line
127
Regression What do you need to be cautious about?
unwise to extrapolate beyond range of the data
128
What are the hypotheses for regression?
H0: 𝛽 = 0 HA: 𝛽 ≠ 0
129
Regression What is the degrees of freedom for residual?
df = n - 2
130
What are confidence bands?
confidence intervals for predictions of mean Y
131
What are prediction intervals?
confidence intervals for predictions of individual Y
132
How can non-linear relationships be 'fixed' (turned linear)? (3)
- transformations - quadratic regression - splines
133
What do residual plots do?
help assess assumptions
134
What should the residual plot look like?
- mean of the population is right on the line, with variance around it | - residuals should be roughly the same size across all values of X (centred around 0, with equal positives and negatives) | - residuals should be spread out along the line, about the same distance from the line on average for every X
135
Polynomial Regression Why should you NOT fit a polynomial with too many terms? (3)
- sample size should be at least 7x the number of terms | - very unlikely that a new X would fall on the line | - tradeoff between fit and prediction error: a polynomial with many terms fits your particular data set better, but has larger prediction error
136
What does logistic regression do?
tests for relationship between a numerical variable (as the explanatory variable) and a binary variable (as the response variable) | ie. does the dose of a toxin affect probability of survival? | ie. does the length of a peacock's tail affect its probability of getting a mate?
137
What is publication bias?
papers are more likely to be published if P < 0.05 – causes bias in science reported in literature
138
What are computer-intensive methods for hypothesis testing?
- simulation | - randomization
139
What are computer-intensive methods for confidence intervals?
bootstrap
140
What is simulation?
simulates sampling process on computer many times – generates null distribution from estimates done on simulated data; the computer assumes the null hypothesis is true
141
What is the equation for likelihood?
L(hypothesis A | data) = P[data | hypothesis A]
142
What does likelihood NOT care about?
other data sets – ONLY cares about the specific data set we have
143
What does likelihood capture?
captures level of surprise – prefer models that make the data less surprising, ie. have higher likelihood
144
Does likelihood consider more than one possible hypothesis?
yes
145
What is the law of likelihood?
a particular data set supports one hypothesis better than another if the likelihood of that hypothesis is higher than the likelihood of the other hypothesis – therefore we try to find the hypothesis with maximum likelihood (least surprising data); all estimates we have learned so far are also maximum likelihood estimates
146
What are the 2 ways to find the maximum likelihood?
- calculus | - computer calculations
147
How to Find Maximum Likelihood Calculus
ie. maximum value of L(p = x) is found when x = ⅜ – note that this is the same value we would have gotten by methods we already learned
148
How to Find Maximum Likelihood Computer Calculations
1. input likelihood formula to computer
2. plot value of L for each value of x
3. find largest L
149
What does hypothesis testing by likelihood do?
compares likelihood of the maximum likelihood estimate to that of the null hypothesis – uses the log-likelihood ratio
150
What is the test statistic for hypothesis testing by likelihood?
χ² = 2 × (log-likelihood ratio)
151
What is the degree of freedom for hypothesis testing by likelihood?
df = number of variables fixed to make null hypothesis
152
When producing a 95% confidence interval for the difference between the means of two groups, under what circumstances can a violation of the assumption of equal standard deviations be ignored?
two-sample t-tests and confidence intervals are robust to violations of equal standard deviations as long as:
- sample sizes of the two groups are roughly equal
- standard deviations are within a factor of three of one another
153
What is the justification for including extreme doses well outside the range of exposures encountered by people at risk in a dose-response study on animals of the effects of a hazardous substance? What are the problems with this approach?
- extreme doses increase power, and so enhance the probability of detecting an effect - however, effects of a large dose might be very different from effects of a smaller, more realistic dose - if an effect is detected, then studies of the effects of more realistic doses would be the next step
154
What does randomization do?
removes effects of confounding variables
155
What does blinding do?
avoids unconscious bias
156
What happens if a study has a poor control?
increases possibility of confounding by unmeasured variables
157
What are planned vs. unplanned comparisons?
unplanned comparisons – intended to search for differences among all pairs of means
planned comparisons – must be few and identified as crucial in advance of gathering and analyzing the data
158
The largest pairwise difference between means, that between the “medium” and “isolated” treatments, is statistically significant. How is this possible, given that neither of these two means is significantly different from the means of the other two groups?
failure to reject a null hypothesis that the difference between a given pair of means is zero does not imply that the means are equal, because power is not necessarily high, especially when the differences are small
if the means of the “medium” and “isolated” treatments differ from one another, then one or both of them must differ from the means of the other two groups, but we don’t know which
159
What quantity would you use to describe the fraction of the variation in expression levels explained by group differences?
R^2
160
Earwig density on an island and the proportion of males with forceps are estimates, so the measurements of both variables include sampling error. In light of this fact, would the true correlation between the two variables tend to be larger, smaller, or the same as the measured correlation?
earwig density and the proportion of males with forceps on an island are estimates, so both variables are measured with sampling error
measurement error tends to decrease the estimated correlation (attenuation)
therefore, the true correlation is expected to be higher on average than the estimated correlation
161
How do you analyze assumptions of linear regression in scatter plot?
- residuals are symmetric and don’t show any obvious non-normality | - variance of the residuals does not appear to change greatly for different values of X
162
What is a least squares regression line?
minimizes the sum of squared differences between the predicted Y-values on the regression line for each X and the observed Y-values
163
What are residuals?
differences between predicted Y-values on the estimated regression line, and the observed Y-values
164
What does the MSresidual measure?
variance of the residuals
165
Linear Regression What does R^2 measure?
fraction of the variation in Y that is explained by X
166
The data set depicted in the graph includes one conspicuous outlier on the far right. If you were advising the forensic scientists who gathered these data, how would you suggest they handle the outlier?
- first, check the data to ensure this individual was not entered incorrectly
- perform the analysis with and without the outlier included in the data set to determine whether it has an influence on the outcome
- if it has a big influence, then it is probably wise to leave it out and limit predictions to the range of X-values between 0 and about 200 (and urge them to obtain more data at the higher X-value)
167
What do confidence bands measure?
give the confidence interval for the predicted Y for a given X
168
Which bands would provide the most relevant measure of uncertainty?
prediction interval, because it measures uncertainty when predicting Y of a single individual
169
What is ANCOVA?
(analysis of covariance) compares many slopes
170
What are the hypotheses of ANCOVA?
H0: 𝛽1 = 𝛽2 = 𝛽3 = 𝛽4 = 𝛽5… (multiple null hypotheses) HA: at least one of the slopes is different from another
171
What is bootstrapping?
method for estimation (and confidence intervals) | - often used for hypothesis testing too | - often used in evolutionary trees
172
What is the method for bootstrapping?
- for each group, randomly pick with replacement an equal number of data points from the data of that group | - with this bootstrap dataset, calculate a bootstrap replicate estimate
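These two steps can be sketched with numpy, here bootstrapping a percentile confidence interval for a group mean (the sample values are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(42)

# hypothetical sample from one group (illustrative values)
data = np.array([4.1, 5.3, 3.8, 6.2, 5.0, 4.7, 5.9, 4.4, 5.5, 5.1])

# resample with replacement, same n as the original, many times;
# each resample yields one bootstrap replicate estimate of the mean
boot_means = np.array([rng.choice(data, size=len(data), replace=True).mean()
                       for _ in range(5000)])

# percentile 95% confidence interval for the mean
lo, hi = np.percentile(boot_means, [2.5, 97.5])
print(lo, hi)
```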
173
Why are paired samples analyzed differently than separate samples?
two individuals in a pair share many things in common with each other but differ from members of other pairs
whatever variation these shared factors cause in the response variable is factored out in the difference between them
by looking at the differences, we potentially avoid much of the error variance in the data
separate samples do not share these properties