Statistics Presentation Notes Flashcards

Question

What is an example of a non-falsifiable hypothesis (HINT: it remains a popular idea in most people's heads nonetheless)?

Answer 1

God created the Universe

Answer 2

1. rejection of original claim 2. modification of original claim 3. confirmation of original claim

Answer 3

1. development of null and alternative hypotheses 2. calculation of a test statistic 3. converting the test statistic to a P-value 4. deriving a conclusion

Answer 4

(1) null (2) difference (3) pattern (4) cause-and-effect

Answer 5

Ho = "Ghostbuster" and "Night Shadow" eggplants are the same size Ha = "Ghostbuster" eggplants are larger than "Night Shadow" fruits

Answer 6

to establish the distribution curve, we could measure 1,000 "Ghostbuster" and 1,000 "Night Shadow" eggplants and take their average masses (mu[G] and mu[NS]), then take their difference (mu[G] - mu[NS]) next, we mix the 2,000 observations and pull 1,000 of them at random and assign them as the average mass for a hypothetical "Ghostbuster" group (mu[g1]), giving the other 1,000 the distinction of a hypothetical "Night Shadow" average (mu[ns1]), after which we take their averages (mu[g1] - mu[ns1]) repeat the previous step 999 times (mu[g2] - mu[ns2], mu[g3] - mu[ns3]), ... mu[g999] - mu[ns999], mu[g1000] - mu[ns1000]) and plot the frequency of the differences as a histogram

Answer 7

plot the true difference in mass (mu[G] - mu[NS]) on the histogram with the frequency of simulated mass differences count the number of observations which are larger than the true difference; the P-value will be the number of observations divided by 1,000

Answer 8

"... after many decades of custom, tradition and vigilant enforcement by editors and journal reviewers"

Answer 9

[1] sample size (n) [2] the measurement under investigation (such as difference in size between two cultivars) [3] level of variation (sigma^2)

Answer 10

larger sample sizes are more reflective of the overall population or true value

Answer 11

the lower the amount of variation between groups, the lower the resulting P-values

Answer 12

all conclusions are based on incomplete information, which may not actually reflect the true situation

Answer 13

(1) failure to reject a null hypothesis which is true (2) rejection of a null hypothesis which is false

Answer 14

false rejection of a null hypothesis which is true

Answer 15

something is thought to be occurring when nothing actually is

Answer 16

failure to reject a null hypothesis which is actually false

Answer 17

nothing is thought to be occurring when something actually is

Answer 18

the probability of correctly rejecting a null hypothesis which is false

Answer 19

statistical power = 1.0 - beta (probability of a type II error)

Answer 20

the study may have been under-powered, meaning it had too few samples

Answer 21

they are inverse -- as one grows, the other shrinks

Answer 22

a type I error proclaims there is a phenomenon where none actually exists, which can lead people to do foolish things for bad reasons a type II error, as a false negative, doesn't generally mobilize people, and can often be corrected in the future with more sensitive tech

Answer 23

linear model

Answer 24

over a very narrow range of x values, the function can appear more linear

Answer 25

homogeneity of variance

Answer 26

independence

Answer 27

(1) evenly distributed (2) y = 0 (3) uncorrelated

Answer 28

inflated estimate of variance

Answer 29

normal qq-plot

Answer 30

weighted standard deviation

Answer 31

(1) calculate the residuals (e) for each value of the response variable (y - y-hat) (2) make each residual (e) standardized by dividing it against the weighted standard deviation (sigma^2) (3) for the scatter-plot, set the theoretical quantiles of the differences on the x-axis and the standardized residuals on the y-axis

Answer 32

the linear abline shows where standardized residuals would fall if they were perfectly normal

Answer 33

all the values fall nicely on the abline

Answer 34

all the values are linear early on and curve upward later (think of the curve of a circle with center at (0, 0) which is being looked at in Quadrant IV)

Answer 35

all the values curve upward early on and become more linear later (think of the curve of a circle with center at (0, 0) which is being looked at in Quadrant II)

Answer 36

distribution of errors with fat-tailed residuals

Answer 37

distribution of errors with thin-tailed residuals

Answer 38

(1) parametric theory (2) confidence intervals (3) hypothesis testing

Answer 39

t-plot of residual values versus the fitted data

Answer 40

homoscedastic

Answer 41

heteroscedastic

Answer 42

maintain sampling design with subjects that are indepedent spatially and/or temporally from one another

Answer 43

(1) base-10 ("common") logarithmic (2) base-e ("natural") logarithmic (3) square-root (4) arcsine square root

Answer 44

orders of magnitudes in size

Answer 45

values need to be greater than zero

Answer 46

count data

Answer 47

outputs are compressed between (0, 1.0)

Answer 48

three kinds of t-test: (1) one sample t-test (2) two-sample t-test (3) paired t-test all t-tests compare the means of the data

Answer 49

z[obs] = (x-bar - mu) / (sigma / n^0.5) (In English, the test statistic is equal to the difference of the means of the samples minus the expectation under the null hypothesis, divided by the average amount of variation between the samples)

Answer 50

the t-statistic is used when the sample size is small or the standard deviation of the population is not known

Answer 51

(1) paired t-tests are used if two measurements are taken on the same unit (2) paired t-tests can remove variability between units

Answer 52

control of sources of variation

Statistics Presentation Notes Flashcards

(80 cards)