Exam 3 Flashcards

Question

ex. matched pairs t-test

Answer 1

Make cola - 1. right after produced, or 2. one month later --diff. = fresh - stored, n = 10 1. STATE: is there evidence that cola lost sweetness during storage? 2. PLAN: two measurements on each batch = fresh and stored - -perform matched pairs t test on Md ``` parameter: Md = mean difference in sweetness of all cola after one month di = fresh - stored H0 = Md = 0 Ha: Md > 0 alpha = .05 ``` 3. SOLVE: conditions = SRS yes, plot data: no outliers YES -dbar = .30, Sd = 1.16 t = (dbar - Md) / (Sd/radical n) t = (.30 - 0) / (1.16 / radical 10) = 0.818 p-value = .200 < value < .250 4. CONCLUDE: value > alpha, so fail to reject Ho and conclude that evidence is not strong enough to say cola lost sweetness after one month of storage

Answer 2

One sample inference: intervals an tests for mean (Mu) --application: matched pairs intervals and tests for a mean diff. (Md) two-sample inference: intervals and tests for a difference btw two means (M1 - M2) matched pairs --2 SRS of pairs, one individual for each condition, or experiment using paired units - 1 unit randomly assigned to each treatment two sample inference - -2 SRS - 1 from each population or - -experiment using unpaired units - half randomly assigned to each treatment

Answer 3

Pop. 1. Pop. 2 pop. mean. Mu 1. Mu 2 COMMON pop. SD. sigma sigma. (only one thats same) sample size n1 n2 sample mean xbar 1. xbar 2 sample SD s1 s2 Mu 1 - Mu 2 = diff. btw 2 population means xbar 1 - xbar 2 = dif. btw sample means 1. investigate sample distribution of xbar 1 - xbar 2 for SRS from pop. of interest 2. use sample distribution to develop C.I. for Mu 1 - M2 3. use sample distribution to develop test of significance for Mu 1 - Mu 2

Answer 4

1. take SRS of size N1 from pop. 1 2. same from pop. 2 (n2) 3. both pop. normally distributed with no outliers check 4. find xbar1 - xbar2 center = mean distribution of xbar1 - xbar2 = Mu1 - Mu2 spread = SD = radical((sigma^2/n1)+(sigma^2/n2)) OR sigma*radical((1/n1) + (1/n2)) shape = approx. normal if both n1 and n2 are at least 30 how do we estimate sigma? Sp = radical ((n1 - 1)*s1^2 + (n2 - 1)*s2^2) / n1 + n2 - 2

Answer 5

C.I. = xbar1 - xbar2 +/- t* Sp*Radical(1/n1) + (1n2) df = n1 + n2 - 2 test! Ho: Mu1 = Mu2 or Mu1 - Mu2 = 0 Ha: Mu1 >/does not = Mu2 t = (xbar1 - xbar2) / Sp*Radical(1/n1) + (1/n2) conditions: 1. randomness of data collection? - SRS or treatment SRS 2. normality of pop. or large sample size - check by making sure there are no outliers or both sample sizes > 30 3. equal pop. st. dev. (Sigma) - - --check by (larger s) /( smaller s) < 2

Answer 6

1. STATE: does antidepressant cause an INC. in water consumption? use alpha = .05 2. PLAN: Use a two-sample t test for means --let Mud = mean water intake for rats in drug group ---Mup = mean water intake for rates in placebo group (so this was SRS of two treatments) parameter: Mud - Mup Ho: Mud - Mup = 0, or Mud = Mup Ha: Mud - Mup > 0 , or Mud > Mud alpha = .05 3. SOLVE Check: SRS?, Normal and no outliers, and same pop. st. dev. (check by large s / small s = .750 / .564 = 1.33 <2 so good drug placebo xbar = 8.48ml xbar2 = 7.93 ml s = .750 ml s = .564 ml n = 10 n = 10 test stat. = Sp = radical((n1 - 1)*s1^2) + (n2 - 1)*s2^2) / n1 = n2 - 2 = radical [(10 - 1).75^2 + (10 - 1).564^2] / 10 + 10 - 2 = .664 ``` t = (xbard - xbarp - 0) / Sp*radical(1/nd + 1/np) t = (8.48 - 7.93 - 0) / .664*radical (1/10 + 1/10)) = 1.852 df = 10 + 10 - 2 = 18 ``` p-value = .025 < pvalue < .05 4. CONLUDE: pvalue < .05 so reject Ho

Answer 7

Sp = .6635, t* = 1.734 Xbard - Xbarp +/- t* sp*radical(1/ns) + (1/np) = 8.48 - 7.93 +/- (1.734)*.6635*Radical(1/10) + (1/10) CI does not include 0 (.036, 1.065) so thus Mud does not equal Mup --this confirms significance test of rejecting Ho

Answer 8

remember the chart with C - C, C - Q, Q - C, Q - Q One-sample inference - intervals and tests for a mean (Mu) two-sample inference: intervals and tests for a DIFFERENCE btwn 2 means (Mu1 - Mu2) multi-sample inference: intervals and tests for comparisons of 3 or more means (Mu1 - mu3, Mu1 - Mu2, Mu2 - Mu3, 1/2(Mu1 + Mu2))

Answer 9

2 sample inference - 2 separate SRSs - 1 from each population - OR --OR experiment using unpaired units (half randomly assigned to each treatment) multi-sample inference - -3 or more separate SRSs (1 from each population) OR - -OR expertement using unblocked units (randomly assigned to 3 or more treatments) most scientific studies involve 3 or more groups - However: inferences and related issues are much more complicated for multi-sample studies - -complete discussion beyond scope of the course - -we will discuss just 1 useful test of significance

Answer 10

Ho: M1 = M2 --> (xbar1 - xbar2) / sp*radical(1/n1 + 1/n2) gives p-value 1 Ho: M1 = M3 --> (xbar1 - xbar3) / sp*radical(1/n1 + 1/n3) gives p-value 2 Ho: M2 = M3 --> (xbar2 - xbar3) / sp*radical(1/n2 + 1/n3) gives p-value 3 3 ho: and 3 p-value: don't know which p-value to use - -multiple tests - the more tests performed...the 1. greater probability of observing an extreme statistic due to chance 2. the greater probability of declaring significance for at least one test when all diff. are really due to chance alone needed: one overall test (one null hypothesis, one test stat, one p-value) to TEST EQUALITY OF 3+ MEANS

Answer 11

1. overall test - -test procedure: one-way analysis of variance (ANOVA) - -test stat: F ratio of variances 2. follow up analysis --if overall test is significant: comparison of CI for individual means can shed some light on general question of difference among Sus by testing... Ho: M1 = m2 = m3 vs. Ha: at least one Mi is diff. from the others

Answer 12

conditions: 1. random: SRS or random allocation 2. pop. normally distributed or large sample size = no outliers in plots of data or sample sizes > 30 3. st. dev. of pop. approx. = - --so check that (largest s) / (smallest s) < 3 test stat called "F" or "ANOVA F" - -calc. F called analysis of variance (ANOVA) - -basic idea of D: compare variation among xbars to variation expected due to randomness - -formula for F and associated p-value = use one-way ANOVA software IF - -p-value > alpha done!! (can't reject hypotheses that pop. means are =) - -p-value < alpha - only know at least one campion of means is diff. from 0 - look at the CI or draw box plots to know which one is off - -HINT: F is always in box on top right and you never have to solve for it - -you will know you have to reject Ho but to see which is off look at box plots - if they overlap then diff. of means is not statistically significant - if do not overlap the means differ significantly

Answer 13

2 categorical variables in each individual (ex. handedness and birth type [single vs. twins]) --investigate relationship btwn variables using visual displays and numerical summaries 1. two way table of counts - -summarizes C-C relationship 1. the explanatory variable is usually the row variable (gender) and the response variable is the column (opinion on beards) 2. 2-way rectangular table of combined categories 3. count individuals in each combined category 4. sum across rows and over columns to get marginal totals 5. roles of row and column variables can be switched marginal total for females - -numerical summary tool: conditional distributions for rows and columns - -visual display tool: grouped bar chains, stacked bar chains, others)

Answer 14

- -divide cell counts by row total to get conditional distributions - -evaluate C-C relationship by comparing - -if conditional distributions are diff. there is a potential relationship or association for visual display: grouped/stacked bar charts

Answer 15

- -summarize in 2-way table - -calculate conditional distribution of response variable for each value of explanatory variable - -if continual distributions are diff, there is potential connection btw categorical variables

Answer 16

1. investigate sampling distribution of phat1 - phat2 for SRS from 2 populations of interest or randomized controlled experiment with 2 treatments 2. use sampling distribution to develop a CI for p1 - p2 3. use sampling distribution to develop a test of significance for p1 - p2 diff. btw proportion of doctors taking aspirin who had heart attacks and proportion of doctors receiving placebo who had heart attacks p1 - p2 = .009 - .017 = -.008

Answer 17

1. take SRS of size N1 from pop. 1 - observe categorical variable 2. take separate SRS of size n2 from pop. 2 and observe categorical variable 3. compute phat1 - phat 2 ``` center = mean is p1 - p2 spread = SD is radical (p1*(1 - p1))/n1. +. (p2*(1-p2))/n2 ``` shape - approx. normal if n1 and n2 are large --check by n1p1 >5, n1(1-p1) > 5, n2p2 > 5, n2(1-p2) >5 for "approx." sampling distribution of phat1 - phat2 ``` center = same (p1 - p2) SD = same but use phat instead of p under the radical ``` shape = normal if n1phat1 > 5 and all others (same but use phat instead of p)

Answer 18

CI estimate +/- margin of error = phat1 - phat2 +/- z*radical (phat1*(1 - phat1))/n1. +. (phat2*(1-phat2))/n2 phat1 - phat2 = estimate z* = table value SD = standard error

Answer 19

Ho: p1 = p2, or Ho: p1 - p2 = 0 test statistic = (estimate - hypothesized value of p1 - p2) / SD expected under Ho z = (phat1 - phat 2 - 0) / radical (p1*(1-p1))/n1. +. (p2*(1-p2))/n2 problem?? we don't know p1 and p2 ==use phat1 pooled sample proportion to estimate p1 and p2 as we assume Ho: p1 = p2 to be true standard error for phat1 - phat2 - -is the whole SD formula under the radical when finding CI (used lots of times in cards) - -or use radical (phat*1-phat)*(1/n1 + 1/n2)) when calc. a test statistic assuming the null hypothesis is true

Answer 20

multi sample inference for proportions: chi-squaredfor tables of counts C-C 1 sample inference --intervals and tests for a proportion (p) 2 sample inference --intervals and tests for a diff. btwn 2 proportions (p1 - p2) multi-sample inference --intervals and tests for comparisons of 3 or more proportions

Answer 21

- -1 from each population, categorical variable or experiment using unblocked units - -randomly assigned to several treatments, categorical response variable or - -1 SRS, 2 categorical variables for each individual

Answer 22

Ho: there is NO association btw the 2 categorical variables (they are independent) Ha: there is an association btw the 2 categorical variables (they are not independent) conditions: 1. randomness: 1 SRS with 2 variables or multiple SRSs with 1 variable or randomized experiment with multiple treatments 2. large sample size = all > 5

Answer 23

``` o = observed e = expected (row total * column total) / grand total ``` expected refers to values that would be expected of the null hypothesis were true (NO association) chi-squared method 1. calculate expected counts assuming Ho is true 2. calculate a test statistic to measure the difference btw what we observe and what we expect if Ho were true test statistic = x^2 = sum of all cells (O - E)^2) / E use a chi-square table w (r-1) and (c-1) degrees of freedom to get a p-value --how likely is it to get such a big discrepancy btw observed and expected?

Answer 24

1. STATE: Is there an association btw type of religion and religious knowledge? 2. PLAN: use a chi squared test with Ho: there is no association Ha: there is an association alpha = .05 3. SOLVE: check conditions - -random? 4 pop. and 1 categorical variable (religion and answer to JS question) - -large? all expected counts > 5 x^2 test = sum ((O - E)^2) / E --df = (4-1) * (2-1) = 3 ``` x^2 = 40 and df = 3 --pvalue = .0005 ``` 4. CONCLUDE: reject Ho - evidence of association btwn religion an religious knowledge

Answer 25

it supports Ho

Answer 26

sampling variability

Exam 3 Flashcards

(50 cards)