final Flashcards

(56 cards)

1
Q

What is a number that summarizes sample data values and is often used to estimate a characteristic of a population?

A

Statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a number that describes some characteristic of an entire population?

A

Parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Values from repeated measurements of the same sample are evenly distributed around the true population value, but are highly variable. What does this indicate?

A

Low precision, unbiased

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Select ALL statistics below that are considered ‘robust statistics’.

A
  • Median
  • IQR
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does it mean when two events are mutually exclusive?

A

Both events cannot occur simultaneously

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

According to the law of total probability, what must the sum of all probabilities equal?

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is theoretical probability?

A

The frequency that some event will happen that is based on all known outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is empirical probability?

A

The frequency that some event will happen in the long run, based on observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between simple random sampling and stratified random sampling?

A

Simple: Randomly selecting people without a strategy; Stratified: Population divided into subgroups and random samples taken from each

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why do we test the null hypothesis?

A

Because the exact expected values of a test statistic are known under the assumption there is no difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the two reasons why the sample statistic may deviate from the null hypothesis?

A
  • Difference due only to random variation
  • If there really is a difference
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A p-value is defined as the probability of obtaining what?

A

A test statistic as extreme (or more extreme) if the null hypothesis were true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Under the null hypothesis statistical testing framework, what is the probability of declaring there is an effect when there is no effect known as?

A

Type one Error rate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the null hypothesis value for the population parameter?

A

The expected parameter value under an assumption that there is no difference or effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the effect of increasing sample size on a p-value, all other things being equal?

A

Decreases the p-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Which of the following would decrease the p-value of a t-test, assuming all other factors were held the same?

A

Increase sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What would increase the width of a confidence interval?

A

Increasing the confidence level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

If you know the population standard deviation and population mean, what test would you use?

A

Z

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

If your data are positively skewed, what transformation would you use to attempt to ‘normalize’ it?

A

Logarithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

If your data are negatively skewed, what transformation would you use to attempt to ‘normalize’ it?

A

Square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Statistical power is defined as what?

A

The probability that a study would detect an effect, if the effect is of a specified size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Which of the following would increase statistical power, assuming all other factors were held the same?

A

Increase sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

To determine if resistance training can prevent or reverse loss of bone density in post-menopausal women, what test should be used?

A

2-sample t-test

24
Q

What test should be used if the control group is negatively skewed with six outliers?

A

Mann-Whitney U-Test

25
According to the owner's manual for an analytical balance, what type of test is used for repeated measurements of a calibration weight?
One sample t-test
26
What test should be used to determine if adding nitrogen fertilizer increases the abundance?
Paired t-test
27
If you increase the number of tests for a single null hypothesis, what is the effect on the experiment wise type one error?
Experiment wise type-1 error will decrease
28
How can you control (or account for) the experiment wise type-1 error rate for multiple tests?
* Bonferroni method * ANOVA
29
What does the Bonferroni method do?
Reduces individual comparison-wise type-one error to control for an overall experiment-wise type-1 error
30
Select ALL statements below that are assumptions of ANOVA.
* Each group is a random sample from population of interest * The error variances within each group are equal * The variable is approximately normally distributed within each group * Sample sizes across groups must be equal
31
ANOVA stands for what?
Analysis of Variance
32
What does 'among-group variance' include?
* Random variation * The treatment effect
33
A random block design is used when what occurs?
There is increased variability due to the group the samples are from
34
A factorial design is used when what is present?
There is an interaction between two variables
35
What would be appropriate if you collect data from three lakes and N=10, data not distributed (Shapiro Wilkes)?
Levene’s Test
36
If previous tests determined variances were equal, what test should be used?
ANOVA
37
If you collect 10 water samples from two lakes and compare 5 variables, what adjustment should be used?
Bonferroni Adjustment
38
What type of ANOVA should be used when testing the effect of three diets on peristaltic blood pressure with variability across individuals?
Random Block ANOVA
39
If you select 60 people to test a diet pill, each group gets 20 people, and the data is not normally distributed with unequal variances, what test should be used?
Welch’s ANOVA
40
To determine if a food supplement affects activity in mice, but one mouse escaped and the data is not normally distributed with unequal variance, what test should be used?
Kruskal Wallis
41
If you are interested in the effect of acetaminophen and there might be an interaction between two variables, with data normally distributed and variances equal, what test should be used?
Factorial ANOVA
42
Two variables have an association when what occurs?
Values of one variable coincide with a pattern of change in the other variable
43
What is an observation in statistical analysis?
Finding a statistical pattern of co-variation among two variables
44
What is an inference in statistical analysis?
Concluding one variable causes an effect on the other
45
Identify the lurking variable.
A
46
Assumptions for linear regression include what?
* Linearity * Statistical independence * Homoscedasticity * Normality
47
What is the null hypothesis of correlation analysis?
P=0
48
How do you assess the linearity assumption of linear regression?
Scatterplot
49
Applying Bayesian thinking to decision making involves incorporating prior information represented as what?
P(D|H1,X)
50
What does the law of inverse probability (Bayes' Theorem) calculate?
P(H|D)
51
If tree farms represent a major long term investment, what method do foresters often use to predict the productivity of a piece of land?
Pearson’s Correlation
52
What does a forester want to do after identifying the soil/site variable most strongly associated with forest plantation productivity?
Use this association in Regression
53
An environmental health scientist wants to determine the association between exposure to ozone air pollution and lung capacity. What method should be used?
Spearman’s Correlation
54
What type of statistical method is used when bacteria from lake sediments are cultured in the presence of different concentrations of methyl mercury?
Regression
55
Describe the difference between a 95% confidence interval and a 95% credible interval.
* Confidence: Range of values where the true population mean lies with 95% certainty * Credible: Represents probability about the true value
56
What is the difference between Bayesian and Frequentist approaches?
* Bayesian: Uses prior knowledge, parameters are random * Frequentist: Parameters are fixed, probability is a frequency of occurrence