final Flashcards by Haley Rain

What is a number that summarizes sample data values and is often used to estimate a characteristic of a population?

Statistic

How well did you know this?

Not at all

Perfectly

What is a number that describes some characteristic of an entire population?

Parameter

How well did you know this?

Not at all

Perfectly

Values from repeated measurements of the same sample are evenly distributed around the true population value, but are highly variable. What does this indicate?

Low precision, unbiased

How well did you know this?

Not at all

Perfectly

Select ALL statistics below that are considered ‘robust statistics’.

Median
IQR

How well did you know this?

Not at all

Perfectly

What does it mean when two events are mutually exclusive?

Both events cannot occur simultaneously

How well did you know this?

Not at all

Perfectly

According to the law of total probability, what must the sum of all probabilities equal?

How well did you know this?

Not at all

Perfectly

What is theoretical probability?

The frequency that some event will happen that is based on all known outcomes

How well did you know this?

Not at all

Perfectly

What is empirical probability?

The frequency that some event will happen in the long run, based on observations

How well did you know this?

Not at all

Perfectly

What is the difference between simple random sampling and stratified random sampling?

Simple: Randomly selecting people without a strategy; Stratified: Population divided into subgroups and random samples taken from each

How well did you know this?

Not at all

Perfectly

Why do we test the null hypothesis?

Because the exact expected values of a test statistic are known under the assumption there is no difference

How well did you know this?

Not at all

Perfectly

What are the two reasons why the sample statistic may deviate from the null hypothesis?

Difference due only to random variation
If there really is a difference

How well did you know this?

Not at all

Perfectly

A p-value is defined as the probability of obtaining what?

A test statistic as extreme (or more extreme) if the null hypothesis were true

How well did you know this?

Not at all

Perfectly

Under the null hypothesis statistical testing framework, what is the probability of declaring there is an effect when there is no effect known as?

Type one Error rate

How well did you know this?

Not at all

Perfectly

What is the null hypothesis value for the population parameter?

The expected parameter value under an assumption that there is no difference or effect

How well did you know this?

Not at all

Perfectly

What is the effect of increasing sample size on a p-value, all other things being equal?

Decreases the p-value

How well did you know this?

Not at all

Perfectly

Which of the following would decrease the p-value of a t-test, assuming all other factors were held the same?

Increase sample size

How well did you know this?

Not at all

Perfectly

What would increase the width of a confidence interval?

Increasing the confidence level

How well did you know this?

Not at all

Perfectly

If you know the population standard deviation and population mean, what test would you use?

How well did you know this?

Not at all

Perfectly

If your data are positively skewed, what transformation would you use to attempt to ‘normalize’ it?

Logarithm

How well did you know this?

Not at all

Perfectly

If your data are negatively skewed, what transformation would you use to attempt to ‘normalize’ it?

Square

How well did you know this?

Not at all

Perfectly

Statistical power is defined as what?

The probability that a study would detect an effect, if the effect is of a specified size

How well did you know this?

Not at all

Perfectly

Which of the following would increase statistical power, assuming all other factors were held the same?

Increase sample size

How well did you know this?

Not at all

Perfectly

To determine if resistance training can prevent or reverse loss of bone density in post-menopausal women, what test should be used?

Study These Flashcards

2-sample t-test

What test should be used if the control group is negatively skewed with six outliers?

Study These Flashcards

Mann-Whitney U-Test

According to the owner's manual for an analytical balance, what type of test is used for repeated measurements of a calibration weight?

One sample t-test

What test should be used to determine if adding nitrogen fertilizer increases the abundance?

Paired t-test

If you increase the number of tests for a single null hypothesis, what is the effect on the experiment wise type one error?

Experiment wise type-1 error will decrease

How can you control (or account for) the experiment wise type-1 error rate for multiple tests?

* Bonferroni method * ANOVA

What does the Bonferroni method do?

Reduces individual comparison-wise type-one error to control for an overall experiment-wise type-1 error

Select ALL statements below that are assumptions of ANOVA.

* Each group is a random sample from population of interest * The error variances within each group are equal * The variable is approximately normally distributed within each group * Sample sizes across groups must be equal

ANOVA stands for what?

Analysis of Variance

What does 'among-group variance' include?

* Random variation * The treatment effect

A random block design is used when what occurs?

There is increased variability due to the group the samples are from

A factorial design is used when what is present?

There is an interaction between two variables

What would be appropriate if you collect data from three lakes and N=10, data not distributed (Shapiro Wilkes)?

Levene’s Test

If previous tests determined variances were equal, what test should be used?

ANOVA

If you collect 10 water samples from two lakes and compare 5 variables, what adjustment should be used?

Bonferroni Adjustment

What type of ANOVA should be used when testing the effect of three diets on peristaltic blood pressure with variability across individuals?

Random Block ANOVA

If you select 60 people to test a diet pill, each group gets 20 people, and the data is not normally distributed with unequal variances, what test should be used?

Welch’s ANOVA

To determine if a food supplement affects activity in mice, but one mouse escaped and the data is not normally distributed with unequal variance, what test should be used?

Kruskal Wallis

If you are interested in the effect of acetaminophen and there might be an interaction between two variables, with data normally distributed and variances equal, what test should be used?

Factorial ANOVA

Two variables have an association when what occurs?

Values of one variable coincide with a pattern of change in the other variable

What is an observation in statistical analysis?

Finding a statistical pattern of co-variation among two variables

What is an inference in statistical analysis?

Concluding one variable causes an effect on the other

Identify the lurking variable.

Assumptions for linear regression include what?

* Linearity * Statistical independence * Homoscedasticity * Normality

What is the null hypothesis of correlation analysis?

P=0

How do you assess the linearity assumption of linear regression?

Scatterplot

Applying Bayesian thinking to decision making involves incorporating prior information represented as what?

P(D|H1,X)

What does the law of inverse probability (Bayes' Theorem) calculate?

P(H|D)

If tree farms represent a major long term investment, what method do foresters often use to predict the productivity of a piece of land?

Pearson’s Correlation

What does a forester want to do after identifying the soil/site variable most strongly associated with forest plantation productivity?

Use this association in Regression

An environmental health scientist wants to determine the association between exposure to ozone air pollution and lung capacity. What method should be used?

Spearman’s Correlation

What type of statistical method is used when bacteria from lake sediments are cultured in the presence of different concentrations of methyl mercury?

Regression

Describe the difference between a 95% confidence interval and a 95% credible interval.

* Confidence: Range of values where the true population mean lies with 95% certainty * Credible: Represents probability about the true value

What is the difference between Bayesian and Frequentist approaches?

* Bayesian: Uses prior knowledge, parameters are random * Frequentist: Parameters are fixed, probability is a frequency of occurrence

final Flashcards

(56 cards)