exam Flashcards

(26 cards)

1
Q

What is an association?

A

Two variables have an association if change in values of one variable coincide with a pattern of change in the other variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a positive association?

A

As a person’s height increases, so does their weight.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a negative association?

A

As water temperature increases, dissolved oxygen concentration decreases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What data is needed for studying associations?

A

One sample of randomly chosen individuals from the population of interest, measuring two variables of interest (X and Y).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does each row in the data for studying associations represent?

A

Each row (case) contains the measurements for X and Y made on a single individual.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is correlation analysis?

A

A descriptive method to determine if an association is positive or negative and if it is strong or weak.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is regression analysis?

A

A predictive method to determine direction and strength of association and to predict values of Y given values of X.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does correlation indicate?

A

Correlation indicates a statistical pattern of co-variation among two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between correlation and causation?

A

Correlation does not imply causation; correlation is an observation while causation is an inference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the criteria for inferring cause-effect from associations?

A
  • Association documented at multiple sites by multiple studies
  • Association present when effects of other lurking variables eliminated
  • Plausible cause-effect mechanism that explains co-variation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a lurking variable?

A

A variable that is not included in the analysis but affects the relationship between the studied variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is an example of a plausible cause-effect mechanism?

A

Contraception increases time between births, allowing infants to nurse longer and delay exposure to waterborne diseases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Is statistical correlation sufficient to conclude a cause-effect relationship?

A

No, statistical correlation alone is not sufficient to conclude a cause-effect relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the purpose of regression analysis?

A

To predict values of Y based on data for X and to define associations between variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the uses of regression analysis?

A
  • To predict future values of Y based on historical data
  • To inform policies or regulations based on statistical associations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the assumptions of linear regression?

A
  • Linearity
  • Statistical independence
  • Homoscedasticity
  • Normality
17
Q

What does homoscedasticity refer to?

A

Constant variance of residuals across predictors.

18
Q

What happens if the assumptions of linear regression are violated?

A

Predictions, confidence intervals, and insights might be inefficient, biased, or misleading.

19
Q

What is the least-squares regression line?

A

A line fit through the middle of data points in a scatterplot that minimizes the sum of squared vertical differences between observed and predicted values.

20
Q

What is the purpose of the least-squares regression line?

A

To predict Y-values based on X-values.

21
Q

How is the slope of the least-squares regression line calculated?

A

Using the formula: ( b_1 = rac{sum (x_i - ar{x})(y_i - ar{y})}{sum (x_i - ar{x})^2} )

22
Q

What is the Y-intercept in the least-squares regression line?

A

Calculated using the formula: ( b = ar{y} - b_1 ar{x} )

23
Q

What is the Durbin Watson test used for?

A

To test for autocorrelation in residuals.

24
Q

What is the formula for linear regression in R?

A

lm(response ~ predictor, dataset)

25
What is checked for normality of residuals?
Graphical methods like histograms and normal probability plots, and statistical tests like Shapiro-Wilk.
26
What is a 95% confidence interval in regression?
A range that estimates the uncertainty around the predicted values.