Slides 18, 19 (˶ᵔ ᵕ ᵔ˶) Flashcards

1
Q

What is the difference between correlation and causation?

A

Correlation indicates a relationship between two variables, while causation implies that one variable directly affects the other.

Understanding the distinction is crucial in statistics and research methodologies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two conditions for omitted variable bias?

A
  1. Z is determined by X
  2. Z is correlated with Y

These conditions must hold for omitted variable bias to affect the relationship between X and Y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an omitted variable?

A

A variable that is not included in the analysis but affects the dependent variable.

It can lead to biased estimates if not accounted for.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the types of models mentioned?

A
  • Multiple regression models
  • Nonlinear models
  • Interaction terms
  • Nonparametric models

Each model type serves different purposes in statistical analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Fill in the blank: The term _______ refers to the bias introduced by excluding relevant variables from a model.

A

omitted variable bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the purpose of ANOVA in statistics?

A

To compare means across multiple groups.

ANOVA stands for Analysis of Variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

True or False: Multicollinearity refers to the situation where independent variables in a regression model are highly correlated.

A

True

This can lead to unreliable coefficient estimates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does residual analysis help to identify?

A

Potential problems with a regression model.

It assesses how well the model predictions match the actual data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

List three types of sampling techniques.

A
  • Random sampling
  • Stratified sampling
  • Systematic sampling

Each technique has its strengths and weaknesses in research design.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the role of interaction terms in a regression model?

A

To assess the combined effect of two independent variables on the dependent variable.

This allows for more complex relationships to be modeled.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Fill in the blank: A model that includes both linear and nonlinear relationships is referred to as a _______ model.

A

nonlinear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does the term ‘residual’ refer to in regression analysis?

A

The difference between observed and predicted values.

Residuals are key for assessing model fit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the purpose of using visual analytics in data analysis?

A

To help interpret and communicate data findings effectively.

Visualizations can reveal patterns that might not be apparent in raw data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

True or False: The presence of outliers does not affect the results of a regression analysis.

A

False

Outliers can significantly skew the results and interpretations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a common method for assessing the fit of a regression model?

A

R-squared value.

It indicates the proportion of variance explained by the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does the term ‘scientific paper’ imply in the context of statistical analysis?

A

A document that presents research findings and methodologies.

It often undergoes peer review before publication.