Stats Part 3 Flashcards by James Clyne

What is the chi-square test used for?

To assess whether observed categorical frequencies differ from expected frequencies.

What are the assumptions of a chi-square test?

Expected frequencies of at least 5 in each cell (a common rule of thumb), independent observations, categorical data.

What is the chi-square statistic?

The sum over all categories of the squared difference between observed and expected frequency, each divided by its expected frequency: χ² = Σ (O − E)² / E.
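
As a minimal sketch of that definition, the statistic can be computed directly from two lists of counts. The function name `chi_square_statistic` and the die-roll counts are made up for illustration; they are not part of the deck.

```python
def chi_square_statistic(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical data: 60 rolls of a die, 10 expected per face if the die is fair.
observed = [8, 12, 9, 11, 10, 10]
expected = [10] * 6
stat = chi_square_statistic(observed, expected)
print(stat)
```

A larger value means the observed counts sit further from the expected ones; turning it into a p-value requires the chi-square distribution with the appropriate degrees of freedom, which this sketch leaves out.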

When would you use a chi-square test of independence?

To test if two categorical variables are associated.
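
For a test of independence, the expected count in each cell is usually taken as (row total × column total) / grand total. The sketch below assumes a contingency table stored as a list of rows; the helper names and the 2×2 counts are hypothetical.

```python
def expected_counts(table):
    """Expected cell counts under independence of rows and columns."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def chi_square_independence(table):
    """Return the chi-square statistic and its degrees of freedom."""
    exp = expected_counts(table)
    stat = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, exp)
        for o, e in zip(obs_row, exp_row)
    )
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


# Hypothetical 2x2 table: rows are groups, columns are outcome categories.
table = [[30, 10], [20, 40]]
stat, dof = chi_square_independence(table)
print(stat, dof)
```

The statistic is then compared against a chi-square distribution with `dof` degrees of freedom; that lookup is not included here.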

What is ANOVA?

Analysis of variance, used to compare means across three or more groups.

What is the null hypothesis in ANOVA?

That all group means are equal.

What does a significant F-statistic in ANOVA suggest?

At least one group mean differs from at least one other; the F-test alone does not say which.
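
The F-statistic in one-way ANOVA is the ratio of between-group to within-group mean squares. A minimal sketch, assuming each group is a plain list of numbers (the function name and data are illustrative only):

```python
from statistics import mean


def one_way_anova_f(groups):
    """F = (between-group mean square) / (within-group mean square)."""
    k = len(groups)                       # number of groups
    n = sum(len(g) for g in groups)       # total number of observations
    grand_mean = mean([x for g in groups for x in g])
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n - k)
    return ms_between / ms_within


# Hypothetical measurements for three groups.
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat = one_way_anova_f(groups)
print(f_stat)
```

A large F means the group means vary more than the within-group spread would suggest. The p-value comes from the F distribution with (k − 1, n − k) degrees of freedom, which is omitted here.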

What are assumptions of ANOVA?

Normality within each group (equivalently, of the residuals), homogeneity of variances, independent observations.

What is the multiple comparisons problem?

Testing many hypotheses increases the chance of false positives.
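
To see how quickly the problem grows, here is a small sketch of the family-wise error rate, 1 − (1 − α)^m. It assumes the m tests are independent and all null hypotheses are true; the function name is hypothetical.

```python
def family_wise_error_rate(alpha, m):
    """Chance of at least one false positive across m independent tests."""
    return 1 - (1 - alpha) ** m


for m in (1, 5, 20):
    print(m, round(family_wise_error_rate(0.05, m), 3))
```

Under these assumptions, 20 tests at α = 0.05 give roughly a 64% chance of at least one false positive.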

What is the Bonferroni correction?

A method that controls the family-wise Type I error rate across m tests by comparing each p-value to α/m (equivalently, multiplying each p-value by m).
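
A minimal sketch of the adjustment, assuming a plain list of raw p-values (the function name and the p-values are made up for illustration):

```python
def bonferroni_adjust(p_values):
    """Multiply each p-value by the number of tests, capping at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Hypothetical raw p-values from four tests.
p_values = [0.01, 0.02, 0.04, 0.30]
adjusted = bonferroni_adjust(p_values)
significant = [p < 0.05 for p in adjusted]
print(adjusted, significant)
```

Three of the raw p-values fall below 0.05, but only the first stays below it after adjustment. The correction is simple but conservative, so it can miss real effects when the number of tests is large.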

What is linear regression?

A model that describes the relationship between a dependent variable and one or more predictors.

What is the slope coefficient in linear regression?

It represents the expected change in the outcome for a one-unit increase in the predictor, holding any other predictors constant.

What does the intercept mean in linear regression?

The expected value of the outcome when all predictors are zero.

What is R-squared?

The proportion of variance in the outcome explained by the model.
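
Slope, intercept, and R-squared can all be computed by hand for a single predictor. This sketch uses ordinary least squares on made-up data; the function name is hypothetical.

```python
from statistics import mean


def fit_simple_linear_regression(x, y):
    """Ordinary least squares for one predictor: y ~ intercept + slope * x."""
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    # R-squared = 1 - (residual sum of squares) / (total sum of squares)
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - y_bar) ** 2 for yi in y)
    r_squared = 1 - ss_res / ss_tot
    return slope, intercept, r_squared


# Hypothetical data.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
slope, intercept, r_squared = fit_simple_linear_regression(x, y)
print(slope, intercept, r_squared)
```

Here the fitted line predicts `intercept` when x is 0 and rises by `slope` for each one-unit increase in x, while `r_squared` gives the share of the variance in y that the line accounts for.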

What are the assumptions of linear regression?

Linearity, independence of errors, homoscedasticity (constant error variance), normality of residuals.

What is logistic regression?

A model used to predict the probability of a binary outcome.
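
Logistic regression passes a linear combination of the predictors through the logistic (sigmoid) function, which keeps every prediction strictly between 0 and 1. A minimal sketch with one predictor; the coefficient values below are hypothetical, not fitted from data.

```python
import math


def sigmoid(z):
    """Logistic function: maps any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_probability(x, intercept, slope):
    """P(outcome = 1 | x) under a one-predictor logistic model."""
    return sigmoid(intercept + slope * x)


# Hypothetical coefficients, for illustration only.
intercept, slope = -3.0, 0.5
for x in (0, 6, 12):
    print(x, round(predict_probability(x, intercept, slope), 3))
```

With these coefficients the predicted probability is exactly 0.5 at x = 6, where the linear part equals zero.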

What is an odds ratio?

The ratio of the odds of an event occurring in one group to another.
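
As a worked sketch, the odds ratio from a 2×2 table is (a / b) / (c / d), where a and b are events and non-events in one group and c and d are the same counts in the other. The counts below are invented for illustration.

```python
def odds_ratio(a, b, c, d):
    """Odds of the event in group 1 (a events, b non-events)
    divided by the odds in group 2 (c events, d non-events)."""
    return (a / b) / (c / d)


# Hypothetical counts: 20 of 100 in group 1 and 10 of 100 in group 2 had the event.
ratio = odds_ratio(20, 80, 10, 90)
print(ratio)
```

A value of 1 means equal odds in both groups; values above 1 mean higher odds in the first group.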

Why can’t we use linear regression for binary outcomes?

Because it can predict probabilities outside the [0, 1] range.

What is bootstrapping?

A resampling method that draws repeated samples from the data with replacement.

What is permutation testing?

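A method that repeatedly shuffles group labels to build the null distribution of a test statistic, without distributional assumptions (it does assume the observations are exchangeable under the null).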
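
One way to sketch a two-sample permutation test is to use the absolute difference in means as the test statistic. The function name, the number of permutations, and the data are all assumptions made for this example.

```python
import random
from statistics import mean


def permutation_test(group_a, group_b, n_permutations=2000, seed=0):
    """Two-sided p-value for a difference in means via label shuffling."""
    rng = random.Random(seed)
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    count = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)  # reassign group labels at random
        diff = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if diff >= observed:
            count += 1
    # Add 1 to numerator and denominator so the estimate is never exactly 0.
    return (count + 1) / (n_permutations + 1)


# Hypothetical data with clearly separated groups.
p_value = permutation_test([1, 2, 3, 4, 5], [11, 12, 13, 14, 15])
print(p_value)
```

The p-value is the share of shuffles whose difference is at least as extreme as the observed one, so it is small when the real grouping stands out from random relabelings.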

Why use bootstrapping?

To estimate confidence intervals and standard errors when the theoretical distribution is unknown.
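
A percentile bootstrap confidence interval is one simple way to do this. The sketch below resamples the data with replacement, recomputes the statistic each time, and reads the interval off the sorted estimates. The function name, defaults, and data are hypothetical, and the percentile method is only one of several bootstrap interval methods.

```python
import random
from statistics import mean


def bootstrap_ci(data, stat=mean, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for a statistic."""
    rng = random.Random(seed)
    n = len(data)
    # Each resample draws n values from the data with replacement.
    estimates = sorted(stat(rng.choices(data, k=n)) for _ in range(n_resamples))
    lower = estimates[int((alpha / 2) * n_resamples)]
    upper = estimates[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


# Hypothetical sample.
data = [2, 4, 4, 5, 7, 9, 10, 12]
lower, upper = bootstrap_ci(data)
print(lower, upper)
```

The spread of the resampled estimates also approximates the standard error of the statistic.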

What is a Monte Carlo simulation?

A technique using repeated random sampling to model uncertainty in a process.
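
A classic toy illustration, not taken from the deck, is estimating π by sampling random points in the unit square and counting how many land inside the quarter circle of radius 1.

```python
import math
import random


def estimate_pi(n_samples=100_000, seed=0):
    """Monte Carlo estimate of pi from random points in the unit square."""
    rng = random.Random(seed)
    inside = sum(
        1
        for _ in range(n_samples)
        if rng.random() ** 2 + rng.random() ** 2 <= 1.0
    )
    # The quarter circle covers pi/4 of the square's area.
    return 4 * inside / n_samples


estimate = estimate_pi()
print(estimate, abs(estimate - math.pi))
```

The estimate gets closer to π as `n_samples` grows, with the error shrinking roughly in proportion to 1 / sqrt(n_samples).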

When is simulation useful?

When analytical solutions are hard or when exploring complex systems.

What is the Shapiro-Wilk test?

A test of the null hypothesis that a sample comes from a normally distributed population.

What is Levene’s test used for?

To assess equality of variances across groups.

Why check assumptions before hypothesis testing?

Violations may invalidate the test results.

Stats Part 3 Flashcards

(26 cards)