Central Limit Theorem and Hypothesis Testing Flashcards

(24 cards)

1
Q

t-distribution

A

a probability distribution that is used when estimating population parameters when the sample size is small, and the population standard deviation is unknown

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

the t-distribution resembles a normal distribution but has heavier tails. What does this mean?

A

it’s more prone to producing values far from its mean (allows for more variability and accounts for the uncertainty due to smaller sample sizes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

the shape of the t-distribution depends on the _____, which is dependent on ____

A

degrees of freedom, sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

when do you use a t-distribution over a z-distribution?

A
  • when sample size is less than 30
  • for hypothesis testing to compare means and compute confidence intervals
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are the three types of t-tests and what do they do?

A
  1. one-sample: determines if the sample mean is significantly different from a known value
  2. two-sample: compares the means of two independent groups
  3. paired: compares the means of two related groups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

how do you calculate a t-score (formula)

A

t = (Xbar - μ)/(s/√n)

Xbar is sample mean
μ is population mean
s is the sample SD
n is sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is the chi-squared distribution?

A

when you sum the squares of
𝑘 independent standard normal random variables (variables with a mean of 0 and a standard deviation of 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is the chi-square statistic?

A

a value used in chi-squared tests to measure how much observed data deviate from what we would expect under a particular hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

formula for chi-squared statistic

A

χ^2 =∑ ((Oi - Ei)^2)/Ei)

Oi = observed freq for category i
Ei = expected freq for category i
the sum of ∑ is taken over all categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what are common applications of chi-squared distribution?

A
  • goodness of fit test
  • test of independence
  • variance estimation
  • ANOVA and regression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

chi-squared goodness of fit test

A

determines whether an observed frequency distribution matches an expected frequency distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is the F distribution?

A

used to compare two variances by assessing the ratio of these variances

used often in ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

confidence interval

A

provides a range of values, derived from the sample data, that is likely to contain the true population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

confidence level

A

indicates the degree of certainty that the interval contains the parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

point estimation

A

involves calculating a single value (a point estimate) from sample data to estimate an unknown population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

margin of error

A

usually calculated as the the critical value (from the z or t distribution) times the standard error of the point estimate

17
Q

Central Limit Theorem

A

states that the mean of a sufficiently large number of iterates of random values will be approximately normally distributed

18
Q

irrespective of the shape of the underlying distribution of the population, by increasing the sample size, sample means and proportions will ….

A

approximate normal distributions if the sample sizes are sufficiently large

19
Q

the law of large numbers

A

as the number of repetitions of a probability experiment increases, the proportion with which a certain outcome is observed gets closer to the probability of the outcome

20
Q

standard error and formula

A

measures the variability or dispersion of the sample statistic from the population parameter

sample deviation of the sample means

SE = (s/√n)

21
Q

type I error

A

rejecting null hypothesis when it’s true

22
Q

type II error

A

not rejecting the null hypothesis when it is false

23
Q

how do you reduce probability of making a type I error?

24
Q

reducing probability of type I error usually