Final Flashcards

(36 cards)

1
Q

Probability

A

The measure of the likelihood that an event will occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sample Space

A

The set of all possible outcomes of an experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Event

A

A subset of the sample space

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Compliment

A

Denoted as A’ or A^c; all outcomes not in A.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Addition Rule

A

P(A∪B)=P(A)+P(B)−P(A∩B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Multiplication Rule (for Independent Events)

A

P(A∩B)=P(A)⋅P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Conditional Probability

A

P(A|B)= P(A∩B) / P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Probability Mass Function (PMF)

A

Gives the probability of each possible value in a discrete random variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Probability Density Function (PDF)

A

Gives the probability density of a continuous random variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Expected Value (μ)

A

μ=∑(i=1, n) xi * P(X=xi)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Mean (X^-)

A

(X^-) = [∑(i=1, n) xi] / n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Variance (Var(x))

A

Var(x) = [∑(i=1, n) (xi-[X^-])^2] / n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Standard Deviation (σ)

A

σ = SQRT (Var(x))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Summation Notation

A

∑(i=1, n) xi
“The sum of xi from i=1 to n.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Factorial Notation

A

n! = the product of all positive integers up to n
EX: 3! = 321 = 6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Central Limit Theorem (CLT)

A

States that the distribution of the sum (or average) of a large number of independent, identically distributed random variables (typically n > 30) approaches a normal distribution, regardless of the original distribution

17
Q

Conditions for CLT

A

The random variables must be independent.
The sample size should be sufficiently large.
The original distribution’s shape doesn’t matter.

18
Q

CLT Formula for Sample Means

A

If X is a random variable with mean μ and standard deviation
σ, then the distribution of the sample mean (X^-) approaches a normal distribution with a mean, μ, and standard deviation, σ/SQRT(n)

19
Q

Maximum Likelihood Estimation (MLE)

A

A method for estimating the parameters of a statistical model that maximizes the likelihood function

20
Q

Likelihood Function

A

L(θ|X)=P(X|L) where:
L is the likelihood function
θ is the parameter
X is the data.

21
Q

Log-Likelihood Function

A

ℓ(θ)=ln(L(θ|X)); often used for easier calculations
Set dℓ/dθ =0 and solve for the parameter θ.

22
Q

Confidence Interval (CI)

A

A range of values constructed from sample data so that the population parameter is likely to occur within that range at a certain level of confidence

23
Q

Confidence Level

A

The probability that the interval contains the true parameter
Common choices are 90%, 95%, and 99%

24
Q

CI Formula for a Mean

A

(X^ˉ) ± Z⋅σ/SQRT(n) where:
Z is the Z-score corresponding to the desired confidence level.

25
Hypothesis Testing Steps
State the null hypothesis (H0) and alternative hypotheses (H1). Choose the significance level (α). Calculate the test statistic. Make a decision: reject or fail to reject the null hypothesis.
26
Type I Error
Rejecting a true null hypothesis (false positive)
27
Type II Error
Failing to reject a false null hypothesis (false negative)
28
P-Value for Z TEST
1 Sided: P(Z > z) or P(Z < z) as appropriate. 2 Sided: P(|Z| > |z|) where z is the calculated Z-value.
29
P-Value for T TEST
1 Sided: P(t > t_observed) or P(t < t_observed) as appropriate. 2 Sided: P(|t| > |t_observed|) where t_observed is the calculated t-value.
30
CLT for Sample Proportions
If X is the number of successes in a sample of size, n, from a population with proportion, p, the distribution of (^p) (sample proportion) approaches a normal distribution with mean, p and standard deviation SQRT [(p*(1-p)) / n]. EX: Z = (^p)-p / (SQRT [(p*(1-p)) / n])
31
MLE for Normal Distribution (Known Variance)
If X1, X2,..., Xn are independent and identically distributed (i.i.d.) random variables from a normal distribution with mean, μ, and known variance, σ^2, the MLE for μ is the sample mean, (X̄)
32
CI for Population Proportion
(^p) ± Z * SQRT [(^p * (1 - ^p)) / n
33
Test Statistic for Population Mean (Known Variance) (Z TEST)
For testing a hypothesis about the population mean (μ) with known variance (σ^2), the test statistic is: Z = (X̄ - μ₀) / (σ / √n), where: X̄ is the sample mean, μ₀ is the hypothesized population mean.
34
Z-Score (Z)
The Z-score is determined by the desired level of confidence. You can find this value in a Z-table or use statistical software. EX: If you're constructing a 95% confidence interval, use a Z-score corresponding to the critical value for a 95% confidence interval.
35
Test Statistic for Population Mean (Unknown Variance) (T TEST)
t = (X̄ - μ₀) / (s / √n)
36
Degrees of Freedom (df)
df = n - 1 where: n is the sample size