Final Flashcards
(36 cards)
Probability
The measure of the likelihood that an event will occur
Sample Space
The set of all possible outcomes of an experiment
Event
A subset of the sample space
Compliment
Denoted as A’ or A^c; all outcomes not in A.
Addition Rule
P(A∪B)=P(A)+P(B)−P(A∩B)
Multiplication Rule (for Independent Events)
P(A∩B)=P(A)⋅P(B)
Conditional Probability
P(A|B)= P(A∩B) / P(B)
Probability Mass Function (PMF)
Gives the probability of each possible value in a discrete random variable.
Probability Density Function (PDF)
Gives the probability density of a continuous random variable.
Expected Value (μ)
μ=∑(i=1, n) xi * P(X=xi)
Mean (X^-)
(X^-) = [∑(i=1, n) xi] / n
Variance (Var(x))
Var(x) = [∑(i=1, n) (xi-[X^-])^2] / n
Standard Deviation (σ)
σ = SQRT (Var(x))
Summation Notation
∑(i=1, n) xi
“The sum of xi from i=1 to n.”
Factorial Notation
n! = the product of all positive integers up to n
EX: 3! = 321 = 6
Central Limit Theorem (CLT)
States that the distribution of the sum (or average) of a large number of independent, identically distributed random variables (typically n > 30) approaches a normal distribution, regardless of the original distribution
Conditions for CLT
The random variables must be independent.
The sample size should be sufficiently large.
The original distribution’s shape doesn’t matter.
CLT Formula for Sample Means
If X is a random variable with mean μ and standard deviation
σ, then the distribution of the sample mean (X^-) approaches a normal distribution with a mean, μ, and standard deviation, σ/SQRT(n)
Maximum Likelihood Estimation (MLE)
A method for estimating the parameters of a statistical model that maximizes the likelihood function
Likelihood Function
L(θ|X)=P(X|L) where:
L is the likelihood function
θ is the parameter
X is the data.
Log-Likelihood Function
ℓ(θ)=ln(L(θ|X)); often used for easier calculations
Set dℓ/dθ =0 and solve for the parameter θ.
Confidence Interval (CI)
A range of values constructed from sample data so that the population parameter is likely to occur within that range at a certain level of confidence
Confidence Level
The probability that the interval contains the true parameter
Common choices are 90%, 95%, and 99%
CI Formula for a Mean
(X^ˉ) ± Z⋅σ/SQRT(n) where:
Z is the Z-score corresponding to the desired confidence level.