Fundamentals of Probability Flashcards
(37 cards)
Three steps to generate a statistical model
(1) What is the data-generating process (DGP)?
(2) Build an appropriate probability model that reflects the assumed DGP including
assumptions of how Y is distributed (i.e., stochastic component)
(3) Come-up with a parameterization of the stuff that gets estimated (i.e., systematic component) and theory of inference to derive statistical model
Data-generating process
This is the joint probability distribution that is supposed to characterize the entire population from which the data set has been drawn.
Stochastic Component
The assumption about the way Y is distributed, in the case of linear regression it is an assumption about the normal distribution
yi~N(yi|μi, σ^2)
Systematic Component
Parameterization of the stuff that gets estimated
μi=B0+B1Xi+B2X2+….
Population regression function
yi=alpha+betax1+ui(error term), i=1,…,n
Sample regression function
yi_hat=(alpha_hat)+(beta_hat)xi, i=1,…,n
Experiment
Repeatable procedure for making an observation
Outcome
possible result of repeatable procedure for making observation
The sample space (Ω) of an experiment
Set of all possible outcomes
An event
Subset of the sample space, i.e., any set of outcomes
The probability of an event
it’s long-run relative frequency.
A ∪ B
Give the operation name, definition and interpretation
Union
elements either in A or B or in both occur
either A or B or both
A ∩ B
Give the operation name, definition and interpretation
Intersection
elements both in A and B
both A and B occur
A_hat
Give the operation name, definition and interpretation
Complement
elements not in A
A does not occur
A ⊆ B
- If B contains A
: “when A occurs, so does B (but not
necessarily vice versa)”
The intuitive definition of probability
assigning real numbers to every element of the
sample space in a way that the sum of all such numbers is 1.
- A random variable
function that assigns a number to each outcome of the
sample space of an experiment.
Probability distributions
for all possible
outcomes, it tells us the probabilities for these outcomes to occur.
Which types of distributions exist?
Discrete, Continuous
What is the discrete distribution?
e.g., Bernoulli, Binomial, Poisson
What are the Continuous distributions?
e.g., Uniform, Normal, Logistic, t-distribution
Probability density function: definition and formal way to write it
Distribution of probabilities for all values of a random variable X
The probabilities p(x) or P(X = x) for all values of a random variable X form the probability density function (PDF).
Probability density function (PDF): What is the probability that we get
* exactly xi
(for discrete distributions)?
* a ≤ xi ≤ b (for continuous distributions)?
Cumulative distribution function (CDF):
The probability of observing a value less or equal than x
cumulative distribution function (CDF): What is the probability that we get some
value equal to or smaller than xi?
Expected Value: definition and formula
Specifies the center of the probability distribution
X discrete:
E(X) = ∑(all x) * xp(x)
X continuous:
E(X) = ∫( +∞ −∞) * xp(x)dx