Week 2 Flashcards
State Bayes’ Theorem for random variables Y and X in terms of their conditional and marginal densities.
f<sub>Y|X</sub>(y|x) = [f<sub>X|Y</sub>(x|y) * f<sub>Y</sub>(y)] / f<sub>X</sub>(x), where f<sub>X</sub>(x) = ∫ f<sub>X|Y</sub>(x|y) * f<sub>Y</sub>(y) dy.
In Bayesian analysis, how is the parameter θ treated, and what represents the initial beliefs about it?
θ is treated as a random variable with a prior density π₀(θ) encapsulating beliefs about θ before observing data.
Write down the formula for the posterior distribution π(θ|x) using Bayes’ Theorem, given data x = (x₁, ..., xₙ).
π(θ|x) = [Π<sub>i=1</sub><sup>n</sup> f<sub>X|θ</sub>(x<sub>i</sub>|θ) * π₀(θ)] / f(x) = [L(θ, x) * π₀(θ)] / ∫ L(θ, x) * π₀(θ) dθ
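The formula above can be checked numerically on a grid. This is a minimal sketch, assuming a hypothetical Bernoulli (coin-flip) likelihood with a uniform prior; the data values and grid size are illustrative, not from the course.

```python
import numpy as np

# Grid over the parameter theta (assumed example: Bernoulli likelihood,
# uniform prior pi_0(theta) = 1 on (0, 1))
theta = np.linspace(0.001, 0.999, 999)
prior = np.ones_like(theta)
x = np.array([1, 0, 1, 1, 0, 1])   # hypothetical data x = (x_1, ..., x_n)

# Likelihood L(theta, x) = prod_i f(x_i | theta), evaluated at every grid point
likelihood = np.prod(theta[None, :] ** x[:, None] *
                     (1 - theta[None, :]) ** (1 - x[:, None]), axis=0)

# Numerator of Bayes' Theorem, then divide by a Riemann-sum estimate of f(x)
unnorm = likelihood * prior
dtheta = theta[1] - theta[0]
posterior = unnorm / (unnorm.sum() * dtheta)

print(posterior.sum() * dtheta)      # ≈ 1: the posterior is a proper density
print(theta[np.argmax(posterior)])   # ≈ 0.667, near the sample mean 4/6
```

The normalizing division is exactly the role of the marginal likelihood f(x) in the denominator of the formula.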
What is the likelihood function, L(θ, x), in the context of Bayesian inference?
L(θ, x) = Π<sub>i=1</sub><sup>n</sup> f<sub>X|θ</sub>(x<sub>i</sub>|θ), representing the probability (or density) of observing the data x given a specific value of the parameter θ.
What is the term for the denominator in the Bayes’ Theorem formula for π(θ|x), and what does it represent?
The denominator, f(x) = ∫ L(θ, x) π₀(θ) dθ, is called the marginal likelihood or evidence. It represents the marginal probability (density) of observing the data x, integrated over all possible values of θ.
What is the proportionality relationship used for calculating the posterior distribution, ignoring the normalizing constant?
π(θ|x) ∝ L(θ, x) * π₀(θ)
Describe how Bayesian updating works sequentially when a new datum x₂ arrives after observing x₁.
The posterior after x₁, π(θ|x₁), becomes the prior for processing x₂. The new posterior is π(θ|x₁, x₂) ∝ f<sub>X|θ</sub>(x₂|θ) * π(θ|x₁).
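The sequential scheme gives the same answer as processing all the data at once. A minimal sketch, assuming a hypothetical Beta(a, b) prior for a Bernoulli parameter (the specific values are illustrative):

```python
# One conjugate-style update: a Bernoulli observation x in {0, 1} turns
# Beta(a, b) into Beta(a + x, b + 1 - x)
def update(a, b, x):
    return a + x, b + 1 - x

a, b = 1, 1        # assumed prior: Beta(1, 1), i.e. uniform
x1, x2 = 1, 0      # two hypothetical data points

# Sequential: the posterior after x1 becomes the prior for x2
a1, b1 = update(a, b, x1)
a12, b12 = update(a1, b1, x2)

# Batch: process (x1, x2) in one step
a_batch, b_batch = a + x1 + x2, b + 2 - x1 - x2

print((a12, b12), (a_batch, b_batch))   # identical: (2, 2) (2, 2)
```

Sequential and batch updating coincide because the likelihood factorizes over independent observations.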
If T = T(X) is a sufficient statistic for θ, how does this simplify the calculation of the posterior distribution π(θ|x)?
The posterior distribution depends on the data x only through the value of the sufficient statistic T(x). That is, π(θ|x) ∝ g(T(x), θ) * π₀(θ), where L(θ, x) = g(T(x), θ)h(x) by the Factorization Theorem.
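A quick sketch of this point, assuming a hypothetical Bernoulli sample with a Beta(1, 1) prior: since T(x) = Σᵢ xᵢ is sufficient, two samples with the same size and the same sum yield identical posteriors.

```python
# Posterior for a Bernoulli parameter under an assumed Beta(a, b) prior:
# Beta(a + T, b + n - T), where T(x) = sum of the observations
def posterior_params(xs, a=1, b=1):
    T = sum(xs)    # sufficient statistic T(x)
    n = len(xs)
    return a + T, b + n - T

# Different orderings, same n and same T(x) => same posterior
print(posterior_params([1, 1, 0, 0]))   # (3, 3)
print(posterior_params([0, 1, 0, 1]))   # (3, 3)
```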
What are the two main computational/analytical challenges mentioned in Bayesian inference related to the posterior and marginal likelihood?
1. Evaluating the marginal likelihood integral f(x) = ∫ L(θ, x) π₀(θ) dθ.
2. Determining the distributional form of the posterior π(θ|x).
What is a conjugate prior family P for a class of likelihood distributions F = {f<sub>X|θ</sub>(x|θ)}?
P is conjugate for F if, for any prior π₀(θ) ∈ P and any likelihood f<sub>X|θ</sub>(x|θ) ∈ F, the resulting posterior distribution π(θ|x) is also in the family P.
What is the main advantage of using a conjugate prior?
It leads to an analytically tractable posterior calculation, meaning the form of the posterior distribution is known and often easy to compute.
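To illustrate the tractability, here is a sketch of a standard conjugate pair: a Normal prior with a Normal likelihood of known variance. The posterior is again Normal, with closed-form parameters, so no integral needs to be evaluated. All numbers below are hypothetical.

```python
# Assumed example: prior theta ~ N(mu0, tau2), data x_i ~ N(theta, sigma2)
# with sigma2 known. The posterior is Normal with precision-weighted mean.
mu0, tau2 = 0.0, 4.0       # prior mean and variance for theta
sigma2 = 1.0               # known observation variance
x = [1.2, 0.8, 1.0, 1.4]   # hypothetical data
n = len(x)
xbar = sum(x) / n

# Closed-form posterior parameters (precisions add; means are precision-weighted)
post_var = 1.0 / (1.0 / tau2 + n / sigma2)
post_mean = post_var * (mu0 / tau2 + n * xbar / sigma2)
print(post_mean, post_var)
```

The whole posterior calculation reduces to arithmetic on the hyperparameters, which is exactly the advantage conjugacy buys.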
Write the general form of a k-parameter exponential family pdf/pmf, f<sub>X|θ</sub>(x|θ).
f<sub>X|θ</sub>(x|θ) = h(x) * c(θ) * exp[ Σ<sub>j=1</sub><sup>k</sup> t<sub>j</sub>(x) * w<sub>j</sub>(θ) ]
What are the components h(x), c(θ), t<sub>j</sub>(x), and w<sub>j</sub>(θ) in the exponential family definition?
h(x) is a function of x only; c(θ) is a function of θ only (related to the normalizing constant); t<sub>j</sub>(x) are the sufficient statistics; w<sub>j</sub>(θ) are functions of the parameters (often called natural parameters).
When is an exponential family called ‘regular’?
The family is regular if the support of the distribution, denoted by the set X, does not depend on the parameter θ.
What is the form of the conjugate prior π₀(θ) for a parameter θ of a regular k-parameter exponential family likelihood?
π₀(θ) = d(α, β) * [c(θ)]<sup>α</sup> * exp[ Σ<sub>j=1</sub><sup>k</sup> β<sub>j</sub> * w<sub>j</sub>(θ) ], where α and β = (β₁, ..., βₖ) are hyperparameters and d(α, β) is the prior normalizing constant.
Given a sample x = (x₁, ..., xₙ) from a regular exponential family and a conjugate prior as defined above, what is the form of the posterior distribution π(θ|x)?
The posterior is proportional to [c(θ)]<sup>(α+n)</sup> * exp[ Σ<sub>j=1</sub><sup>k</sup> (β<sub>j</sub> + Σ<sub>i=1</sub><sup>n</sup> t<sub>j</sub>(x<sub>i</sub>)) * w<sub>j</sub>(θ) ]. It has the same form as the prior but with updated hyperparameters.
How are the hyperparameters (α, β) updated to get the posterior hyperparameters (α*, β*) for the conjugate prior of a regular exponential family after observing data x = (x₁, ..., xₙ)?
α* = α + n; β<sub>j</sub>* = β<sub>j</sub> + Σ<sub>i=1</sub><sup>n</sup> t<sub>j</sub>(x<sub>i</sub>) for j = 1, ..., k.
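The update rule above can be written as a short generic routine. A minimal sketch: the sufficient statistics t<sub>j</sub> are passed as functions, and the instantiation below assumes a Poisson likelihood (k = 1, t₁(x) = x) with hypothetical hyperparameters and data.

```python
# Generic conjugate update for a regular k-parameter exponential family:
#   alpha* = alpha + n;  beta_j* = beta_j + sum_i t_j(x_i)
def conjugate_update(alpha, beta, t, xs):
    alpha_star = alpha + len(xs)
    beta_star = [b_j + sum(t_j(x) for x in xs)
                 for b_j, t_j in zip(beta, t)]
    return alpha_star, beta_star

# Assumed Poisson example: one sufficient statistic, t_1(x) = x
alpha_star, beta_star = conjugate_update(2, [1], [lambda x: x], [3, 1, 4, 2])
print(alpha_star, beta_star)   # 6 [11]
```

The data enter only through n and the sums Σᵢ t<sub>j</sub>(xᵢ), which restates the sufficiency result from the earlier card.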