week 6 Flashcards by tyrion lannister

“What additional specification is needed to implement a fully hierarchical model beyond the prior π(θ|φ)?”

A hyperprior distribution π(φ) for the hyperparameter(s) φ.

How well did you know this?

Not at all

Perfectly

“In a fully hierarchical model (Y|θ ~ fθ, θ|φ ~ π(θ|φ), φ ~ π(φ)), how is the joint posterior π(θ, φ | y) expressed proportionally?”

π(θ, φ | y) ∝ L(y|θ) * π(θ|φ) * π(φ), where L(y|θ) is the likelihood Π fθ(yi).

How well did you know this?

Not at all

Perfectly

“What is the main challenge mentioned regarding the posterior distribution of hierarchical models?”

In many cases, it cannot be evaluated analytically.

How well did you know this?

Not at all

Perfectly

“What sampling approach is suggested as suitable for handling hierarchical models?”

Gibbs sampling.

How well did you know this?

Not at all

Perfectly

“In the school data example, what does yij represent?”

The score of the j-th student in the i-th school.

How well did you know this?

Not at all

Perfectly

“In the school data example, what does ni represent?”

The number of pupils who took the test in school i.

How well did you know this?

Not at all

Perfectly

“Specify the hierarchical model structure used for the school data example.”

yij|μi, σ² ~ N(μi, σ²); μi|μ0, σ0² ~ N(μ0, σ0²); σ⁻² ~ Ga(a, b); μ0 ~ N(μ00, σ00²); σ0⁻² ~ Ga(a0, b0).

How well did you know this?

Not at all

Perfectly

“How is the joint posterior distribution π(μ1,…,μ38, σ⁻², μ0, σ0⁻² | y) expressed proportionally in the school data example?”

∝ [Π(i=1 to 38) Π(j=1 to ni) f(yij|μi, σ²)] * [Π(i=1 to 38) π(μi|μ0, σ0²)] * π(σ⁻²) * π(μ0) * π(σ0⁻²).

How well did you know this?

Not at all

Perfectly

“What is the full conditional distribution for the school-specific mean μi in the example?”

N( (Σ(j=1 to ni) yij / σ² + μ0 / σ0²) / (ni / σ² + 1 / σ0²), 1 / (ni / σ² + 1 / σ0²) ).

How well did you know this?

Not at all

Perfectly

“What is the full conditional distribution for the inverse variance σ⁻² in the school data example?”

Ga( a + (1/2)Σ(i=1 to 38) ni, b + (1/2)Σ(i=1 to 38) Σ(j=1 to ni) (yij - μi)² ).

How well did you know this?

Not at all

Perfectly

“What is the full conditional distribution for the overall mean hyperparameter μ0 in the school data example?”

N( (Σ(i=1 to 38) μi / σ0² + μ00 / σ00²) / (38 / σ0² + 1 / σ00²), 1 / (38 / σ0² + 1 / σ00²) ).

How well did you know this?

Not at all

Perfectly

“What is the full conditional distribution for the inverse hypervariance σ0⁻² in the school data example?”

Ga( a0 + 19, b0 + (1/2)Σ(i=1 to 38) (μi - μ0)² ). (Note: 19 = 38/2)

How well did you know this?

Not at all

Perfectly

“What specific hyperparameter values were used in the school data analysis (a, b, μ00, σ00², a0, b0)?”

a = 0.001, b = 0.001, μ00 = 0, σ00² = 100², a0 = 0.001, b0 = 0.001.

How well did you know this?

Not at all

Perfectly

“What is the posterior median estimate for μ0 in the school data example?”

0.11.

How well did you know this?

Not at all

Perfectly

“What is the 95% credible interval for μ0 in the school data example?”

(-0.14, 0.11).

How well did you know this?

Not at all

Perfectly

“What is the posterior median estimate for σ² in the school data example?”

Study These Flashcards

0.91.

“What is the 95% credible interval for σ² in the school data example?”

Study These Flashcards

(0.87, 0.99).

“What is the posterior median estimate for σ0² in the school data example?”

Study These Flashcards

0.10.

“What is the 95% credible interval for σ0² in the school data example?”

Study These Flashcards

(0.05, 0.17).

“What complication can arise in implementing a Gibbs sampler, especially in hierarchical models?”

Study These Flashcards

Some of the full conditional distributions might not be known distributions (i.e., cannot be sampled from directly).

“What approach is typically used within a Gibbs sampler to handle unknown full conditional distributions?”

Study These Flashcards

A Metropolis-Hastings step is used to sample from that specific unknown conditional distribution.

“What is the name given to the approach where Metropolis-Hastings steps are used for some updates within a Gibbs sampler?”

Study These Flashcards

Metropolis-within-Gibbs.

“In the surgical outcomes example, what does ri represent?”

Study These Flashcards

The number of deaths in hospital i.

“What model is used for the number of deaths ri given ni operations and mortality rate θi?”

Study These Flashcards

ri | ni, θi ~ Binomial(ni, θi).

"What model is used for the hospital-specific mortality rate θi given hyperparameters μ and φ?"

θi | μ, φ ~ Beta(μφ, (1-μ)φ).

"What prior distribution is used for the hyperparameter μ in the surgical outcomes example?"

μ ~ Beta(1, 1) (which is Uniform(0,1)).

"What prior distribution is used for the hyperparameter φ in the surgical outcomes example?"

φ ~ Gamma(1, 0.01).

"What is the full conditional distribution for θi | μ, φ, θ-i, y in the surgical outcomes example?"

Beta(μφ + ri, (1-μ)φ + ni - ri).

"Is the full conditional distribution for μ | θ1,...,θ12, φ a standard distribution?"

No, it is proportional to [Π θi^(μφ) * (1-θi)^((1-μ)φ)] / [Γ(μφ)Γ((1-μ)φ)]^12 (and the prior for μ).

"Is the full conditional distribution for φ | θ1,...,θ12, μ a standard distribution?"

No, it is proportional to [Γ(φ) / (Γ(μφ)Γ((1-μ)φ))]^12 * [Π θi^(μφ) * (1-θi)^((1-μ)φ)] * φ^(a-1)exp(-bφ) (using general Gamma prior notation, slide used a=1, b=0.01).

"In the surgical outcomes Gibbs sampler, how are the θi's updated?"

Directly sampled from their Beta full conditional distribution.

"In the surgical outcomes Gibbs sampler, how are μ and φ updated?"

Using Metropolis-Hastings random walk steps, as their full conditionals are not standard distributions.

"In the power plant pumps example, what distribution does the number of failures Yj follow given failure rate θj and time tj?"

Yj | θj ~ Poisson(θj * tj).

"In the power plant pumps example, what prior distribution is assumed for the failure rate θj given hyperparameters α and β?"

θj | α, β ~ Gamma(α, β).

"What hyperpriors were assumed for α and β in the power plant pumps example by George et al. (1993)?"

α ~ Exp(1) and β ~ Gamma(a1, b1) with a1=0.1, b1=1.

"What is the full conditional distribution for θj in the power plant pumps example?"

Gamma(yj + α, tj + β).

"Is the full conditional distribution for α in the power plant pumps example a standard distribution?"

No, it is proportional to [β^α / Γ(α)]^10 * (Π θj^(α-1)) * exp(-α).

"What is the full conditional distribution for β in the power plant pumps example?"

Gamma(a1 + 10α, b1 + Σ(j=1 to 10) θj).

"How is the posterior sampling performed in the power plant pumps example, given the non-standard conditional for α?"

Using Metropolis-within-Gibbs: Update θj's and β directly from their Gamma conditionals, update α using a Metropolis-Hastings step.

week 6 Flashcards

(39 cards)