Week 6 Flashcards
(39 cards)
“What additional specification is needed to implement a fully hierarchical model beyond the prior π(θ|φ)?”
A hyperprior distribution π(φ) for the hyperparameter(s) φ.
“In a fully hierarchical model (Y|θ ~ fθ, θ|φ ~ π(θ|φ), φ ~ π(φ)), how is the joint posterior π(θ, φ | y) expressed proportionally?”
π(θ, φ | y) ∝ L(y|θ) * π(θ|φ) * π(φ), where L(y|θ) is the likelihood Π fθ(yi).
“What is the main challenge mentioned regarding the posterior distribution of hierarchical models?”
In many cases, it cannot be evaluated analytically.
“What sampling approach is suggested as suitable for handling hierarchical models?”
Gibbs sampling.
“In the school data example, what does yij represent?”
The score of the j-th student in the i-th school.
“In the school data example, what does ni represent?”
The number of pupils who took the test in school i.
“Specify the hierarchical model structure used for the school data example.”
yij|μi, σ² ~ N(μi, σ²); μi|μ0, σ0² ~ N(μ0, σ0²); σ⁻² ~ Ga(a, b); μ0 ~ N(μ00, σ00²); σ0⁻² ~ Ga(a0, b0).
“How is the joint posterior distribution π(μ1,…,μ38, σ⁻², μ0, σ0⁻² | y) expressed proportionally in the school data example?”
∝ [Π(i=1 to 38) Π(j=1 to ni) f(yij|μi, σ²)] * [Π(i=1 to 38) π(μi|μ0, σ0²)] * π(σ⁻²) * π(μ0) * π(σ0⁻²).
“What is the full conditional distribution for the school-specific mean μi in the example?”
N( (Σ(j=1 to ni) yij / σ² + μ0 / σ0²) / (ni / σ² + 1 / σ0²), 1 / (ni / σ² + 1 / σ0²) ).
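The mean of this full conditional is a precision-weighted average of school i's scores and the prior mean μ0. A minimal sketch with made-up scores; the function name and every number below are hypothetical and not taken from the school data:

```python
def mu_i_full_conditional(y_i, sigma2, mu0, sigma0_2):
    """Return (mean, variance) of the normal full conditional for mu_i."""
    n_i = len(y_i)
    # Total precision = data precision + prior precision.
    precision = n_i / sigma2 + 1.0 / sigma0_2
    mean = (sum(y_i) / sigma2 + mu0 / sigma0_2) / precision
    return mean, 1.0 / precision


# Hypothetical scores for one school (illustration only).
scores = [0.8, 1.2, 1.0, 1.4]
cond_mean, cond_var = mu_i_full_conditional(
    scores, sigma2=1.0, mu0=0.0, sigma0_2=0.25
)
```

With these values the data precision (4 / 1) equals the prior precision (1 / 0.25), so the conditional mean of 0.55 sits halfway between the sample mean 1.1 and μ0 = 0.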
“What is the full conditional distribution for the inverse variance σ⁻² in the school data example?”
Ga( a + (1/2)Σ(i=1 to 38) ni, b + (1/2)Σ(i=1 to 38) Σ(j=1 to ni) (yij - μi)² ).
“What is the full conditional distribution for the overall mean hyperparameter μ0 in the school data example?”
N( (Σ(i=1 to 38) μi / σ0² + μ00 / σ00²) / (38 / σ0² + 1 / σ00²), 1 / (38 / σ0² + 1 / σ00²) ).
“What is the full conditional distribution for the inverse hypervariance σ0⁻² in the school data example?”
Ga( a0 + 19, b0 + (1/2)Σ(i=1 to 38) (μi - μ0)² ). (Note: 19 = 38/2)
“What specific hyperparameter values were used in the school data analysis (a, b, μ00, σ00², a0, b0)?”
a = 0.001, b = 0.001, μ00 = 0, σ00² = 100², a0 = 0.001, b0 = 0.001.
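The four full conditionals above can be cycled through in turn to give one Gibbs sweep. The sketch below is an illustration under stated assumptions, not the code behind the reported analysis: it runs on simulated scores rather than the real school data, and the function name `gibbs_school`, the simulated truth (μ0 = 0, σ0 = 0.3, σ = 1), the starting values and the chain length are all hypothetical. The hyperparameter defaults are the values on the card above. Python's `random.gammavariate` takes a scale, so each Ga(shape, rate) draw inverts the rate.

```python
import random
import statistics


def gibbs_school(y, n_iter=2000, burn_in=500, seed=0,
                 a=0.001, b=0.001, mu00=0.0, sigma00_2=100.0 ** 2,
                 a0=0.001, b0=0.001):
    """Gibbs sampler for the two-level normal model.

    y is a list of per-school lists of scores. Returns post-burn-in
    draws of (mu0, sigma^2, sigma0^2).
    """
    rng = random.Random(seed)
    m = len(y)
    n = [len(scores) for scores in y]
    s1 = [sum(scores) for scores in y]                 # per-school sums
    s2 = [sum(v * v for v in scores) for scores in y]  # per-school sums of squares
    total_n = sum(n)

    # Assumed starting values: school sample means, unit precisions.
    mu = [s1[i] / n[i] for i in range(m)]
    mu0, tau, tau0 = 0.0, 1.0, 1.0  # tau = sigma^-2, tau0 = sigma0^-2

    draws = []
    for it in range(n_iter):
        # mu_i | rest: normal with precision n_i * tau + tau0.
        for i in range(m):
            prec = n[i] * tau + tau0
            mean = (s1[i] * tau + mu0 * tau0) / prec
            mu[i] = rng.gauss(mean, prec ** -0.5)

        # sigma^-2 | rest ~ Ga(a + N/2, b + SS/2), with SS = sum (y_ij - mu_i)^2.
        ss = sum(s2[i] - 2.0 * mu[i] * s1[i] + n[i] * mu[i] ** 2
                 for i in range(m))
        tau = rng.gammavariate(a + 0.5 * total_n, 1.0 / (b + 0.5 * ss))

        # mu0 | rest: normal with precision m * tau0 + 1 / sigma00^2.
        prec0 = m * tau0 + 1.0 / sigma00_2
        mean0 = (sum(mu) * tau0 + mu00 / sigma00_2) / prec0
        mu0 = rng.gauss(mean0, prec0 ** -0.5)

        # sigma0^-2 | rest ~ Ga(a0 + m/2, b0 + sum (mu_i - mu0)^2 / 2).
        ss0 = sum((mu_i - mu0) ** 2 for mu_i in mu)
        tau0 = rng.gammavariate(a0 + 0.5 * m, 1.0 / (b0 + 0.5 * ss0))

        if it >= burn_in:
            draws.append((mu0, 1.0 / tau, 1.0 / tau0))
    return draws


# Simulated data: 38 schools with 10 to 30 pupils each (not the real data).
data_rng = random.Random(1)
true_mu = [data_rng.gauss(0.0, 0.3) for _ in range(38)]
y = [[data_rng.gauss(m_i, 1.0) for _ in range(data_rng.randint(10, 30))]
     for m_i in true_mu]

draws = gibbs_school(y)
mu0_median = statistics.median(d[0] for d in draws)
sigma2_median = statistics.median(d[1] for d in draws)
sigma0_2_median = statistics.median(d[2] for d in draws)
```

The sums of scores and squared scores are precomputed so that each sweep costs one pass over the schools rather than over every pupil.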
“What is the posterior median estimate for μ0 in the school data example?”
0.11.
“What is the 95% credible interval for μ0 in the school data example?”
(-0.14, 0.11).
“What is the posterior median estimate for σ² in the school data example?”
0.91.
“What is the 95% credible interval for σ² in the school data example?”
(0.87, 0.99).
“What is the posterior median estimate for σ0² in the school data example?”
0.10.
“What is the 95% credible interval for σ0² in the school data example?”
(0.05, 0.17).
“What complication can arise in implementing a Gibbs sampler, especially in hierarchical models?”
Some of the full conditional distributions may not be standard distributions, so they cannot be sampled from directly.
“What approach is typically used within a Gibbs sampler to handle non-standard full conditional distributions?”
A Metropolis-Hastings step is used to update the parameter whose full conditional is non-standard, while the remaining parameters keep their direct Gibbs draws.
“What is the name given to the approach where Metropolis-Hastings steps are used for some updates within a Gibbs sampler?”
Metropolis-within-Gibbs.
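A rough sketch of Metropolis-within-Gibbs on a hypothetical model that is an assumption for illustration, not one stated on these cards: counts r_i ~ Binomial(n_i, θ_i) with x_i = logit(θ_i) ~ N(μ, 1/τ), μ ~ N(m0, s0²) and τ ~ Ga(a, b). The full conditional of each x_i is non-standard, so it gets a random-walk Metropolis-Hastings step, while μ and τ have conjugate full conditionals and are drawn directly. The counts, the step size and all names are made up.

```python
import math
import random


def log_cond_x(x, r, n, mu, tau):
    """Log full conditional of x = logit(theta), up to an additive constant."""
    # Binomial log-likelihood written in x, plus the N(mu, 1/tau) log-prior.
    return r * x - n * math.log1p(math.exp(x)) - 0.5 * tau * (x - mu) ** 2


def metropolis_within_gibbs(r, n, n_iter=3000, burn_in=500, step=0.5, seed=0,
                            m0=0.0, s0_2=100.0, a=0.001, b=0.001):
    rng = random.Random(seed)
    k = len(r)
    x = [0.0] * k          # assumed starting values
    mu, tau = 0.0, 1.0
    theta_draws, accepted = [], 0
    for it in range(n_iter):
        # Metropolis-Hastings step for each x_i (symmetric random-walk proposal).
        for i in range(k):
            proposal = x[i] + rng.gauss(0.0, step)
            log_ratio = (log_cond_x(proposal, r[i], n[i], mu, tau)
                         - log_cond_x(x[i], r[i], n[i], mu, tau))
            if rng.random() < math.exp(min(0.0, log_ratio)):
                x[i] = proposal
                accepted += 1

        # Gibbs step for mu: normal full conditional.
        prec = k * tau + 1.0 / s0_2
        mu = rng.gauss((tau * sum(x) + m0 / s0_2) / prec, prec ** -0.5)

        # Gibbs step for tau: gamma full conditional (rate inverted to a scale).
        ss = sum((x_i - mu) ** 2 for x_i in x)
        tau = rng.gammavariate(a + 0.5 * k, 1.0 / (b + 0.5 * ss))

        if it >= burn_in:
            theta_draws.append([1.0 / (1.0 + math.exp(-x_i)) for x_i in x])
    return theta_draws, accepted / (n_iter * k)


# Made-up counts for five units (illustration only).
r = [2, 25, 7, 60, 3]
n = [50, 120, 100, 400, 80]
theta_draws, acc_rate = metropolis_within_gibbs(r, n)
theta_mean = [sum(d[i] for d in theta_draws) / len(theta_draws)
              for i in range(len(r))]
```

The proposal standard deviation `step` would normally be tuned by checking `acc_rate`; the value 0.5 here is arbitrary.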
“In the surgical outcomes example, what does ri represent?”
The number of deaths in hospital i.
“What model is used for the number of deaths ri given ni operations and mortality rate θi?”
ri | ni, θi ~ Binomial(ni, θi).
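To make the binomial model concrete, here is a small sketch that evaluates its probability mass function and simulates one death count. The helper names and the values n = 20, θ = 0.1 are hypothetical and not taken from the surgical data.

```python
import math
import random


def binomial_pmf(r, n, theta):
    """P(r deaths in n operations) when each death has probability theta."""
    return math.comb(n, r) * theta ** r * (1.0 - theta) ** (n - r)


def simulate_deaths(n, theta, rng):
    """Draw one death count as a sum of n independent Bernoulli(theta) outcomes."""
    return sum(rng.random() < theta for _ in range(n))


# Hypothetical hospital: 20 operations, mortality rate 0.1.
n_ops, theta = 20, 0.1
pmf = [binomial_pmf(r, n_ops, theta) for r in range(n_ops + 1)]
simulated_r = simulate_deaths(n_ops, theta, random.Random(0))
```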