Chapter 5 - Trees Flashcards

Question 1

Q

What is spatial compositional data?

Answer

A

consist of compositional data observed across geographic locations
both the relative structure of the composition and its spatial information are crucial
example: proportion of tree species types across plots in a forest

Question 2

Q

What motivated your choice of the spatial tree species data?

Answer

A

real-world application, representing species diversity across the geographic region
compositional structure (proportions of multiple tree types per location)

Question 3

Q

How did you incorporate spatial structure into your model?

Answer

A

spatial penalised regression splines were used to model the latent spatial variation across locations
splines allow the model to borrow strength across neighbouring locations while remaining flexible
spline basis functions were incorporated as latent variables in the linear predictor of the GDM model, allowing the mean composition to vary smoothly over space

Question 4

Q

What are spatial penalised regression splines and why did you use them?

Answer

A

spatial penalised regression splines incorporate a penalty term in the spline to prevent overfitting
smoothness of the spatial surface is controlled by a penalty term (lambda)
high lambda - more smooth curve
low lambda - more flexible, wiggly curve
provides a computationally efficient and flexible way to model the spatial structure

Question 5

Q

How did you handle missing values in the spatial data?

Answer

A

missing values in the spatial compositions were handled directly as the GDM can handle and predict missing values
sampled during the MCMC process

Question 6

Q

How did you evaluate predictive accuracy across spatial locations?

Answer

A

posterior predictive model checking was used to assess how well the model produced missing tree counts

Question 7

Q

What were the benefits of using GDM for spatial data?

Answer

A

GDM can handle overdispersed counts
accommodates zero counts naturally
combined with spatial splines, it provided flexible way of understanding complex spatial relationships

Question 8

Q

How are spatial penalised regression splines incorporated in your spatial framework?

Answer

A

included as random effects in the linear predictor of the GDM model
coefficients were inferred jointly with other parameters in a Bayesian framework
splines captured smooth spatial variation

Question 9

Q

What were the key evaluation metrics used to assess model performance?

Answer

A

MAE
RMSE
Bayesian Coverage and mean width of uncertainty intervals
Xi - out-of-sample R^2 quantifying prediction error variance relative to the baseline

Question 10

Q

How did you ensure fair comparisons between your models?

Answer

A

using the same tree species and randomly sampled spatial locations
using the same basis function / model specification

Question 11

Q

How did your GDM models compare with GAMs in predictive accuracy?

Answer

A

GDM outperformed in producing counts that were more similar to the original counts
occurred across the different levels of missing components, overall for each tree species and overall - for both MAE, RMSE and xi

Question 12

Q

In what scenarios did your methods outperform traditional methods the most?

Answer

A

GDM outperforms could be explained by the GDM’s ability to predict missing values with compositional constraints.
Specifically, within the GDM, if the model observes high counts of one tree species, it knows to predict low counts for the remaining species.
In contrast, the GAM lacks this knowledge, leading to over-prediction of very high counts.

Therefore, the GDM is an effective tool for modelling spatial compositional data

Question 13

Q

What ecological interpretations can be made from your tree species spatial model?

Answer

A

model revealed distinct areas in the grid that were dominated by particular species
could help identify hotspots of species diversity and areas where specific species are likely to be under or over represented

Question 14

Q

How generalisable are your methods to other types of compositional data?

Answer

A

Spatial compositional data:
* Environmental data (e.g. farming crops in different field)
* Epidemiological data (e.g. disease prevalence)

The framework is broadly useful for compositional data (counts or proportions) with spatial information.

Question 15

Q

What are the main limitations of your approaches?

Answer

A

computational cost - running the model
spatial - decision on what spatial model to use (e.g. spatial penalised regression spline, GMRF, CAR)

Question 16

Q

How would your model handle spatio-temporal data?

Answer

Study These Flashcards

A

The current spatial GDM model could be extended to spatio-temporal settings by
* adding temporal spline terms
* dynamic latent processes, including time-varying covariates in the linear predictor

This would allow the model to capture temporal evolution of spatial composition patterns such as species migration or seasonal effects

Question 17

Q

Describe your model’s latent structure?

Answer

Study These Flashcards

A

spline-based spatial random effects - capturing smooth variation of the compositions

Chapter 5 - Trees Flashcards

(17 cards)