Shapland Flashcards
3 advantages of Bootstrap Model
- Generates a distribution of possible outcomes as opposed to a single point estimate.
Provides more info of potential results; can be used for capital modeling - Can be modified to the statistical features of the data under analysis
- Can reflect the fact that insurance loss distributions are generally skewed right. This is because the sampling process does not require a distribution assumption.
Model reflects the level of skewness in the underlying data.
Briefly describe the ODP Model
Incremental claims q(w,d) are modelled directly using a GLM.
Link function: Log
Distribution: ODP
Steps:
1. Use the model to estimate parameters
2. Use bootstrapping (sampling residuals with replacement) to estimate total distribution
Calculate E(q(w,d)) and V(q(w,d)) using ODP GLM Model
E(q(w,d)) = m_w,d
ln(m_w,d) = n_w,d
n_w,d = a_w + sum(b_d)
V(q(w,d)) = phi*m^z_w,d
z = 0 if Normal error distribution
z = 1 if Poisson error distribution
z = 2 if Gamma error distribution
Explain the GLM Model Setup for a 3x3 triangle
See image
Briefly explain for to solve for weight matrix in GLM model
Solve for a and b parameters of the Y = X*A matrix equation that minimizes the squared difference between vector of the log of actual incremental losses (Y) and the log of expected incremental losses (Yhat).
Use Maximum Likelihood or the Newton-Raphson method.
Calculate fitted incremental using GLM Model
ln(E(IncLoss_AY,d)) = ln(m_w,d) = n_w,d) = a_w + sum(b_d)
E(IncLoss_AY,d) = m_w,d = exp(n_w,d)
Explain what is the simplified GLM Method and its steps
Fitted (expected) incremental using a Poisson error distribution are the same as incremental losses using volume-weighted average LDFs.
Steps:
1. Use cumulative claim triangle to calculate LDFs
2. Develop losses to ultimate
3. Calculate expected cumulative triangle
4. Calculate expected incremental triangle from cumulative triangle
3 advantages of the simplified GLM frameword
- GLM can be replaced with simpler link ratio approach while still being grounded in the underlying GLM framework.
- Using age-to-age ratios serves as a “bridge” to the deterministic framework and allows the model to be more easily explained to others.
- We can still use link ratios to get a solution if there are negative incremental, whereas the GLM with a log link might not have a solution.
Calculate Unscaled Pearson residual
r_w,d = q(w,d)-m_w,d / sqrt(m^z_w,d)
Unscaled Pearson residual = (Actual IncLoss - Expected IncLoss)/sqrt(Expected IncLoss^z)
Calculate Scale Parameter
phi = sum(unscaled residuals ^2)/(N-p)
N = # incremental values in triangle
p = #AYs + #LDFs + #hetero groups - 1
State the assumption about residuals necessary for bootstrapped samples
Residuals are independent and identically distributed (iid).
Note: no particular distribution is necessary. Whatever distribution the residuals have will slow into simulated data.
Calculate sampled incremental loss for bootstrap model
q(w,d) = rsqrt(m^z_w,d) + m_w,d
SimIncLoss(AY,d) = SimResidual * sqrt(E(IncLoss)^z) + E(IncLoss)
If m_w,d negative, take absolute value
Calculate Standardized Pearson Residuals
rH_w,d = r_w,d * fH_w,d
fH_w,d = sqrt(1 / (1-H_w,d)) = hat matrix adjustment factors
Explain the steps to create a distribution of point estimates
- Create a sample triangle of incremental losses using sample standardized Pearson residuals, r*, and expected incremental from model m_w,d.
- Calculate cumulative triangle and LDFs for simulated triangle.
- Calculate point estimate of unpaid losses for sampled data.
- Run steps 1-3 for many samples to get a distribution of point estimates.
Note: these steps ignore process variance. We can add process variance to future incremental values using a Gamma distribution.
Explain how to add process variance to future incremental values in bootstrap model
qsim(w,d) follows Gamma (mean = m_w,d, var = phi*m_w,d)
m_w,d is the expected future incremental for this iteration, calculated from sampled bootstrap triangle.
Calculate Standardized Pearson Scale Parameter
phiH = sum(rH_w,d ^2) / N
In the bootstrap model, use unscaled Pearson scale parameter.
Standardized Pearson scale parameter could be used to approximate scale parameter.
Explain the Bootstrapping BF Model
With ODP bootstrap model, iterations for the latest few Ays can result in more variance than expected.
Incorporate BF model by using a priori loss ratios for each AY with standard deviations for each loss ratio and an assumed distribution.
During simulation, for each iteration simulate a new a priori loss ratio.
Explain the Bootstrapping Cape Cod Model
With ODP bootstrap model, iterations for the latest few Ays can result in more variance than expected.
Apply the Cape Cod algorithm to each iteration of the bootstrap model.
3 pros of using fewer parameters in generalizing ODP model
- Helps avoid potentially over-parameterizing the model
- Allows ability to add parameters for calendar-year trends
- Can be used to model data shapes other than data in triangle form (e.g. missing incremental in first few diagonals)
2 cons of using fewer parameters in generalizing ODP model
- GLM must be solved for each iteration of the bootstrap model, slowing simulations.
- Model is no longer directly explainable to others using age-to-age factors
Explain how to correct for negative incremental values using the modified log-link.
When sum of incremental losses in development column in positive, modify the log-link triangle calculations:
ln(q(w,d)) = -ln(abs(q(w,d))) if q(w,d) negative
Explain how to correct for negative incremental values using the negative development periods.
When the sum of the incremental losses in development column is negative.
- Shift incremental losses up by the size of the largest negative increment value (psi)
q+(w,d) = q(w,d) - psi
where psi is negative - Calculate log-link triangle, run the GLM and calculate fitted incremental values m+_w,d
- Shift fitted incremental losses back down by psi
m_w,d = m+_w,d + psi
Explain how to correct for negative incremental values using simplified GLM adjustments
When fitted incrementally (m_w,d) are negative, make the following adjustments to the formulas for residuals and simulated incremental values:
r_w,d = q(w,d) - m_w,d / sqrt(abs(m^z_w,d))
q(w,d) = r * sqrt(abs(m^z_w,d)) + m_w,d
Just switch abs(m_w,d) for m_w,d in the square roots
Explain how to correct for negative values during simulation using process variance
When fitted incremental (m_w,d) is negative, there are 2 options to simulate incremental from Gamma distribution using absolute values:
1. Change sign of simulated value
-Gamma(mean = abs(m_w,d), var = phi*abs(m_w,d))
BUT this results in a skewed left distribution
- Shift the entire distribution to have a mean of m_w,d:
Gamma(mean = abs(m_w,d), var = phiabs(m_w,d)) + 2m_w,d