Behaviour module Flashcards
What are the issues with grid search?
- If the best parameter value(s) are outside of the range of values you evaluate, you will obviously not find the best parameter during search
- If the best parameter identified is on the edge of the parameter range evaluated, you likely are missing the true best parameter(s)
- The accuracy of grid search depends on how finely you evaluate the parameter range
- Grid search only works well when the number of fitted parameters is small (2-3 or less)
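A minimal sketch of grid search in Python (illustrative only; the data, grid ranges, and Gaussian model are assumptions, not from the course): evaluate the negative log-likelihood at every combination of a coarse grid over two parameters and keep the best one.
```python
# Assumed example: grid search over the mean and sd of a Gaussian model
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
data = rng.normal(loc=2.0, scale=1.5, size=100)   # synthetic data

mus = np.linspace(0, 4, 41)        # grid over the mean
sigmas = np.linspace(0.5, 3, 26)   # grid over the sd

best = (np.inf, None, None)
for mu in mus:
    for sigma in sigmas:
        nll = -np.sum(stats.norm.logpdf(data, loc=mu, scale=sigma))
        if nll < best[0]:
            best = (nll, mu, sigma)   # keep the lowest -LL seen so far

print("best -LL, mu, sigma:", best)
```
Note the two limitations above are visible here: the answer can only be as precise as the grid spacing, and the number of evaluations grows multiplicatively with each added parameter.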
why do we maximize LOG likelihood instead of just likelihood?
The likelihood is the product of many numbers between 0 and 1. For large datasets, eventually this number will get rounded down to zero (numerical underflow)
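A quick sketch of the underflow problem (assumed toy numbers): multiplying many probabilities between 0 and 1 rounds to exactly zero, while the sum of log-probabilities stays finite.
```python
import numpy as np

p = np.full(1000, 0.1)     # 1000 observations, each with probability 0.1
print(np.prod(p))          # 0.0 -> numerical underflow
print(np.sum(np.log(p)))   # about -2302.6, still usable
```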
4 steps for maximum likelihood
Step 1: Formulate a model that predicts the probabilities of all possible outcomes as a function of the parameters
Step 2: Calculate the probability of each observation given parameters
Step 3: The product of the probability of all observations is the Likelihood
Step 4: Search/Solve for parameters that maximize Likelihood
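A minimal sketch of the four steps for an assumed toy example (estimating the probability of a correct response from binary choice data; the data are made up):
```python
import numpy as np
from scipy.optimize import minimize_scalar

y = np.array([1, 1, 0, 1, 1, 0, 1, 1, 1, 0])   # made-up 0/1 observations

# Steps 1-2: the model predicts P(correct) = p; probability of each observation
def neg_log_lik(p):
    probs = np.where(y == 1, p, 1 - p)
    # Step 3: the log-likelihood is the sum of log-probabilities
    return -np.sum(np.log(probs))

# Step 4: search for the parameter value that maximizes the likelihood
res = minimize_scalar(neg_log_lik, bounds=(0.001, 0.999), method="bounded")
print(res.x)   # close to the sample mean of y (0.7), the analytic MLE
```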
What is the difference in how we fit linear and non-linear models to data?
- Linear models: we can directly solve for the parameters that best fit the data using calculus and linear algebra (done automatically by statistical software)
- Non-linear models: we have to iteratively search for the best parameters (more on this later)
Name four types of models used in behavioral sciences
- Simple general linear models with Gaussian error (General Linear Models)
  - Linear regression
  - Comparing groups (t-tests/ANOVA)
- Simple linear models with other error distributions (Generalized Linear Models)
  - Logistic regression
  - Poisson regression
- Non-linear models
  - Descriptive models that are non-linear in the parameters
- Process-based models
  - Aim to describe the underlying mechanisms and sequences of operations that give rise to cognitive functions and behaviour
  - Typically non-linear
What is a poisson distribution?
- A discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time
- Assumptions
  - Events occur with a constant mean rate
  - Events occur independently of the time since the last event
When (and when not) would you use a poisson distribution and why?
- Using discrete probability distributions like the Poisson to model count data is generally only required when counts are low
- As λ increases, the Poisson distribution becomes symmetric, and you can use a Normal distribution to model the data
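A small sketch of this point (assumed example, not course code): compare the Poisson PMF to a Normal density with the same mean and variance for a small and a large λ.
```python
import numpy as np
from scipy import stats

for lam in (2, 50):
    k = np.arange(0, lam + 4 * int(np.sqrt(lam)) + 1)
    pois = stats.poisson.pmf(k, mu=lam)                      # exact count probabilities
    norm_approx = stats.norm.pdf(k, loc=lam, scale=np.sqrt(lam))  # Normal approximation
    print(f"lambda={lam}: max |Poisson - Normal| = {np.max(np.abs(pois - norm_approx)):.4f}")
```
The discrepancy is noticeable at λ = 2 (low counts, skewed) and small at λ = 50.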
Process-based/computational models of behavior
- mathematical equations that link the experimentally observable variables (e.g. stimuli, outcomes, past experiences) to behaviour
- computational models represent different “algorithmic hypotheses” about how behaviour is generated
General framework for fitting/analyzing nearly all models
- Maximum Likelihood (quantify goodness of fit)
- Non-linear optimization (finding the parameters that best fit the data)
- Quantifying uncertainty – likelihood profiles and bootstrapping
- Comparing models: Information Criteria and cross-validation
What do we use instead of OLS (in this course) and why?
Likelihood. Reason: OLS does not work for all types of data (e.g. non-normally distributed or binary data)
What is likelihood
Likelihood is the joint probability (or probability density) of the data given a set of parameter values
* In other words “the probability of the data given the parameter values”
* When errors of data points are independent, the joint probability of the data is the product of the probabilities/probability densities of all observations
PMF AND PDF
PMF: probability mass function – for discrete probability distributions (gives the probability of observations as a function of the parameters)
PDF: probability density function – for continuous probability distributions (gives the probability density of observations as a function of the parameters)
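A two-line sketch of the distinction in Python (assumed example): discrete distributions expose a PMF (a probability), continuous distributions expose a PDF (a density).
```python
from scipy import stats

print(stats.poisson.pmf(3, mu=2.5))         # P(X = 3) for a Poisson count model
print(stats.norm.pdf(0.5, loc=0, scale=1))  # a density (not a probability) at x = 0.5
```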
What is the problem we are trying to solve by optimization methods?
This arises with non-linear models.
General problem: we want to find the parameters that maximize the log-likelihood, but we don't know what the likelihood surface looks like; we can only evaluate the likelihood one parameter combination at a time
2 types of optimization methods
Gradient-based methods (e.g. Newton's method, gradient descent); gradient-free methods (e.g. Nelder-Mead simplex)
Nelder-mead simplex
- Nelder-Mead simplex is an algorithm for searching parameter space to find a minimum
- start by going in what seems to be the best direction by reflecting the high (worst) point in the simplex through the face opposite it;
- if the goodness-of-fit at the new point is better than the best (lowest) other point in the simplex, expand the length of the jump in that direction;
- if this jump was bad (the height at the new point is worse than the second-worst point in the simplex), then try a point that's only half as far out as the initial try;
- if this second try, closer to the original, is also bad, then contract the simplex around the current best (lowest) point
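In practice you rarely code these steps yourself; a minimal sketch using scipy's Nelder-Mead implementation on an assumed Gaussian negative log-likelihood (data and parameterization are illustrative):
```python
import numpy as np
from scipy import stats
from scipy.optimize import minimize

rng = np.random.default_rng(0)
data = rng.normal(loc=2.0, scale=1.5, size=200)

def neg_log_lik(params):
    mu, log_sigma = params            # fit log(sd) so the sd stays positive
    return -np.sum(stats.norm.logpdf(data, loc=mu, scale=np.exp(log_sigma)))

res = minimize(neg_log_lik, x0=[0.0, 0.0], method="Nelder-Mead")
print(res.x[0], np.exp(res.x[1]))     # estimated mean and sd
```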
How does gradient descent find the parameters that maximize the likelihood?
- Calculate the partial derivatives of the –LL with respect to the parameters
- The vector of partial derivatives of the –LL with respect to the parameters is the gradient, which points in the direction of steepest ascent of the –LL
- We want to minimize the –LL, so we move the parameters a small amount in the opposite direction: θ_new = θ_old − η∇(–LL), where η is a small step size
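A minimal gradient-descent sketch (assumed example, not course code): minimize the –LL of a Gaussian with known sd = 1, so the only parameter is the mean and the gradient has a simple closed form.
```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(loc=2.0, scale=1.0, size=500)

mu = 0.0                           # initial guess
eta = 0.001                        # step size (learning rate)
for _ in range(200):
    grad = -np.sum(data - mu)      # d(-LL)/d(mu) for a Gaussian with sd = 1
    mu = mu - eta * grad           # step opposite to the gradient of -LL
print(mu)                          # converges to the sample mean of the data
```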
How do we avoid local minima
- All optimizers require an initial guess for the parameter values, which is where the search process begins
- Good starting guesses for the initial parameters help avoid local minima (based on the data, previous studies, or biological interpretation)
- Generally, gradient-free methods are more robust to local minima
- Optimize multiple times with different initial parameters (e.g. build a coarse grid of parameter values and run the optimization initialized at all combinations of the gridded parameters), as in the sketch below
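A sketch of that multi-start strategy (the model, grid, and data are assumptions for illustration): run Nelder-Mead from every combination of a coarse grid of starting values and keep the best fit overall.
```python
import itertools
import numpy as np
from scipy import stats
from scipy.optimize import minimize

def neg_log_lik(params, data):
    mu, log_sigma = params
    return -np.sum(stats.norm.logpdf(data, loc=mu, scale=np.exp(log_sigma)))

rng = np.random.default_rng(2)
data = rng.normal(loc=1.0, scale=2.0, size=100)

# coarse grid of starting values for (mu, log_sigma)
starts = itertools.product(np.linspace(-2, 2, 3), np.linspace(-1, 1, 3))
fits = [minimize(neg_log_lik, x0=list(s), args=(data,), method="Nelder-Mead")
        for s in starts]
best = min(fits, key=lambda r: r.fun)   # keep the run with the lowest -LL
print(best.x[0], np.exp(best.x[1]))
```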
Pros and cons of grid search, gradient-based and gradient free optimization methods
Grid search
Pros:
*Easy
*Unlikely to miss the global minimum if the grid is set appropriately
Cons:
*Very slow for high dimensional problems
*Only as precise as the grid
Gradient-based (e.g. Newton’s Method)
Pros:
*Fast: converges to minima much faster
Cons:
*Easily caught in local minima if they exist
Gradient free (e.g. Nelder Mead)
Pros:
*Works well for models of intermediate complexity
*Faster than grid search
Cons:
*Slower to converge than gradient-based methods
*Can still get caught in local minima
What question are we trying to answer with parameter recoverability and what steps does it involve?
If this cognitive process/behaviour works like I think it works, will my experiment provide sufficient information to recover the parameters with the desired precision and without bias?
Steps of Parameter recoverability
1. Use your model and known parameter values to generate a synthetic data set
2. Simulate the experiment you plan to conduct (# of replicates, etc.)
3. Fit your model to the simulated data set
4. Compare the true and fitted parameter values
5. Repeat many times and evaluate the distribution of fitted parameter estimates compared to the true value that generated the dataset (to estimate precision and bias)
- When we are uncertain about the likely range of parameter values, we can do parameter recovery over a range of parameter values to see under what range we get precise, unbiased estimates
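A compact sketch of the recovery loop (assumed example using a simple Gaussian model; the "true" values, trial count, and number of repeats are made up):
```python
import numpy as np
from scipy import stats
from scipy.optimize import minimize

true_mu, true_sigma, n_trials = 1.5, 0.8, 50   # known generating values

def neg_log_lik(params, data):
    mu, log_sigma = params
    return -np.sum(stats.norm.logpdf(data, loc=mu, scale=np.exp(log_sigma)))

rng = np.random.default_rng(3)
estimates = []
for _ in range(200):                                       # step 5: repeat many times
    data = rng.normal(true_mu, true_sigma, size=n_trials)  # steps 1-2: simulate the experiment
    fit = minimize(neg_log_lik, x0=[0.0, 0.0], args=(data,),
                   method="Nelder-Mead")                   # step 3: fit the model
    estimates.append([fit.x[0], np.exp(fit.x[1])])

est = np.array(estimates)                                  # step 4: compare to truth
print("mean estimates:", est.mean(axis=0), "true:", [true_mu, true_sigma])
print("sd of estimates (precision):", est.std(axis=0))
```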
Why must we use probability density instead of probability for continous distributions
- If a random variable is continuous, there are an infinite number of values it could take
- The probability of observing any specific value exactly is 0
- We can only define probabilities of observing values that fall within a specific range (e.g. between 0 and 1)
3 continuous distributions other than gaussian (normal)
- Exponential distribution
  - Time to event, or time between events, when events happen at a constant rate
- Weibull distribution
  - Time to event, when the probability of the event increases or decreases with time
- Inverse-Gaussian
  - e.g. the expected first-passage time distribution for drift diffusion with one boundary
Probability density function
- A probability density function describes the relative likelihood that a value of a random variable would be equal to a specific value
- If we draw many random numbers from a continuous probability distribution, the probability density tells us the relative likelihood of drawing values near a specific value
- Example: If the prob density of x = 0 is 0.4, and the prob density of x = 1 is 0.2, we should expect to see twice as many observations near 0 compared to near 1.
- Integrates to 1
- Probability of observing a value of x between two bounds is equivalent to the area under the curve between the two bounds
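A one-line sketch of that last point (assumed example): for a continuous distribution, the probability of falling between two bounds is the area under the PDF, which is a difference of CDF values.
```python
from scipy import stats

p = stats.norm.cdf(1) - stats.norm.cdf(0)   # P(0 < X < 1) for a standard Normal
print(p)                                    # about 0.341
```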
What type of distribution is usually used to model RTs?
- Non-normal continuous probability distributions are often used to model reaction times, which tend to be positively skewed
- These distributions generally have multiple parameters that influence both the mean and the shape of the distribution
- We can model how the parameters differ across treatments and groups
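A small sketch of fitting one such skewed distribution to RTs (assumed example: synthetic Wald/inverse-Gaussian RTs and scipy's built-in maximum-likelihood fit; not course code):
```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(4)
rts = rng.wald(mean=0.6, scale=2.0, size=500)     # synthetic, positively skewed RTs

# Fit an inverse-Gaussian; fix loc=0 so all mass stays on positive RTs
shape, loc, scale = stats.invgauss.fit(rts, floc=0)
print(shape, scale)                               # parameters controlling mean and skew
```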
Standard error and CIs (in normal distribution)
- Standard error – because the sampling distribution is normally distributed, we can quantify the shape of the sampling distribution by the estimated value of the parameter and the standard deviation of the sampling distribution, which we call the standard error
- Confidence intervals – we can use standard errors to construct confidence intervals. By definition, we expect the true parameter value to fall within the 95% confidence interval 95% of the time
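A tiny sketch of building a 95% CI from an estimate and its standard error under the Normal approximation (the estimate and SE are made-up numbers):
```python
estimate, se = 2.3, 0.4                           # assumed parameter estimate and SE
ci = (estimate - 1.96 * se, estimate + 1.96 * se)
print(ci)                                         # roughly (1.52, 3.08)
```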