final Flashcards

(57 cards)

1
Q

Univariate (simple) regression: key equations

A

y = B0 + B1*x1 + error
df = n - 2
Null hypothesis: B1 = 0, i.e., no linear relationship

2
Q

Multivariate (multiple) regression: key equations

A

y = B0 + B1*x1 + … + Bp*xp + error
df = n - p - 1
Null hypothesis: B1 = … = Bp = 0 for the whole model, i.e., no linear relationship

But some of the x's, if individually compared to the response in their own single-predictor models, can still be != 0

3
Q

What are the assumptions for OLS?

A

L - Linearity: the model is linear in the parameters B0 and B1
I - Independent errors: the error terms should be independent of one another, so there is no correlation among the residuals
N - Normality: normally distributed errors with a mean of zero ("centered around zero")
E - Equal variance: the variability of the errors is the same across all predictor values

4
Q

How does the OLS matrix math work out? (One form can be used for both uni- and multivariate; X is n x (p+1).)

A

y = X*B + error
with
b = [(X'X)^-1]X'Y
where b = [b0; b1; …] is the vector of estimated coefficients
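As a minimal sketch of the normal-equation solution, assuming Python with numpy (the data below are simulated purely for illustration):

```python
import numpy as np

# Sketch of b = (X'X)^-1 X'Y on simulated data (illustrative only).
rng = np.random.default_rng(0)
n = 50
x1 = rng.uniform(0, 10, n)
y = 2.0 + 1.5 * x1 + rng.normal(size=n)

X = np.column_stack([np.ones(n), x1])   # n x (p+1) design matrix, intercept first
b = np.linalg.inv(X.T @ X) @ X.T @ y    # b = [b0, b1]
print(b)                                # roughly [2.0, 1.5]
```

In practice np.linalg.lstsq is preferred over forming the explicit inverse, for numerical stability.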

5
Q

Explain what residual deviance is and how to calculate it

A

Residual deviance is a measure of how far the model is from the saturated one (the model that perfectly fits all data points in the range)
The lower the deviance, the better the fit

D = -2(L1 - LS), where L1 is the log-likelihood for the current model and LS is the log-likelihood for the saturated model

6
Q

How do you find a critical t-value from a t-table for a regression slope hypothesis test?

A

Determine the degrees of freedom: df = n - p - 1 (or n - 2 for simple regression)

Choose the significance level, e.g., alpha = 0.05;
if the test is two-tailed, look for alpha/2 in the t-score table row for the corresponding df

t-score = (b1 - 0)/SE(b1), which is (estimate - null value)/SE

The table gives a critical t-value, which we compare to the computed t-score to see if it exceeds the critical value

Calculate the CI with bi ± t*(df) * SE(bi)
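A minimal sketch of the lookup and CI, assuming Python with scipy (b1, SE(b1), n, and alpha are made-up example values):

```python
from scipy import stats

b1, se_b1 = 1.5, 0.4                    # hypothetical estimate and its SE
n, p = 50, 1
df = n - p - 1                          # = n - 2 for simple regression

t_score = (b1 - 0) / se_b1              # (estimate - null value) / SE
t_crit = stats.t.ppf(1 - 0.05 / 2, df)  # two-tailed critical value at alpha = 0.05

print(abs(t_score) > t_crit)            # True -> reject H0
print(b1 - t_crit * se_b1, b1 + t_crit * se_b1)  # CI: bi +- t*(df) * SE(bi)
```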

7
Q

What is the hypothesis test for a single predictor’s coefficient in a multiple regression model?

A

Null hypothesis (H0): Bj = 0, so predictor xj has no effect
Alternative hypothesis (HA): Bj != 0
Test statistic: t = (bj - 0)/SE(bj)
If |t| > the two-tailed critical t-value, reject H0

8
Q

What are SSE, SST in linear regression?

A

SSE (Sum of Squared Errors): unexplained variation, measured relative to the regression line.
SSE = sum(yi - yi_hat)^2
SST (Total Sum of Squares): total variation in y, measured relative to the average.
SST = sum(yi - y_bar)^2
SST - SSE gives the "explained variation" (and R^2 = 1 - SSE/SST).

9
Q

How do we detect and address multicollinearity in multiple regression?

A

Detect via a correlation matrix or the Variance Inflation Factor (VIF).

VIF_i = 1/(1 - Ri^2), where Ri^2 comes from a regression with predictor i as the response and all the other predictors as explanatory variables

VIF = 1 (R^2 = 0): no correlation
VIF < 5 (R^2 < 0.8): moderate correlation
VIF > 5 (R^2 > 0.8): highly correlated
VIF > 10 (R^2 >= 0.9): significant, so it needs correcting, possibly by grouping two predictors together

Solutions: remove/merge correlated predictors, use regularization (Ridge), or collect more data.
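A minimal sketch of computing VIF by hand, assuming Python with numpy (the simulated data are made up, with x2 deliberately tied to x1):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 200
x1 = rng.normal(size=n)
x2 = 0.9 * x1 + rng.normal(scale=0.5, size=n)   # strongly correlated with x1
x3 = rng.normal(size=n)
X = np.column_stack([x1, x2, x3])

def vif(X, i):
    # Regress predictor i on all the others; VIF_i = 1 / (1 - Ri^2).
    y = X[:, i]
    A = np.column_stack([np.ones(len(y)), np.delete(X, i, axis=1)])
    b, *_ = np.linalg.lstsq(A, y, rcond=None)
    r2 = 1 - np.sum((y - A @ b) ** 2) / np.sum((y - y.mean()) ** 2)
    return 1 / (1 - r2)

print([round(vif(X, i), 2) for i in range(X.shape[1])])  # x1, x2 inflated; x3 near 1
```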

10
Q

What is forward selection in model building?

A

Start with no predictors.

Test each available predictor individually, add the one that gives the greatest improvement (e.g., in R^2).

Repeat until adding further predictors fails to significantly improve the model.
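A minimal sketch of the loop, assuming Python with numpy and made-up data; here the criterion is raw R^2 with an arbitrary 0.01 improvement cutoff, where adjusted R^2, AIC, or p-values would also work:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 100
X = rng.normal(size=(n, 4))
y = 3 * X[:, 0] - 2 * X[:, 2] + rng.normal(size=n)

def r2(cols):
    # R^2 of an OLS fit on the given predictor columns (plus intercept).
    A = np.column_stack([np.ones(n)] + [X[:, j] for j in cols])
    b, *_ = np.linalg.lstsq(A, y, rcond=None)
    return 1 - np.sum((y - A @ b) ** 2) / np.sum((y - y.mean()) ** 2)

selected, remaining = [], list(range(X.shape[1]))
while remaining:
    best = max(remaining, key=lambda j: r2(selected + [j]))
    if r2(selected + [best]) - r2(selected) < 0.01:
        break                             # no meaningful improvement: stop
    selected.append(best)
    remaining.remove(best)
print(selected)                           # expect columns 0 and 2
```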

11
Q

What is backward elimination in model building?

A

Start with all predictors.

Remove the least significant predictor (highest p-value or minimal improvement in R^2).

Repeat until all remaining predictors are significant or further removal degrades the model.

12
Q

Why are forward selection and backward elimination important?

A

They’re stepwise approaches to reduce a large set of predictors to a more parsimonious model.
They prevent overfitting by removing variables that add little predictive power.

13
Q

What is the logistic regression model formula?

A

ln(p/(1-p)) = B0 + B1*x1 + … + Bp*xp, with p being the probability of the "success" class

We use this for 2-level (binary) categorical response variables

Use logit(pi) = ln(pi/(1-pi)) above, which we invert to find pi = e^B/(1 + e^B),
with B being B0 + B1*x1 + … + Bp*xp
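A minimal sketch of the inversion, assuming Python (the coefficients are made-up example values):

```python
import math

b0, b1 = -2.0, 0.8            # hypothetical fitted coefficients

def predict_prob(x1):
    eta = b0 + b1 * x1        # linear predictor B = B0 + B1*x1
    return math.exp(eta) / (1 + math.exp(eta))

print(predict_prob(0))        # ~0.12
print(predict_prob(5))        # ~0.88
```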

14
Q

How does multivariate inference work?

A

1) Start with the predictor coefficients B1…Bp and set up an H0 for each of them
2) Set up the p-value thresholds
3) Run the regression to see whether each predictor variable is important on its own in the presence of the other predictors

15
Q

What is multicollinearity?

A

A strong correspondence (correlation) between two or more predictor variables

16
Q

How is multicollinearity explored?

A

1) Explore the observations: as each predictor variable increases, what does the response variable do in turn?

2) See if there are equations that can link some of the PVs together (each PV on its own could have a low p-value, but when all are combined in one multiple regression they can be highly correlated with one another, resulting in a high degree of collinearity, so the individual interpretations cannot hold)

17
Q

How is categorical data regressed upon?

A

Logistic regression, which takes the form transformation(pi) = B0 + B1*x1 + … + Bp*xp,
where after the transformation (link function) is applied we can solve for pi, the probability of the "success" outcome occurring

18
Q

What do the link functions for logistic-type regression do?

A

They adapt the model to different categorical/count responses:
logit - gives the log-odds via ln(pi/(1-pi))

log - gives ln(lambda), which is used for the Poisson distribution, modeling the mean number of counts

19
Q

What is pi in logistic regression?

A

The probability of an event occurring, such that E[Yi] = pi given our predictors; i.e., the probability of success

20
Q

Why do we use logistic regression instead of linear regression for binary outcomes?

A

Binary data often violate linear regression assumptions.

Logistic regression constrains predictions between 0 and 1.

The log-odds transformation (logit) is compatible with a wide range of distributions and yields interpretable odds ratios.

21
Q

How are logistic regression parameters estimated?

A

By using maximum likelihood: we find the B's that maximize the likelihood of the observed data. (In the simplest, intercept-only case this reduces to the count of observations in the "success" class divided by the total observations.)

The likelihood itself doesn't produce the B's directly; it is the function being optimized. We then plug the optimized B's (the MLE estimates) into the logit to solve for the probability

22
Q

Interpret the slope Bj in logistic regression

A

Bj is the change in the log-odds of success for a 1-unit increase in xj, holding the other predictors constant

The odds ratio for a 1-unit increase in xj is exp(Bj) = e^Bj

23
Q

What is k-fold cross-validation, and why do we use it in regression?

A

In k-fold CV, the dataset is split into k folds. We then train on k-1 folds and validate on the left-out fold

We then average the performance metric (e.g., RMSE or R^2) across the folds

This approach provides a better estimate of model performance and helps to detect overfitting
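A minimal sketch, assuming Python with scikit-learn and made-up data:

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(3)
X = rng.normal(size=(100, 2))
y = 1.0 + 2.0 * X[:, 0] - X[:, 1] + rng.normal(scale=0.5, size=100)

# 5-fold CV: each fold is held out once while the other 4 train the model.
scores = cross_val_score(LinearRegression(), X, y,
                         scoring="neg_root_mean_squared_error", cv=5)
print(-scores.mean())          # average RMSE across the 5 folds
```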

24
Q

How do residual plots help diagnose issues in linear regression?

A

A random scatter of points suggests that linear assumptions and homoscedasticity might hold.
A pattern (e.g., curved shape, funnels) indicates possible non-linearity or heteroskedasticity.
Systematic patterns can also suggest outliers or missing predictors.

25
Q

What is data permutation?

A

Shuffling the data such that one column is held fixed and the other is randomly rearranged; used for hypothesis testing
26
Q

What can be said if the CI surrounds/includes zero?

A

There is not enough evidence to reject the null hypothesis
27
Q

What is goodness of fit?

A

It can be described via the residual deviance, which is how much the model differs from a perfect (saturated) model that fits the data exactly
28
Q

What is the equation for the residual deviance?

A

D = -2(L1 - LS), where L1 is the maximized log-likelihood of the current model and LS is that of the saturated model (in which n = p, i.e., one observation per parameter B, including B0)
29
Q

What is the coefficient of determination (logistic analogue of r^2)?

A

An analogue of r^2 for logistic regression: the average expected (fitted) probability for the successes minus the average expected probability for the failures, (p1_bar - p0_bar)
30
Q

What is log-likelihood?

A

The log of the probability of the model observing the given data, given the model's parameters. The higher it is, the better the model is at producing the observed data
31
Q

What is the purpose of using CV?

A

To prevent overfitting, where the model fits the observations too closely, so any predictions outside the observed range would be invalid
32
Q

What is AIC?

A

A criterion that penalizes extra variables to balance under- vs. over-fitting; it is used to compare models with different numbers of parameters K
33
Q

AIC equation

A

AIC = -2*LL + 2K, where LL is the maximized log-likelihood and K is the number of parameters, K = p + 1 (predictors + intercept). As the AIC score decreases, model performance increases
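A minimal sketch of the computation for a Gaussian linear model, assuming Python with numpy and made-up data (note that some conventions also count sigma^2 in K):

```python
import numpy as np

rng = np.random.default_rng(4)
n = 80
x = rng.uniform(0, 10, n)
y = 1.0 + 0.5 * x + rng.normal(size=n)

X = np.column_stack([np.ones(n), x])
b, *_ = np.linalg.lstsq(X, y, rcond=None)
resid = y - X @ b
sigma2 = np.mean(resid ** 2)                      # MLE of the error variance
ll = -0.5 * n * (np.log(2 * np.pi * sigma2) + 1)  # maximized Gaussian log-likelihood
K = X.shape[1]                                    # K = p + 1 (predictors + intercept)
print(-2 * ll + 2 * K)                            # AIC; lower is better
```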
34
Q

What are AIC_min, AIC_max, and delta-AIC?

A

AIC_min belongs to the best-performing model and AIC_max to the worst-performing model. delta_k = AIC(of model k) - AIC_min (best)
35
Q

What is the delta_k breakdown?

A

delta_k <= 2: comparable to the best-fit model
delta_k > 10: no support that the model fits
36
Q

What are the Akaike weights?

A

The probabilities of our models under AIC, which we can use to rank the models: w_k = exp(-delta_k/2) / [sum over models n of exp(-delta_n/2)]
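A minimal sketch, assuming Python with numpy (the AIC scores are made-up examples):

```python
import numpy as np

aic = np.array([100.2, 101.9, 112.5])   # hypothetical model scores
delta = aic - aic.min()                 # delta_k = AIC_k - AIC_min
w = np.exp(-delta / 2)
w /= w.sum()                            # weights sum to 1 across the model set
print(delta.round(2), w.round(3))       # the delta > 10 model gets ~0 weight
```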
37
Q

What is RMSE, and what is the perfect-prediction condition for RMSE?

A

RMSE, the Root Mean Squared Error, measures how close our model gets to perfect predictions (a perfect prediction gives RMSE = 0). RMSE = sqrt(SSE/n), where n is the number of observations in the new/testing dataset
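A minimal sketch, assuming Python with numpy (the test-set values are made up):

```python
import numpy as np

y_true = np.array([3.0, 5.0, 7.5, 9.0])   # test-set observations
y_pred = np.array([2.8, 5.4, 7.0, 9.3])   # model predictions

sse = np.sum((y_true - y_pred) ** 2)
rmse = np.sqrt(sse / len(y_true))          # sqrt(SSE / n)
print(rmse)                                # 0 would mean perfect predictions
```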
38
Q

What is maximum likelihood?

A

A way of estimating the parameters that make the observed data most likely under an assumed probability distribution; if we assume normality, ML = OLS
39
Q

How do we find the maximum likelihood for a binomial function?

A

The likelihood of the observed data (k successes) over the number of trials (n) for a given p_hat is L(p) = C(n, k) * p^k * (1-p)^(n-k). It focuses on the single parameter p, with the MLE at p_hat = k/n
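A minimal sketch that scans the likelihood over a grid of p and confirms the peak at k/n, assuming Python with scipy (the counts are made up):

```python
import numpy as np
from scipy import stats

n, k = 40, 13                            # trials and observed successes
p_grid = np.linspace(0.01, 0.99, 999)
like = stats.binom.pmf(k, n, p_grid)     # C(n,k) * p^k * (1-p)^(n-k)
print(p_grid[np.argmax(like)], k / n)    # both ~0.325
```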
40
Q

How do you estimate a population size using the hypergeometric probability function?

A

N = finite population size
K = number of successes in the population = captured + marked
k = observed data = recaptured marked individuals
n = number of draws without replacement = recaptures

P(k) = [C(K, k) * C(N-K, n-k)] / C(N, n), i.e., drawing successes without replacement; maximize this over N to estimate the population size
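A minimal sketch of the scan over candidate N, assuming Python with scipy (the mark-recapture counts are made up):

```python
import numpy as np
from scipy import stats

K, n, k = 50, 40, 10                          # marked; recaptured; marked recaptures
N_grid = np.arange(K + n - k, 1001)           # N must be at least K + (n - k)
like = stats.hypergeom.pmf(k, N_grid, K, n)   # scipy order: k, M=N, n=K, N=n
print(N_grid[np.argmax(like)])                # near K*n/k = 200
```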
41
Q

What does B1 represent in logistic regression?

A

The change in the log-odds for a 1-unit increase in x1
42
Q

Important assumptions in order to use logistic inference

A

The probability distribution is well described by the link function:
logit (binomial): mean = n*p, var = n*p*q, where q = 1 - p
log (Poisson): mean = var = lambda
Observations are independent
43
Q

What does B0 represent for linear vs. logistic regression?

A

For linear regression, the true mean of the response when all predictors = 0. For logistic regression, B0 represents the log-odds of the outcome occurring when all independent variables are 0
44
Q

What is the difference between uppercase B's and lowercase b's?

A

Uppercase is for the population, whereas lowercase is for samples
45
Q

What is the observed probability formula for pi?

A

pi = e^B/(1 + e^B), with B = B0 + B1*x1 + … + Bp*xp
46
Q

With bootstrapping we make a histogram of the slopes; what happens if it isn't normal around zero?

A

If the histogram of slopes doesn't follow a normal distribution around zero but does around another value, we have support for the alternative hypothesis, with the respective correlation
47
Q

Why does RMSE increase with the number of parameters k?

A

Models with higher k capture the training data more closely, which in turn makes it harder to make good predictions on new data, so RMSE increases. The lower the RMSE, the better the model's fit
48
Q

How is MLE different from OLS?

A

MLE maximizes the likelihood function, whereas OLS minimizes the sum of squared residuals
49
Q

What is the walkthrough for hypothesis testing using t-scores?

A

1) Write down the model by identifying the response and predictor, with b0, to fit the form Y = B1*x + B0
2) Take n, the number of observations, and use Yi = B1*xi + B0 + error
3) Run a regression with b0 estimating B0, b1 estimating B1, and so on, to find the SEs and coefficients
4) Evaluate whether a linear model can be applied; if the B's != 0, proceed to the next step
5) Calculate the t-score = (b1 - (null = 0))/SE(b1), and find the corresponding df
6) Look up the t-score table and compare the CIs to the standard 1.96*SE for the true population slope. If the t-score computed by hand is greater than the table t-score, the slope is statistically significant
7) Calculate the slope interval by bi ± t*(df) * SE(bi)
50
Q

If the residuals are randomly distributed around 0, what does that mean?

A

That a linear model is a well-representative model for the data
51
Q

How do we tell if a B value increases the odds with respect to the response variable in logistic regression?

A

If beta is positive, so that e^B > 1, it increases the odds; if B = 0, so that e^B = 1, there is no change in the odds; and if e^B < 1, it decreases the odds
52
Q

What does the average RMSE tell us?

A

By how many units our models are off from the actual values, on average
53
Q

What do the coefficients mean for linear, logit, and Poisson models?

A

Linear: increase in the response value per 1 additional unit of the predictor
Logit: increase in the log-odds per 1 additional unit
Poisson (log): increase in the log-count per 1 additional unit
54
Q

What is the purpose of the MLE?

A

The MLE is the parameter value that, if true, would give you the highest probability of having observed the data you actually did. This makes it a natural choice for estimating the parameters of a model
55
Q

Describe how we can bootstrap and what we look for

A

Bootstrap the OLS fit and arrange the resampled slopes into a histogram to verify whether the null or alternative hypothesis is supported, based on where the slopes are centered. We can further test for more accurate results by permuting the data, and then find confidence intervals if required
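A minimal sketch of the resampling loop, assuming Python with numpy (the data are simulated for illustration):

```python
import numpy as np

rng = np.random.default_rng(5)
n = 60
x = rng.uniform(0, 10, n)
y = 1.0 + 0.7 * x + rng.normal(size=n)

slopes = []
for _ in range(2000):
    idx = rng.integers(0, n, n)                   # resample rows with replacement
    xb, yb = x[idx], y[idx]
    slopes.append(np.cov(xb, yb, bias=True)[0, 1] / np.var(xb))  # OLS slope

lo, hi = np.percentile(slopes, [2.5, 97.5])       # 95% bootstrap CI for the slope
print(lo, hi)      # an interval excluding 0 supports the alternative hypothesis
```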
56
Q

What happens to the CI as the sample size increases?

A

As the sample number/size increases, the CI range narrows, because the SE decreases via SE = sigma/sqrt(n)
57
Q

What is the difference between an observation and a variable?

A

Observation: a single data point in the dataset
Variable: a characteristic/attribute of the observation