Statistics/ML Flashcards
Bonferroni
Alpha/m
Bagging
Take n bootstrap samples, fit model to each, take average
Random forest
Basically bagging with decision trees, but for each split, choose from random p/3, for example, of the covariates
Support vector machines
Maximize M subject to all but some number of points being farther away than M from decision boundary, with sum of distances for those that aren’t <= C
Newton Raphson
x_1 = x_0 - f(x_0)/f’(x_0)
Gradient descent
x_1 = x_0 - gamma * gradient of F
Logit
P(y=1) = e^(X beta)/(e^(X beta)+1)
log(p/(1-p)) = X beta
K nearest neighbors
Use plurality vote for classification, or mean for regression
Std error
Std deviation of a statistic’s sampling distribution , or an estimate of it, eg
(1/sqrt(n))sqrt(sum(xi - mean)^2)
Normal density
(1/sigma sqrt(2 pi)) e^( - (1/2)((x- mu)^2/sigma^2))
T test
Sample should be normal, but ok for large samples I believe.
Tau-hat/(se(tau-hat))
Eg difference divided by
Sigma hat * Sqrt (1/n1 + 1/n2)
Covariance of beta hat for regression
Sigma^2. (X’X)^(-1)
Estimate sigma with
1/(n-p). *. Sum of (y - X beta)^2
Law of large numbers
Lim as n -> inf
P(|mean(Y1,..,Yn) - mu|>= ep)
=0
Central Limit Theorem
Limit as n -> infinity
P(. (1/(sigma/sqrt(n)). *
(Mean(Y1,…,Yn) - mu) <= z). =
Phi(z)
Ie
(1/(sigma/sqrt(n)) times (Ybar - mu)
converges to a unit normal
Note that Ybar has std dev
sigma * sqrt(n)
SUTVA
Stable Unit Treatment Value Assumption
Response of one unit only depends on their treatment not on treatment of others
Eg if some people assigned to travel on public transportation, some in cars, then wouldn’t hold, because it affects the traffic