Formulas Flashcards
(44 cards)
What is the formula for mean squared error?
What is the sigmoid function?
What is the function for the hyperbolic tangent?
What are the two equations used to update the weights with momentum?
What is the equation for the running average of the gradients used in Adam?
What is the equation for the squared gradients used in Adam?
How is each parameter updated when using Adam?
What is Bayes' rule?
What is the formula for the entropy of a discrete probability distribution?
What is the formula for the KL-divergence between two discrete probability distributions?
What is the formula for the entropy of a continuous probability distribution?
What is the formula for the KL-divergence between two continuous probability distributions?
What is the entropy of a Gaussian Distribution?
What is the entropy of a d-dimensional Gaussian distribution?
What is the KL-divergence between two d-dimensional multivariate Gaussian Distributions?
What is the Wasserstein distance between two multivariate Gaussian distributions?
What is the cross entropy error for a binary classification task?
What is the probability density function of the Gaussian distribution?
What is the probability density function of the multivariate Gaussian distribution?
For softmax, what is Prob(i)?
For softmax, what is log Prob(i)?
What is the equation for the gradient using softmax?
What is the weight decay equation?
What is the formula for the mean?
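
The answers to these cards are not shown here, so the following is only a minimal Python sketch of the standard textbook forms for a few of the simpler cards (mean, mean squared error, sigmoid, hyperbolic tangent, softmax, log-softmax, discrete entropy, discrete KL-divergence, and binary cross entropy). All function names are illustrative. The conventions are assumptions and may differ from the deck: MSE is averaged without a 1/2 factor, binary cross entropy is summed rather than averaged, and logarithms are natural.

```python
import math

def mean(xs):
    # Arithmetic mean: (1/N) * sum_i x_i
    return sum(xs) / len(xs)

def mean_squared_error(targets, predictions):
    # MSE = (1/N) * sum_i (t_i - y_i)^2  (assumed convention: no 1/2 factor)
    n = len(targets)
    return sum((t - y) ** 2 for t, y in zip(targets, predictions)) / n

def sigmoid(x):
    # sigma(x) = 1 / (1 + e^(-x))
    return 1.0 / (1.0 + math.exp(-x))

def tanh(x):
    # tanh(x) = (e^x - e^(-x)) / (e^x + e^(-x))
    e_pos, e_neg = math.exp(x), math.exp(-x)
    return (e_pos - e_neg) / (e_pos + e_neg)

def softmax(logits):
    # Prob(i) = exp(z_i) / sum_j exp(z_j); subtracting max(z) leaves the
    # result unchanged and avoids overflow for large logits.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def log_softmax(logits):
    # log Prob(i) = z_i - log(sum_j exp(z_j))
    m = max(logits)
    log_total = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - log_total for z in logits]

def entropy(p):
    # H(p) = -sum_i p_i * log(p_i), with 0 * log(0) treated as 0
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def kl_divergence(p, q):
    # KL(p || q) = sum_i p_i * log(p_i / q_i); assumes q_i > 0 where p_i > 0
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def binary_cross_entropy(targets, probs):
    # E = -sum_i [t_i * log(y_i) + (1 - t_i) * log(1 - y_i)]
    # (assumed convention: summed over examples, with 0 < y_i < 1)
    return -sum(
        t * math.log(y) + (1 - t) * math.log(1 - y)
        for t, y in zip(targets, probs)
    )
```

For example, `softmax([1.0, 2.0, 3.0])` returns three probabilities that sum to 1, and exponentiating each entry of `log_softmax([1.0, 2.0, 3.0])` recovers the same values.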