Unconstrained and Constrained Optimization Flashcards
How do you find a critical point of a single variable function using calculus?
Take derivative and set it equal to zero. Solve for critical point(s).
How do you determine Max/Min of a single variate function using calculus?
Take second derivative, if its “+” then its a minima (inflection concave, bowl), if “-“ then maxima (inflection convex, hill)
How do you find the critical points for multi-variate function with calculus?
Take gradient of function w.r.t. all variables. Set = 0, solve for critical point(s).
How do you determine if multi-variate function is max/min?
Take Hessian H(x), (determinant of second gradient).
- Positive Definite Hessian - positive value of determinant, all eigenvalues ‘+’, point is minima.
- Positive Semi-Definite Hessian - Det(H(x)) >= 0, one eigenvalue = 0, point could be infinite line or unbound from below/above.
- Negative Definite - negative determinant, all eigenvalues ‘-‘, point is maxima.
- Indefinite - both positive and negative eigenvalues, saddle point.
Define a Positive Definite Matrix, what does PD hessian mean?
PD matrix means all eigenvalues are positive. PD hessian means determinant is positive, point is a minima.
Define a Positive Semi-Definite Matrix, what does PSD hessian mean?
A PSD matrix means at least one, or multiple eigenvalues are = 0, the rest are positive. A PSD hessian means your minimizer could be a hyperplane of infinite minima or its unbounded from below/above
Define a Negative Definite Matrix, what does ND hessian mean?
a ND matrix means all eigenvalues are negative. A ND hessian means the point is a maximum.
Define a non-singular indefinite matrix
Both positive and negative eigenvalues
Define a non-singular/invertible matrix
matrix that can be inverted, AA^(-1) = I
Define a singular matrix
matrix that does not have an inverse
Find eigenvalues and eigenvectors of 2x2 matrix:
1) (A-lambdaI)
2) det(A-lambdaI) = 0
3) (A-lambda_iI)V_i = 0 *for each lambda is V
What is optimization?
The process of making something fully perfect, functional, or effective as possible. *Mathematically finding the min/max within a neighborhood or constraints.
What is the goal of unconstrained optimization?
To minimize some function f(x), or maximize -f(x).
What are challenges associated with unconstrained optimization.
- The function f(x) may be unknown form, or have little knowledge of the shape
- f(x) may be expensive to evaluate, may need to reduce number of function calls/queries of objective function
What is Newton’s Root Finding Method?
Root finding methods are used the determine the ‘x’ values at which a function ‘g’, crosses zero using successive approximations to these roots:
x_n+1 = x_n - f(x_n)/f’(x_n)
What is the first step to Newton’s root finding method?
Need an initial point.
How do we determine an initial pt for Newton’s root finding?
Make a plot, or guess based on intuition.
What is a multimodal problem?
a multimodal problem is a problem with multiple local minima/optimum over a range of x
*unimodal - monotonically increases for some range x<=m, and decreases for some range x>=m (single optima)
When do calculus-based method fail to find a minima
- Higher order functions
- Transcendental functions
- Constrained optimization
- Functions that are “black box” functions of unknown form
When does a derivative of a function not exist?
- When the function is not continuous: cusp, hold, any discontinuity
- If an asymptote or vertical/horizontal line
What is the Lagrangian Function?
A modification to an objective function accounting for “m” inequality constraints and l equality constraints
What are Lagrange multipliers?
They tell you the sensitivity of the solution to perturbation of the constraint.
- zero = inactive / passive
- large = highly sensitive
- small = insensitive to changes in constraint
What is an active constraint?
A constraint for which the constrained optimization solution lies on the boundary, preventing the solution from reaching the unconstrained global minima.
What are 3 characteristics of “Black Box” functions?
1) Functions of unknown form
2) Functions may be engineering tools that are expensive to run: inadequate # samples, can’t plot, too many vbls to visualize
3) Functions often strongly non-linear and have discontinuities