final exam review Flashcards
(96 cards)
How is entropy used in clustering?
entropy (IG) ranks attributes by how much they contribute information. 0 = no info, 1 = max info
Describe SLC algo
- each obj starts as own cluster
- merge 2 closest clusters
- repeat n-k times
what is an important consideration for SLC?
how you define distance what is “close” for clusters/data points. This can be considered your domain knowledge
Describe K-means algo
- pick k centers at random.
- for each center
a. claim the closest points
b. recompute the centroid (dimensional mean) - repeat until convergence
define a consistent tie-breaking metric.
what rand opt also is K-means similar to?
hill climbing - take steps to move to a better configuration
What is a pitfall of K-means? How can you avoid it?
can get stuck in local optima. avoid by doing random restarts, or defining your space to be points of the convex hull.
K-means time complexity
each iteration O(Kn)
total number of iterations = O(K^n)
K-means properties
monotonically non-increasing in error.
squared error metric for the distance from centers/means
EM algo overview
soft clustering technique that assigns based on probability distributions.
- select k gaussians distributions
- find prob that each point (x_i) would be in each gaussian distribtuion
- take avg of all x_is in each distribution (cluster)
- recompute cluster based on wgt’d avg of the x_i’s
- repeat until the x’s don’t move that much
EM Properties
- does not diverge
- doesn’t fully converge, but practically does
- can get stuck in local optima, need restarts
- monotonically non-decreasing likelihood.
- If E, M solvable, works with any distribution.
Can EM or K-means ever act in the same way?
if probabilities were argmax to 0 or 1, EM would function same as k-means
What are some properties of clustering?
richness
scale invariance
consistency
what is cluster richness
all possible partitions are possible. for any cluster that you want, there is some distribution where P_distribution can produce that cluster
scale invariance
mult. list by a positive scalar does not change clustering (ie changing units)
cluster consistentcy
making similar things more similar and different things more different will not change the groupings.
what is the impossibility thm
you can never have all 3 three of consistency, richness, and scale invariance for a cluster algorithm
what is game theory?
the mathematics of conflict. games can have single or multiple agents.
what is a strategy
mapping of all possible states to an action s.
each player has their own strategy
can only map to states attainable by a player
what is the difference between a pure and a mixed strategy?
pure represent straight moves by a player, while mixed strategies are based on a probability distribution over the players’ strategies.
In a pure strategy a player chooses an action for sure, whereas in a mixed strategy, they choose a probability distribution over the set of actions available to them.
what is minimax
for a game btw A and B, A considers the best case counter, while B considers the worst case counter.
this is all about pov.
A picks the strategy that maxes it’s value while knowing your opponent is trying to minimize your value.
also known as covering your ass.
what is the fundamental results for a 2 player, zero sum, deterministic game of perfect info?
- minimax = maximin
- there always exists a pure strategy for each player.
- this is saying the value found in the matrix that describes the game is when A, B both try to maximize their strategies acting rationally.
what is the effect of hidden information?
- mini poker
- minimax does not equal maximin b/c one player’s strategy depends on what the opponent is going to do.
what is a Nash Equilbrium
for n players with strategies S1,..,Sn.
These n players with strategies are in a Nash eq. iff for all states the player chooses the strategy that maximizes their utility.
This is saying that a no player in the game has a reason to change its strategy.
a unique NE exists if the elimination of strictly dominated strategies eliminates all but one combo.
any NE will survive the elimination of strictly dominated states
T/F : for a finite # of players and finite # of strategies, there exists a NE
True