6 - games and adversarial search Flashcards
(7 cards)
Games vs. single-agent search
We don’t know how the opponent will act
Solution: strategy or policy + efficnecy
Minimax value of a node
the utility (for MAX) of being in the corresponding state, assuming perfect play on both sides
indicates how good it would be for a player to reach that position
Minimax strategy
a kind of backtracking algorithm that is used in decision making and game theory to find the optimal move for a player, assuming that your opponent also plays optimally
Alpha-Beta Pruning
Alpha-beta pruning enhances the Min-Max algorithm by eliminating branches that do not affect the final decision. The key formulas for alpha-beta pruning are:
Alpha (α): The best value that the maximizing player can guarantee so far.
Beta (β): The best value that the minimizing player can guarantee so far.
ALPHA-BETA cutoff
ALPHA-BETA cutoff is a method for reducing the number of nodes explored in the Minimax strategy. For the nodes it explores it computes, in addition to the score, an alpha value and a beta value.
Expectiminimax
for chance nodes, average values weighted by the probability of each outcome
Monte Carlo simulation
when you get to a chance node, simulate a large number of games with random dice rolls and use win percentage as evaluation function