MI Flashcards
(59 cards)
What is Supervised Learning?
Supervised learning is a machine learning approach that trains on labeled data, where each example has a known correct output or label.
Describe the structure of a Decision Tree.
A Decision Tree has: internal nodes for feature tests, branches for outcomes, and leaves for final classifications.
How is a Decision Tree constructed?
1. Start with all data at the root. 2. Select a feature that best splits the data (e.g., using Gini or Information Gain). 3. Partition data accordingly. 4. Recursively repeat until a stopping condition (max depth, min samples, etc.) is reached.
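The split-selection step (step 2) can be sketched with a Gini-impurity calculation. This is a minimal illustration, not a full tree builder; the budget values, labels, and the `best_threshold` helper are invented for this example.

```python
def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def best_threshold(values, labels):
    """Try each midpoint between sorted values; return (score, threshold)
    for the split with the lowest weighted Gini impurity."""
    pairs = sorted(zip(values, labels))
    best = (float("inf"), None)
    for i in range(1, len(pairs)):
        t = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [l for v, l in pairs if v <= t]
        right = [l for v, l in pairs if v > t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(pairs)
        if score < best[0]:
            best = (score, t)
    return best

# Budget (in dollars) vs. "buys laptop" labels (invented data)
budgets = [500, 800, 1100, 1500, 2000]
buys = ["no", "no", "yes", "yes", "yes"]
score, threshold = best_threshold(budgets, buys)
print(threshold)  # 950.0 -- splits the two classes cleanly
```

A real implementation would repeat this search over every feature and recurse on each partition.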
Give an example of a Decision Tree classification.
Example: Predicting laptop purchases. If the person is a student, check budget. If budget > $1000, predict purchase.
What is the k-Nearest Neighbor (k-NN) algorithm?
k-NN is an instance-based learning method that classifies (or regresses) a new point based on its closest k neighbors.
Describe the k-NN algorithm.
1. Choose k (number of neighbors). 2. Compute distance (e.g., Euclidean) from the new point to all training points. 3. Select the k closest. 4. Combine results via majority vote (classification) or average (regression).
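The four steps above can be sketched in a few lines of pure Python; the training points and labels are invented for illustration.

```python
import math
from collections import Counter

def knn_classify(train, new_point, k=3):
    """train: list of (point, label) pairs. Classify new_point by
    majority vote among its k nearest neighbors (Euclidean distance)."""
    # Steps 2-3: compute all distances, then take the k closest
    dists = sorted((math.dist(p, new_point), label) for p, label in train)
    top_k = [label for _, label in dists[:k]]
    # Step 4: majority vote
    return Counter(top_k).most_common(1)[0][0]

train = [((1, 1), "A"), ((1, 2), "A"), ((5, 5), "B"),
         ((6, 5), "B"), ((2, 1), "A")]
print(knn_classify(train, (1.5, 1.5), k=3))  # "A": all 3 nearest neighbors are A
```

For regression, step 4 would average the neighbors' values instead of voting.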
What are key evaluation metrics in supervised learning?
Common metrics include: Accuracy (correct/total), Precision & Recall (especially for imbalanced data), and ROC AUC (trade-off between TPR and FPR).
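A toy computation of accuracy, precision, and recall; the label vectors are invented, and `metrics` is a hypothetical helper rather than a library function.

```python
def metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, and recall for a binary classification."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    accuracy = correct / len(y_true)
    precision = tp / (tp + fp) if tp + fp else 0.0  # of predicted positives, how many are right
    recall = tp / (tp + fn) if tp + fn else 0.0     # of true positives, how many were found
    return accuracy, precision, recall

y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
print(metrics(y_true, y_pred))  # (0.75, 0.666..., 0.666...)
```

Note how precision and recall expose errors that a single accuracy number can hide on imbalanced data.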
What is a Neural Network?
A Neural Network is composed of: an input layer (features), one or more hidden layers (transformation of inputs), and an output layer (final predictions).
How does backpropagation train a Neural Network?
1. Forward pass to get predictions. 2. Compute loss by comparing to true label. 3. Use gradients (via backpropagation) to update weights. 4. Repeat until convergence.
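The loop above can be sketched for a single sigmoid neuron with squared loss; the tiny dataset, learning rate, and iteration count are invented for illustration, and a real network repeats the same chain-rule step layer by layer.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Invented dataset: learn y = 1 when x > 0, else 0
data = [(-2.0, 0.0), (-1.0, 0.0), (1.0, 1.0), (2.0, 1.0)]
w, b, lr = 0.0, 0.0, 0.5

for _ in range(2000):
    for x, y in data:
        pred = sigmoid(w * x + b)            # 1. forward pass
        # 2. loss = (pred - y)^2; d(loss)/d(pred) = 2*(pred - y)
        # 3. chain rule back through the sigmoid to w and b
        grad = 2 * (pred - y) * pred * (1 - pred)
        w -= lr * grad * x
        b -= lr * grad

# 4. after enough repetitions, predictions approach the true labels
print(round(sigmoid(w * 2.0 + b)), round(sigmoid(w * -2.0 + b)))
```

This is stochastic gradient descent with one weight; backpropagation is the same gradient computation applied through every layer of a deeper network.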
Where are neural networks commonly used?
Neural networks are used in image recognition, speech processing, and self-driving cars.
What is Unsupervised Learning?
Unsupervised learning discovers patterns from unlabeled data, unlike supervised learning which relies on labeled examples.
What is Clustering?
Clustering groups data points so that points in the same group are more similar to each other (e.g., closer in distance) than to points in other groups.
How does k-Means clustering work?
1. Pick the number of clusters k. 2. Randomly choose k initial centers. 3. Assign each point to the nearest center. 4. Update centers based on assignments. 5. Repeat until centers stabilize.
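The loop above can be sketched in 1-D; the points and the starting centers are chosen by hand for illustration (step 2 is normally random).

```python
def kmeans_1d(points, centers, iters=10):
    for _ in range(iters):
        # Step 3: assign each point to its nearest center
        clusters = [[] for _ in centers]
        for p in points:
            i = min(range(len(centers)), key=lambda j: abs(p - centers[j]))
            clusters[i].append(p)
        # Step 4: move each center to the mean of its assigned points
        centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return centers

points = [1.0, 1.5, 2.0, 10.0, 10.5, 11.0]
print(kmeans_1d(points, centers=[0.0, 5.0]))  # [1.5, 10.5]
```

Here the centers stabilize after one pass; in practice the loop runs until assignments stop changing (step 5).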
How does Expectation-Maximization clustering work?
It alternates: E-Step (assign probabilistic memberships to clusters) and M-Step (update cluster parameters based on those memberships).
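The E/M alternation can be sketched for a two-component 1-D Gaussian mixture; the data points and starting means are invented, and the variances and mixture weights are held fixed (at 1 and equal) to keep the sketch short.

```python
import math

def normal_pdf(x, mu, sigma=1.0):
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))

def em_means(points, mu, iters=20):
    """Estimate two component means by alternating E and M steps."""
    for _ in range(iters):
        # E-step: probabilistic membership of each point in each component
        resp = []
        for x in points:
            p0, p1 = normal_pdf(x, mu[0]), normal_pdf(x, mu[1])
            resp.append((p0 / (p0 + p1), p1 / (p0 + p1)))
        # M-step: update each mean as the responsibility-weighted average
        for k in (0, 1):
            total = sum(r[k] for r in resp)
            mu[k] = sum(r[k] * x for r, x in zip(resp, points)) / total
    return mu

points = [0.0, 0.5, 1.0, 5.0, 5.5, 6.0]
print([round(m, 2) for m in em_means(points, mu=[0.0, 4.0])])  # [0.5, 5.5]
```

Unlike k-means' hard assignments, each point contributes to both means in proportion to its membership probabilities.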
Where is clustering used?
Clustering is used in customer segmentation, text classification, and medical diagnosis.
What is a Constraint Satisfaction Problem (CSP)?
A CSP involves finding values for a set of variables within their domains so that all constraints are satisfied.
What are the key components of a CSP?
1. Variables (items to assign). 2. Domains (possible values). 3. Constraints (rules that restrict valid assignments).
How can CSPs be solved?
1. Backtracking Search (systematically try assignments). 2. Constraint Propagation (reduce domains before/during search). 3. Local Search (iteratively refine a complete assignment).
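Backtracking search (method 1) can be sketched on a toy map-coloring CSP; the three regions, their adjacencies, and the color domains are invented for illustration.

```python
variables = ["WA", "NT", "SA"]
domains = {v: ["red", "green", "blue"] for v in variables}
# Constraint: adjacent regions must take different colors
neighbors = {"WA": ["NT", "SA"], "NT": ["WA", "SA"], "SA": ["WA", "NT"]}

def consistent(var, value, assignment):
    return all(assignment.get(n) != value for n in neighbors[var])

def backtrack(assignment):
    if len(assignment) == len(variables):
        return assignment                        # all variables assigned
    var = next(v for v in variables if v not in assignment)
    for value in domains[var]:
        if consistent(var, value, assignment):   # try an assignment
            result = backtrack({**assignment, var: value})
            if result is not None:
                return result                    # success: pass it up
    return None                                  # dead end: backtrack

print(backtrack({}))  # e.g. {'WA': 'red', 'NT': 'green', 'SA': 'blue'}
```

Real solvers add variable/value ordering heuristics and interleave constraint propagation with this search.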
What is Generalized Arc Consistency (GAC) in CSPs?
GAC ensures that, for every constraint, each value in a variable’s domain participates in at least one assignment to the constraint’s other variables (drawn from their domains) that satisfies the constraint.
How does the Generalized Arc Consistency (GAC) algorithm work?
1. Initialize a queue with all constraints. 2. Make a variable’s domain consistent for each constraint. 3. If values are removed, re-check related constraints. 4. Repeat until no more changes.
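The loop above can be sketched for the binary-constraint case (the classic AC-3 algorithm); the three variables, their domains, and the ordering constraints X < Y < Z are invented for illustration.

```python
from collections import deque

domains = {"X": {1, 2, 3}, "Y": {1, 2, 3}, "Z": {1, 2, 3}}
# Directed arcs with their constraint checks: X < Y and Y < Z
constraints = {("X", "Y"): lambda a, b: a < b,
               ("Y", "X"): lambda a, b: b < a,
               ("Y", "Z"): lambda a, b: a < b,
               ("Z", "Y"): lambda a, b: b < a}

def revise(xi, xj):
    """Step 2: drop values of xi with no supporting value in xj's domain."""
    check = constraints[(xi, xj)]
    removed = {v for v in domains[xi]
               if not any(check(v, w) for w in domains[xj])}
    domains[xi] -= removed
    return bool(removed)

queue = deque(constraints)                 # step 1: enqueue every arc
while queue:                               # step 4: until no more changes
    xi, xj = queue.popleft()
    if revise(xi, xj):
        for xk, xl in constraints:         # step 3: re-check arcs into xi
            if xl == xi:
                queue.append((xk, xl))

print(domains)  # {'X': {1}, 'Y': {2}, 'Z': {3}}
```

Propagation alone solves this instance: each domain shrinks to a single value with no search at all.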
Why is Generalized Arc Consistency (GAC) useful?
It prunes inconsistent values early, reducing the search space and preventing needless backtracking.
What is a Bayesian Network?
A Bayesian Network is a directed acyclic graph whose nodes represent variables and edges indicate probabilistic dependencies.
How does inference work in a Bayesian Network?
1. Exact Inference (e.g., variable elimination). 2. Approximate Inference (e.g., Gibbs sampling, MCMC).
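Exact inference can be sketched by enumeration on a two-node network Rain → WetGrass; the probabilities below are invented for illustration, and variable elimination generalizes this summing-out idea to larger networks.

```python
# Prior P(Rain) and conditional P(WetGrass=true | Rain) -- invented numbers
p_rain = {True: 0.2, False: 0.8}
p_wet_given_rain = {True: 0.9, False: 0.1}

# Query: P(Rain=true | WetGrass=true), by enumerating the joint
joint = {r: p_rain[r] * p_wet_given_rain[r] for r in (True, False)}
posterior = joint[True] / (joint[True] + joint[False])
print(round(posterior, 3))  # 0.692: observing wet grass raises P(rain) from 0.2
```

This is just Bayes' rule written as a sum over the joint distribution; the graph structure tells us which conditional tables to multiply.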
What are some applications of Bayesian Networks?
They are used for medical diagnosis, spam filtering, robotics, and decision support systems.