Algorithms that Changed the World Flashcards

Question

What is the role of a gateway router in hierarchical routing?

Answer 1

It connects different ASes and runs both intra-AS and inter-AS routing protocols.

Answer 2

Performance — since all routers are under one admin, there’s no need for policy decisions.

Answer 3

Policy — administrators decide how traffic is routed between ASes.

Answer 4

Reduces routing table size, limits traffic update scope, improves performance, and supports routing policies

Answer 5

Routing Information Protocol (RIP) and Open Shortest Path First (OSPF)

Answer 6

BGP (Border Gateway Protocol)

Answer 7

RIP uses distance vector with hop count, while OSPF uses link-state with a full network map and Dijkstra’s algorithm.

Answer 8

It uses TCP to send messages like OPEN (establish session), UPDATE (announce/withdraw routes), KEEPALIVE (maintain session), and NOTIFICATION (error handling). Routing decisions are based on policies set by administrators, not just shortest path.

Answer 9

To find the value 𝑥 that makes a function 𝑓(𝑥) as small as possible.

Answer 10

A convex function has exactly one global minimum and no local “traps,” so simple methods will always find the true minimum.

Answer 11

f(tx1 + (1−t)x2) ≤ tf(x1) + (1−t)f(x2) for all 𝑡∈[0,1]

Answer 12

It means 𝑓 is convex (its slope is non-decreasing).

Answer 13

By noting that at the minimum 𝑥*, the derivative satisfies 𝑓′(𝑥*)=0.

Answer 14

Find a and b such that g(a) and g(b) have opposite signs, ensuring a root lies between.

Answer 15

Repeatedly split the interval in half, keeping the subinterval where the sign change occurs, until the root is located sufficiently precisely.

Answer 16

Find af(b)

Answer 17

(3− 5 )/2≈0.382

Answer 18

Fit a parabola through three points and jump to its vertex as a trial minimum.

Answer 19

To ensure the bracket always contains the minimum and to guarantee convergence when the parabola fit would step outside the bracket.

Answer 20

“Best” (lowest function value), “Good” (second-lowest), and “Worst” (highest).

Answer 21

Add up all the vertex coordinates except the worst one, then divide by the number of those remaining points.

Answer 22

You take the centroid, move away from the worst point by the same distance, and test that new point.

Answer 23

If its function value is better than the good point but not better than the best point.

Answer 24

If the reflected point beats the best point, you move further in that direction (twice as far from the centroid).

Answer 25

If the further-out point is even better, take it; otherwise keep the reflected point.

Answer 26

If the reflection is better than the worst but not better than the good point.

Answer 27

If the reflection is worse than the worst point; then you move half-way from the worst back toward the centroid.

Answer 28

You shrink the entire simplex by moving every point halfway toward the best point.

Answer 29

Reflection uses 1× the distance, expansion 2×, contraction ½×, and shrink ½×.

Answer 30

When all vertices are very close together, or when their function values differ by less than a tolerance.

Answer 31

Because it never computes or approximates any gradients—only compares function values.

Answer 32

It may stall or converge slowly on high-dimensional or very noisy problems.

Answer 33

Every new trial point must improve on the worst point, so the overall worst value can only get better over time.

Answer 34

It’s a quantity you choose (like how much to invest), represented by a nonnegative number.

Answer 35

Add a new nonnegative slack variable so that slack = right-hand side minus the weighted sum of decision variables.

Answer 36

It satisfies all constraints and has all variables nonnegative.

Answer 37

Because a linear objective over a convex feasible region always reaches its best value at a vertex.

Answer 38

Basic variables solve the current equations and take on nonzero values; nonbasic variables are held at zero.

Answer 39

Choose a nonbasic variable whose coefficient in the objective row would increase the objective if you raised it.

Answer 40

For each basic variable, calculate how much you can increase the entering variable before the basic one hits zero; the smallest limit determines who leaves.

Answer 41

Rewriting one equation to swap a nonbasic entering variable into the basis and push a basic one out, then updating all other equations.

Answer 42

When no nonbasic variable has a coefficient that can improve the objective—so you’re already at the best corner.

Answer 43

It measures the “distance” from a constraint wall to the current point along that constraint’s direction.

Answer 44

Because all changes are linear adjustments of rows; you only compare and combine numbers, no slopes involved.

Answer 45

That means the constraints conflict and there’s no solution meeting all of them.

Answer 46

The linear program is unbounded in that direction and the objective can go to infinity.

Answer 47

llocating advertisement budget across channels to maximize votes or sales under cost limits.

Answer 48

Because it’s fast in practice, simple to implement, and reliable for large, real-world problems.

Answer 49

A cycle-free subgraph that includes every vertex of the original graph.

Answer 50

Its total edge-weight sum is as small as possible among all spanning trees.

Answer 51

Arbitrarily, you can start anywhere.

Answer 52

The lowest-cost edge that connects a tree vertex to a non-tree vertex.

Answer 53

Because you only add edges that go from the current tree to outside vertices.

Answer 54

Maintain for each outside vertex v: the minimum edge weight bestCost[v] and its neighbor closest[v].

Answer 55

Once every vertex has been moved into the tree set.

Answer 56

O(v^2), because of each of the V-1 steps scans up to V vertices.

Answer 57

It models the cheapest way to ensure full connectivity without redundant links.

Answer 58

Designing cost-effective electrical grids, road networks, or communication backbones.

Answer 59

Two sets (or boolean arrays) for in-tree vs out-tree, plus arrays bestCost[] and closest[].

Answer 60

To build a spanning tree of minimum total weight by repeatedly adding the next-lightest edge that doesn’t form a cycle.

Answer 61

It takes the cheapest remaining edge that does not form a cycle with those already chosen.

Answer 62

So it can consider them in ascending cost order, ensuring the cheapest viable edge is always picked next.

Answer 63

A disjoint-set (union-find) that tracks which component each vertex belongs to.

Answer 64

O(ElogE), dominated by sorting the edges (which is O(ElogV) as well).

Answer 65

Exactly when you add an edge whose endpoints are currently in different groups.

Answer 66

The cheapest edge crossing any partition of the vertices must appear in every MST.

Answer 67

By always choosing the least-cost edge that keeps the forest cycle-free, it never forgoes a necessary cheap connection.

Answer 68

The graph is disconnected and no spanning tree exists.

Answer 69

Double all MST edges to form a circuit, then shortcut repeated vertices to produce a valid tour with cost < 2× optimum.

Answer 70

Because edge weights satisfy the triangle inequality, so skipping intermediate vertices can only shorten or keep the same path length.

Answer 71

To guarantee a unique MST; if weights tie, any one of the minimal spans is acceptable.

Answer 72

Sorting the edges, then repeatedly scanning the sorted list and using union-find to build the MST.

Answer 73

Because it makes the locally optimal (cheapest-edge) choice at each step without backtracking.

Answer 74

The smallest convex shape (intersection of all convex sets) that contains every point.

Answer 75

Choose the point with the smallest x-coordinate (if tied, the one with largest y).

Answer 76

It defines your incoming direction so you can measure clockwise angles from r→h.

Answer 77

Scan all other points (plus the start), pick the one with the smallest clockwise angle from your current direction.

Answer 78

As soon as you return to the initial starting point h, having wrapped all hull edges.

Answer 79

Because at each step it always picks the point that makes the smallest clockwise angle, ensuring you stay on the outside boundary.

Answer 80

O(|S|·|H|) where |S| = total points and |H| = hull points, making it output-sensitive.

Answer 81

Estimating an animal’s home range by “peeling” away outlier observations.

Answer 82

It implicitly breaks ties by the order of scanning.

Answer 83

Because it may scan all remaining points for each hull edge, leading to ∼n scans of ∼n points.

Answer 84

The point with the smallest x-coordinate (if tied, the one with smallest y).

Answer 85

By increasing polar angle around p₀ (counter-clockwise). If angles tie, keep only the farthest point.

Answer 86

A stack that holds the current hull vertices.

Answer 87

If the last two points on the stack and the new point make a clockwise (right) turn, you pop.

Answer 88

Once the stack’s top two points and the new point make a counter-clockwise (left) turn, then push the new point.

Answer 89

At most once each, so the scanning phase is O(n).

Answer 90

O(n log n), dominated by the initial sort.

Answer 91

Pre-discarding points that lie strictly inside a simple enclosing quadrilateral so they need not be sorted or scanned.

Answer 92

The farthest pair of points in a set must both lie on the hull, so you only check hull points.

Answer 93

Two sets are linearly separable only if their convex hulls do not intersect.

Answer 94

To detect errors at the receiver and request a retransmission from the sender.

Answer 95

It requires two-way feedback and adds delay while waiting for retransmits.

Answer 96

By appending one extra bit equal to the XOR of all data bits, catching any odd number of bit flips.

Answer 97

Any error that flips an even number of bits, since their XOR remains zero.

Answer 98

Treat the message as a polynomial, divide by a fixed generator polynomial, and append the remainder as check bits.

Answer 99

The polynomial x³ + x² + 1 (since coefficients map directly to bits).

Answer 100

Because shifts become multiplications by xⁿ and bitwise XOR is modular subtraction—ideal for hardware registers.

Answer 101

Shift the CRC register left with the next message bit in; if the shifted-out bit was 1, XOR with the low r bits of the generator.

Answer 102

Exactly N + r shift/XOR steps.

Answer 103

All single- and double-bit errors, all odd-numbered errors, all bursts up to 16 bits, and nearly all longer bursts.

Answer 104

It uses only small shift registers and XOR gates, with no multipliers or complex logic.

Answer 105

By dividing the received (message + CRC) by the same generator; a zero remainder means no detected errors.

Answer 106

Parity misses even-bit errors, whereas CRCs can catch structured bursts and patterns far beyond single bits.

Answer 107

To reserve space for the r-bit remainder and ensure proper alignment in the polynomial division.

Answer 108

Send each bit three times (000 or 111) and decode by majority vote; but it reduces throughput to one-third.

Answer 109

Three parity bits are interleaved among the four data bits in positions 1, 2, and 4 of the 7-bit word.

Answer 110

p1 = d1 ⊕ d2 ⊕ d4 p2 = d1 ⊕ d3 ⊕ d4 p3 = d2 ⊕ d3 ⊕ d4

Answer 111

The 7×4 generator matrix G via the product R=G⋅D (mod 2).

Answer 112

Multiply the received 7-bit vector by the 3×7 parity-check matrix H to get a 3-bit syndrome that indexes the error position.

Answer 113

Apply the 4×7 decoding matrix M to the corrected codeword, yielding the 4-bit data vector.

Answer 114

Any two-bit error in the 7-bit word.

Answer 115

The number of bit positions in which they differ—used to measure error-detecting/correcting capability.

Answer 116

Because the transmitter adds redundancy so errors can be corrected at the receiver without asking for retransmission.

Answer 117

Reed–Solomon codes and Turbo codes (also LDPC).

Answer 118

To remove redundant bits so that information (text, audio, images) uses fewer bits.

Answer 119

Because entropy measures the minimum information needed to distinguish symbols uniquely.

Answer 120

The theoretical lower bound on bits per symbol.

Answer 121

Assigning every symbol the same number of bits (e.g., 3 bits each for 5 symbols), regardless of frequency.

Answer 122

One that repeatedly makes the locally best choice, hoping this yields an overall optimal code.

Answer 123

They are prefix-free: no codeword is a prefix of another, so decoding can proceed bit by bit.

Answer 124

The two symbols (or nodes) with the smallest probabilities.

Answer 125

One branch gets “0” and the other “1” when you link them to their parent.

Answer 126

A min-heap (priority queue) keyed by frequency.

Answer 127

JPEG image compression and MPEG video compression.

Answer 128

Because the tree structure—and thus code lengths—depend on those frequencies.

Answer 129

It can misalign decoding and corrupt all subsequent symbols until resynchronization.

Answer 130

It codes whole symbols at a time and can’t achieve fractional-bit precision per symbol.

Answer 131

An adaptive dictionary model that learns new strings as it encodes.

Answer 132

Every single symbol (e.g. each byte/character) mapped to a unique codeword.

Answer 133

Pick the longest prefix of the remaining input that exists in the dictionary.

Answer 134

The string w followed by the very next symbol in the input.

Answer 135

It follows the same rule—after outputting the previous string, it appends the first character of the current string to form the new entry.

Answer 136

It constructs the entry as “previous output string + its own first character,” then outputs that.

Answer 137

0 1 2 4 3.

Answer 138

It uses only simple dictionary lookups and updates—no complex arithmetic or probability models.

Answer 139

GIF image compression and the Unix “compress” utility.

Answer 140

LZW adds many single‐symbol entries but gains little compression, so it performs poorly on highly random data.

Answer 141

That transaction history cannot be amended and double-spends are prevented.

Answer 142

By including the predecessor’s cryptographic hash in its header.

Answer 143

Any bit-flip in one block changes its hash, which then invalidates all later blocks’ hashes.

Answer 144

Finding a nonce so that the SHA-256 hash of the block meets a difficulty target (enough leading zeros).

Answer 145

Verifying just recalculates one hash; finding a valid nonce requires many trial hashes.

Answer 146

The longest chain (i.e., the one with the most total proof-of-work)

Answer 147

SHA-2, specifically SHA-256.

Answer 148

Ensure new transactions get added (liveness), old ones can’t change (persistence), and all nodes agree (consensus).

Answer 149

It’s energy-intensive and limits transaction throughput.

Answer 150

How to agree on a secret key or encrypt messages over an insecure channel with no private meeting

Answer 151

The shared secret key that Alice and Bob both compute but never send directly

Answer 152

A function easy to compute in one direction but hard to invert (e.g. modular exponentiation)

Answer 153

Multiply two large primes p and q to get n, compute φ=(p–1)(q–1), then pick a number e less than and coprime to φ and publish (n,e)

Answer 154

A number that satisfies e*d mod φ ≡ 1

Answer 155

Encryption: C = Mᵉ mod n; Decryption: M = Cᵈ mod n

Answer 156

Because recovering d or M from C without factoring n or solving the discrete logarithm is computationally infeasible

Answer 157

Every time they need a new shared secret—they can repeat the exchange as often as needed

Answer 158

The difficulty of discrete logarithms (DH) and integer factorization (RSA) respectively

Answer 159

To see which sine-wave frequencies make up the signal, which can simplify analysis and processing.

Answer 160

It expresses a continuous signal as an integral of all possible sine and cosine waves, yielding a spectrum function F(ω).

Answer 161

The DFT works on a finite list of N samples, replacing integrals with sums over those samples and producing N discrete frequency coefficients.

Answer 162

On the order of N^2 complex multiplications.

Answer 163

It reduces the DFT’s cost from N^2 down to roughly NlogN, making large transforms practical.

Answer 164

When the sample rate is too low, higher-frequency components get misinterpreted as lower ones.

Answer 165

Leakage is the spreading of energy into adjacent frequency bins because you only have a finite number of samples; applying window functions (Hanning, Hamming) helps.

Answer 166

By summing all frequency components multiplied by their complex exponentials and dividing by N (the inverse DFT).

Answer 167

To taper the signal edges and reduce spectral leakage in the resulting frequency spectrum.

Answer 168

log^2 N times, until you reach trivial 1-point DFTs.

Answer 169

Recursively split the DFT into two half-size DFTs (even and odd samples) and then cheaply combine them, reducing work from N^2 to Nlog^2 N.

Answer 170

Because after successive even/odd splits, the natural processing order corresponds exactly to the bit-reverse of the original indices.

Algorithms that Changed the World Flashcards

(196 cards)