DS Interview 2 Flashcards
Bias and Variance
Bias = underfitting (too simple)
Variance = overfitting (too complex)
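A minimal sketch of the idea, using made-up sine data and numpy (both are illustrative assumptions, not part of the card): a too-simple model underfits (high bias, high train and test error) while a too-complex one overfits (high variance, low train error but high test error).

import numpy as np

rng = np.random.default_rng(0)
x_train = np.sort(rng.uniform(0, 1, 30))
y_train = np.sin(2 * np.pi * x_train) + rng.normal(0, 0.3, 30)
x_test = np.sort(rng.uniform(0, 1, 30))
y_test = np.sin(2 * np.pi * x_test) + rng.normal(0, 0.3, 30)

for degree in (1, 15):
    coefs = np.polyfit(x_train, y_train, degree)   # fit polynomial of the given degree
    train_err = np.mean((np.polyval(coefs, x_train) - y_train) ** 2)
    test_err = np.mean((np.polyval(coefs, x_test) - y_test) ** 2)
    # degree 1: high bias -> both errors stay high (underfitting)
    # degree 15: high variance -> train error drops, test error stays high (overfitting)
    print(degree, round(train_err, 3), round(test_err, 3))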
Time Complexity:
looping over N elements
for i in range(n)
O(N), linear
Time Complexity:
Nested Loops
for i in range(n): for j in range(n):
O(N^2), quadratic
Time Complexity:
Sorting
- order by column (SQL)
- quick sort, merge sort
O(N log N)
Time Complexity:
Using an index
- where id = x (SQL)
O(log N), logarithmic
Space Complexity:
Single variable
O(1), constant
Space Complexity:
Array of size N
O(N), linear
Space Complexity:
2D matrix of size NxN
O(N^2), quadratic
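The complexity classes on the cards above can be illustrated with a short Python sketch (the helper functions are illustrative only, not tied to any particular database engine):

def linear_scan(items):            # O(N) time: one pass over N elements
    total = 0
    for x in items:
        total += x
    return total

def all_pairs(items):              # O(N^2) time: nested loops over N elements
    return [(a, b) for a in items for b in items]

def sort_items(items):             # O(N log N) time: comparison sort, like ORDER BY or merge sort
    return sorted(items)

def binary_search(sorted_items, target):   # O(log N) time: like an index lookup (where id = x)
    lo, hi = 0, len(sorted_items) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if sorted_items[mid] == target:
            return mid
        if sorted_items[mid] < target:
            lo = mid + 1
        else:
            hi = mid - 1
    return -1

# Space: a single counter is O(1), a list of N items is O(N),
# and an N x N matrix such as [[0] * n for _ in range(n)] is O(N^2).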
Complexity:
Select avg(sales) over (order by month rows between 2 preceding and current row) from sales_data
- O(N log N) time complexity, because the window's order by month requires sorting the rows
- O(1) space, since the moving-average frame holds at most 3 rows (2 preceding + current)
Complexity:
select customer
from orders
group by customer
having count(order_id)>2;
- Group by needs extra space of O(N) to hold the groups
- Time complexity is O(N log N): counting order_id per group is O(N), and a sort-based group by is O(N log N) (a hash-based group by would be O(N))
Complexity:
Select *
From (
    Select *, rank() over (partition by department order by salary desc) as rank_salary
    From employees
) ranked
Where rank_salary <= 3;
- O(N log N) time, because the database sorts the rows within each department partition by salary
- O(N) space for the storage required when partitioning
Complexity:
select tweet_id
from Tweets
where length(content)>15;
- Time complexity: O(N) full table scan
- O(1) space complexity, no extra storage beyond the query execution
Complexity:
def count_salary_categories(accounts: pd.DataFrame) -> pd.DataFrame:
    low = len(accounts[accounts.income < 20000])
    high = len(accounts[accounts.income > 50000])
    avg = len(accounts) - low - high
    return pd.DataFrame({
        'category': ['Low Salary', 'Average Salary', 'High Salary'],
        'accounts_count': [low, avg, high]
    })
- Time complexity: O(N). Each boolean filter scans the table (O(N)); len() itself is O(1), and building the 3-row result DataFrame is O(1).
- Space complexity: O(1) held at the end. The counts are plain integers and the result DataFrame has a constant 3 rows (the intermediate boolean masks are O(N) but are discarded).
What’s a holdout set?
A holdout set is a portion of data that is separated from the training set and used to evaluate the performance of a machine learning model after it has been trained.
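A minimal sketch of carving out a holdout set, assuming scikit-learn and its bundled iris dataset (both are illustrative choices, not part of the card):

from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.datasets import load_iris

X, y = load_iris(return_X_y=True)
# 20% of the data is held out and never touched during training.
X_train, X_holdout, y_train, y_holdout = train_test_split(X, y, test_size=0.2, random_state=42)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print(model.score(X_holdout, y_holdout))   # evaluate only on the holdout set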
When should you use grid search vs. random search for hyperparameter tuning?
Grid search tries every possible parameter combination (computationally expensive). Random search tries only a random subset of combinations but usually performs nearly as well.
Use Grid Search when you have <100 combinations and want an exhaustive, precise search.
Use Random Search when you have hundreds or thousands of possible combinations and need efficiency.
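A minimal sketch contrasting the two, assuming scikit-learn's GridSearchCV and RandomizedSearchCV with a made-up random forest parameter grid:

from sklearn.model_selection import GridSearchCV, RandomizedSearchCV
from sklearn.ensemble import RandomForestClassifier
from sklearn.datasets import load_iris

X, y = load_iris(return_X_y=True)
params = {"n_estimators": [50, 100, 200], "max_depth": [3, 5, None]}

# Grid search: tries all 9 combinations (fine when the grid is small).
grid = GridSearchCV(RandomForestClassifier(random_state=0), params, cv=3).fit(X, y)

# Random search: samples only n_iter combinations (better for large search spaces).
rand = RandomizedSearchCV(RandomForestClassifier(random_state=0), params,
                          n_iter=5, cv=3, random_state=0).fit(X, y)

print(grid.best_params_, rand.best_params_)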
Why are insertions and deletions more efficient in linked lists than arrays?
Arrays require shifting elements when inserting/deleting from the beginning or middle, making these operations O(n). Linked lists use pointers, allowing O(1) insertion/deletion at the head since no shifting is needed. However, inserting or deleting in the middle or end still requires traversal (O(n)).
What is the time complexity of inserting an element at the head of a linked list?
O(1). The new node is created, pointed to the current head, and the head pointer is updated. No shifting is required.
What is the time complexity of inserting an element in the middle of a linked list?
O(n). The list must be traversed to find the insertion point before updating pointers.
What is the time complexity of deleting an element at the head of a linked list?
O(1). The head pointer is updated to the next node, and the old head is freed.
What is the time complexity of deleting an element in the middle of a linked list?
O(n). The list must be traversed to locate the node before updating pointers.
How does inserting in an array compare to inserting in a linked list?
Inserting at the head of an array is O(n) due to shifting, while it is O(1) in a linked list. Inserting in the middle is O(n) for both. Inserting at the end is O(1) (amortized) for a dynamic array, and O(n) for a linked list unless a tail pointer is kept, which makes end insertions O(1).
How does deleting in an array compare to deleting in a linked list?
Deleting at the head of an array is O(n) due to shifting, while it is O(1) in a linked list. Deleting in the middle is O(n) for both. Deleting at the end is O(1) for an array, but O(n) for a linked list unless it is doubly linked with a tail pointer, which makes end deletions O(1).
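A minimal singly linked list sketch illustrating the cards above (the Node class and helper functions are hypothetical names): head operations are O(1), while reaching the middle requires O(n) traversal.

class Node:
    def __init__(self, value, next=None):
        self.value = value
        self.next = next

def insert_head(head, value):
    return Node(value, head)              # O(1): new node points to the old head

def delete_head(head):
    return head.next if head else None    # O(1): just move the head pointer forward

def insert_after_index(head, index, value):
    node = head
    for _ in range(index):                # O(n): traverse to the insertion point
        node = node.next
    node.next = Node(value, node.next)    # pointer update itself is O(1)
    return head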
How does Merge Sort work, and what is its time complexity?
Merge Sort is a divide-and-conquer algorithm that:
1. Recursively splits the array into two halves until each half has one element.
2. Merges the sorted halves back together in a sorted order.
Time Complexity:
- Best, Worst, and Average Case: O(n log n)
- Space Complexity: O(n) (due to extra space for merging)
- Stable? Yes (maintains relative order of equal elements)
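A minimal merge sort sketch in Python (illustrative, not an optimized implementation):

def merge_sort(arr):
    if len(arr) <= 1:                 # base case: 0 or 1 element is already sorted
        return arr
    mid = len(arr) // 2
    left = merge_sort(arr[:mid])      # recursively sort each half
    right = merge_sort(arr[mid:])
    merged, i, j = [], 0, 0           # merge the sorted halves: O(n) extra space
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:       # <= keeps equal elements in order (stable)
            merged.append(left[i]); i += 1
        else:
            merged.append(right[j]); j += 1
    return merged + left[i:] + right[j:]

print(merge_sort([5, 2, 9, 1, 5, 6]))   # [1, 2, 5, 5, 6, 9]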
What is Quick Sort, and why is it efficient in practice?
Quick Sort is a divide-and-conquer algorithm that:
1. Selects a pivot element.
2. Partitions the array into elements smaller and larger than the pivot.
3. Recursively sorts the subarrays.
Time Complexity:
- Best and Average Case: O(n log n)
- Worst Case: O(n²) (if the pivot is always the smallest or largest element)
- Space Complexity: O(log n) (for recursion)
- Stable? No (swaps can change the relative order of equal elements)
- Why is it fast in practice? Has good cache locality and usually outperforms Merge Sort for large datasets.
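A minimal quick sort sketch in Python; note that this list-comprehension version is not in place, so it uses O(n) extra space rather than the O(log n) recursion space of the in-place partitioning described above:

def quick_sort(arr):
    if len(arr) <= 1:
        return arr
    pivot = arr[len(arr) // 2]                    # pivot choice drives the worst case
    smaller = [x for x in arr if x < pivot]
    equal = [x for x in arr if x == pivot]
    larger = [x for x in arr if x > pivot]
    return quick_sort(smaller) + equal + quick_sort(larger)   # recurse on each partition

print(quick_sort([5, 2, 9, 1, 5, 6]))   # [1, 2, 5, 5, 6, 9]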