Exam 2 Questions Flashcards

(30 cards)

1
Q

Recommender system for a newly launched online bookstore: 1 million books, but only 10,000 ratings. Which is the best choice of recommender system?
a. User-User collaborative filtering
b. Item-Item collaborative filtering
c. Content-based recommendation

A

c.) Content-based recommendation

Content-based recommendation does not rely on user-user or item-item similarities, but on the underlying features of the items themselves - in this case, the books. It can therefore overcome the cold-start problem that CF methods suffer from, and it can be applied to a sparse user-item matrix.

2
Q

Matrix factorization method:
Determine the Baseline Estimate of the User’s rating of an Item
* Global avg. rating r_g = 3.4
* User avg. rating r_u = 3.0 -> User bias = r_u - r_g = -0.4
* Item avg. rating r_i = 4.1 -> Item bias = r_i - r_g = +0.7

A

Baseline Estimate:
r_g + User bias + Item bias = 3.4 - 0.4 + 0.7 = 3.7
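A minimal Python sketch of this computation (the function name and signature are just for illustration):

```python
def baseline_estimate(r_g, r_u, r_i):
    """Baseline estimate: global average + user bias + item bias."""
    user_bias = r_u - r_g   # 3.0 - 3.4 = -0.4
    item_bias = r_i - r_g   # 4.1 - 3.4 = +0.7
    return r_g + user_bias + item_bias

print(round(baseline_estimate(3.4, 3.0, 4.1), 2))  # 3.7
```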

3
Q

Item-item collaborative filtering application steps: predict the rating of an unknown user-item pair.

A
  1. Calculate the average rating of each item
  2. Compute the similarity between items, e.g.:
    a. Cosine similarity
    b. Jaccard similarity
    c. Pearson similarity: first subtract each item's average from its ratings, then compute the cosine similarity on the centered ratings
  3. Calculate the weighted average (see the sketch below):
    a. Choose the n items with the highest similarity to the item in question
    b. Multiply each of the n similarities by that item's rating and sum the products
    c. Divide by the sum of the n similarities
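A minimal numpy sketch of these steps, assuming a small ratings matrix R (rows = users, columns = items) where 0 means "unrated"; the function name and the toy data are made up for illustration:

```python
import numpy as np

def predict_item_item(R, user, item, n=2):
    mask = R > 0
    # 1. Average rating of each item (over observed ratings only)
    item_avg = R.sum(axis=0) / np.maximum(mask.sum(axis=0), 1)
    # 2. Pearson similarity = cosine similarity on mean-centered ratings
    centered = np.where(mask, R - item_avg, 0.0)
    target = centered[:, item]
    norms = np.linalg.norm(centered, axis=0) * np.linalg.norm(target)
    sims = centered.T @ target / np.maximum(norms, 1e-12)
    # 3. Weighted average over the n most similar items the user has rated
    rated = np.flatnonzero(mask[user])
    rated = rated[rated != item]
    top = rated[np.argsort(sims[rated])[-n:]]
    return sims[top] @ R[user, top] / np.abs(sims[top]).sum()

R = np.array([[5, 4, 0],
              [4, 5, 3],
              [1, 2, 1]], dtype=float)
print(predict_item_item(R, user=0, item=2))  # predicted rating for the 0 entry
```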
4
Q

Application steps of Hierarchical Clustering

A
  1. Merge the two nearest clusters (in the beginning, every point is its own cluster)
  2. Compute the new centroid of the merged cluster
  3. Repeat until one cluster remains or a stopping criterion is met (see the sketch below)
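A short sketch using scipy's agglomerative implementation, which performs exactly this merge-and-recompute loop (the toy points are made up):

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

X = np.array([[0.0, 0.0], [0.1, 0.2], [5.0, 5.1], [5.2, 4.9]])
# 'centroid' linkage merges the two nearest clusters, then recomputes the centroid
Z = linkage(X, method="centroid")
labels = fcluster(Z, t=2, criterion="maxclust")  # cut the tree into 2 clusters
print(labels)  # e.g. [1 1 2 2]
```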
5
Q

T/F: GMMs are a method of hard clustering.

A

False

GMMs assign each point a probability of belonging to each cluster, which makes them a soft clustering method.
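A quick scikit-learn illustration on made-up 1-D data: predict_proba exposes the soft (probabilistic) memberships, while predict collapses them to hard labels:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

X = np.array([[0.0], [0.2], [4.0], [4.2], [2.0]])
gmm = GaussianMixture(n_components=2, random_state=0).fit(X)
print(gmm.predict_proba(X))  # soft: each row is a probability over the components
print(gmm.predict(X))        # hard labels, the argmax of the probabilities
```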

6
Q

T/F: The updating process (i.e., updating centers and assigning data points to centers) of K-means is similar to the updating process of EM.

A

True

Both alternate between assigning data points to centers (the assignment step / E-step) and updating the centers based on those assignments (the update step / M-step).

7
Q

T/F: EM is a globally optimized algorithm.

A

False

As is typical for iterative optimization algorithms, EM only converges to a local optimum, which depends on the initialization.

8
Q

T/F: EM includes the steps of Expectation (E) and Marginalization (M).

A

False

EM consists of the Expectation (E) and Maximization (M) steps, not Marginalization.

9
Q

Given a set of training data and two clustering algorithms - K-means, and a Gaussian mixture model trained using EM - will these two algorithms produce the same cluster centers (means) for this data set? Explain why or why not.

A

Very similar, but not necessarily equal. Since the data is separated very clearly, both K-means and the GMM will group it into two clusters. However, due to the probabilistic nature of the EM method that the GMM uses - its centers are probability-weighted means over all points - the centers of K-means and the GMM can differ slightly.

10
Q

Consider m data points X and k mixture distributions Omega, where wij denotes the membership of xi in mixture j. Let N(xi | omega_j) denote the probability that xi is drawn from the Gaussian distribution N(omega_j). Derive the optimization objective of the GMM (i.e., the likelihood of X being generated by the GMM).

A

Deriving the GMM objective:
  1. Take the weighted sum of the likelihoods of xi belonging to each Gaussian distribution omega_j:

P(xi | Omega) = sum [j=1,k] ( wij * N(xi | omega_j) )

  2. Compute the likelihood of all m data points:

L(Omega) = prod [i=1,m] ( P(xi | Omega) )

  3. Maximize L(Omega) w.r.t. wij and Omega, subject to the constraint sum [j=1,k] (wij) = 1
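A minimal numpy/scipy sketch of this objective, using shared mixture weights w_j in place of the per-point memberships wij and working in log space for numerical stability; all parameter values are made-up placeholders:

```python
import numpy as np
from scipy.stats import norm

x = np.array([0.1, 0.3, 3.9, 4.2])   # m = 4 data points
w = np.array([0.5, 0.5])             # mixture weights, sum to 1
mu = np.array([0.0, 4.0])            # Gaussian means
sigma = np.array([1.0, 1.0])         # Gaussian standard deviations

# P(xi | Omega) = sum [j=1,k] ( w_j * N(xi | omega_j) )
p_xi = (w * norm.pdf(x[:, None], mu, sigma)).sum(axis=1)

# L(Omega) = prod [i=1,m] ( P(xi | Omega) ); take the log to avoid underflow
log_likelihood = np.log(p_xi).sum()
print(log_likelihood)
```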
11
Q

What are the three categories of Machine Learning to Rank tasks?

A
  1. Bipartite LTR
  2. K-partite LTR
  3. Real-value-based LTR
12
Q

What are the differences between RankSVM, RankBoost, and RankNet in terms of loss functions?

A

RankSVM: Hinge loss = max(1 - x, 0)
Not smooth (kink at x = 1); penalizes incorrect rankings moderately (linearly)

RankBoost: Exponential loss = exp(-x)
Smooth; penalizes incorrect rankings heavily

RankNet: Logistic loss = log(1 + exp(-x))
Smooth; penalization more moderate than the exponential loss
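A small sketch of the three pairwise losses as functions of the score difference x of a ranked pair (standard textbook formulations; individual papers differ in details):

```python
import numpy as np

def hinge_loss(x):        # RankSVM: not smooth (kink at x = 1), linear penalty
    return np.maximum(1.0 - x, 0.0)

def exponential_loss(x):  # RankBoost: smooth, penalizes wrong rankings heavily
    return np.exp(-x)

def logistic_loss(x):     # RankNet: smooth, more moderate than exponential
    return np.log(1.0 + np.exp(-x))

x = np.linspace(-2.0, 2.0, 5)
for f in (hinge_loss, exponential_loss, logistic_loss):
    print(f.__name__, f(x))  # each loss decreases monotonically in x
```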

13
Q

Why do the loss functions of RankSVM, RankBoost, and RankNet all monotonically decrease?

A

The argument of the loss function is the score difference of a ranked pair: the lower its value, the more incorrect the ranking is. Thus, lower values must be penalized more than higher values, i.e., all loss functions must monotonically decrease.

14
Q

T/F: the K-partite ranking uses a divide and conquer strategy to decompose the K-partite ranking task into a single bipartite ranking task.

A

False.

K-partite ranking decomposes the task into multiple bipartite ranking tasks, not a single one.

15
Q

Which ranking model is better? Explain why.
        Golden-standard     Model #1   Model #2
        (actual) scores
Item A       0.90             0.88     1000000
Item B       0.85             0.89       10000
Item C       0.80             0.83          10

A

Model #2 is better.

Ranking is concerned with the order of the objects, not with how close the predicted values are to the actual scores. Model #1 predicts values close to the gold standard but gets the order wrong (it ranks Item B above Item A); Model #2 predicts the order correctly and separates the items well, too.
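One way to make "order matters, not values" concrete is a rank correlation such as Kendall's tau (via scipy); the scores are taken from the table above:

```python
from scipy.stats import kendalltau

gold    = [0.90, 0.85, 0.80]       # Items A, B, C
model_1 = [0.88, 0.89, 0.83]       # ranks B above A: wrong order
model_2 = [1_000_000, 10_000, 10]  # correct order despite extreme values

tau1, _ = kendalltau(gold, model_1)
tau2, _ = kendalltau(gold, model_2)
print(tau1, tau2)  # ~0.33 (one discordant pair) vs 1.0 (perfect agreement)
```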

16
Q

What are the six key components of a reinforcement learning framework?

A
  1. Agent
  2. Environment
  3. Action
  4. Observation
  5. State
  6. Reward
17
Q

Answer True or False only: In a reinforcement learning system, after taking an action, the state of the environment must change.

A

False

An action does not have to change the state; the environment can transition back to the same state (a self-transition).

18
Q

What are the benefits of Deep-Q Network (DQN) over classic Q-learning (i.e., the Q table)?

A
  1. Can handle large and/or high-dimensional state spaces, and is thus scalable
  2. Generalizes better by learning the underlying features of the states
19
Q

Which action should the agent take when st = s2 at time t? Why?
      s1     s2     s3     s4
a1   1.20   0.60   2.50   1.40
a2   0.33   2.40   1.31   1.27
a3  -0.90  -2.80   1.00   0.80

A

Action a2.

The policy is to take the action that maximizes Q(s, a):
Q(s2, a2) = 2.40 > Q(s2, a1) = 0.60 > Q(s2, a3) = -2.80

20
Q

After taking the action suggested in Q4.1 (a2), suppose the discount factor is gamma = 0.9, the state transfers from s2 to s4 after taking action a2, and the reward r is 0.7.

Table:
      s1     s2     s3     s4
a1   1.20   0.60   2.50   1.40
a2   0.33   2.40   1.31   1.27
a3  -0.90  -2.80   1.00   0.80

Please update the Q-table and write down the updated Q-table.

Note: Only one value in the table needs updating, and you might need the Bellman Equation:

Q(st, at) = r(st, at) + gamma * max [at+1] { Q(st+1, at+1) }

A
  1. Determine the max Q-value of the new state:
    max [a] { Q(s4, a) } = Q(s4, a1) = 1.4
  2. Update the Q-value of the earlier state-action pair:
    Q'(s2, a2) = 0.7 + 0.9 * 1.4 = 1.96
  3. Updated table:

          s1     s2     s3     s4
    a1    1.20   0.60   2.50   1.40
    a2    0.33   1.96   1.31   1.27
    a3   -0.90  -2.80   1.00   0.80
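The same update as a numpy sketch (rows = actions a1-a3, columns = states s1-s4, mirroring the table above):

```python
import numpy as np

Q = np.array([[ 1.20,  0.60, 2.50, 1.40],
              [ 0.33,  2.40, 1.31, 1.27],
              [-0.90, -2.80, 1.00, 0.80]])
gamma, r = 0.9, 0.7
a, s, s_next = 1, 1, 3                    # a2, s2, s4 (0-indexed)

Q[a, s] = r + gamma * Q[:, s_next].max()  # Bellman: 0.7 + 0.9 * 1.4 = 1.96
print(Q)
```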
21
Q

What are the weaknesses of policy gradient?

A
  • Requires many transition data points to train the policy function
  • Computationally expensive
22
Q

Reinforcement learning and feature selection are two machine learning tasks. In reinforcement learning, we take actions to interact with the environment and change the state of the environment, receive rewards for the actions, and update policy functions until the reward is maximized. In feature selection, we select a subset of features, observe model accuracy, and then re-select features until the model's accuracy is maximized.
1. What are the commonalities and distinctions between reinforcement learning and feature selection?
2. Can you use reinforcement learning to conduct feature selection?
3. If yes, how are you going to design such reinforcement learning based feature selection system? If not, please explain why.

A

Similarities:
1. Both are iterative processes that maximize a value (reward / accuracy)
2. Both repeatedly choose among options (actions / feature subsets) and evaluate the outcome with a model

Distinctions:
1. RL is typically sequential - previous action/state pairs influence subsequent ones; feature selection is not sequential, the order of selections does not matter
2. RL actions change the environment; feature selection doesn't change the set of features to choose from

Yes.

Design (a simplified sketch follows below):
State: the subset of currently selected features
Action: adding or removing certain features
Reward: accuracy of the trained model
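A deliberately simplified sketch of such a system; the evaluate helper is a hypothetical stand-in for training and scoring a model, and the greedy random-search policy would be replaced by a learned policy (e.g., a DQN) in a real design:

```python
import random

def evaluate(features):
    """Hypothetical stand-in: train a model on `features`, return its accuracy."""
    return random.random()

all_features = ["f1", "f2", "f3", "f4"]
state = set()                                  # state: currently selected features
best_acc = 0.0
for step in range(20):
    # action: add an unselected feature or remove a selected one
    actions = [("add", f) for f in all_features if f not in state] \
            + [("remove", f) for f in state]
    op, f = random.choice(actions)             # random policy, for the sketch only
    new_state = state | {f} if op == "add" else state - {f}
    reward = evaluate(new_state)               # reward: model accuracy
    if reward > best_acc:                      # keep the subset if it improved
        state, best_acc = new_state, reward
print(state, best_acc)
```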

23
Q

Multiple choice: Which of the following algorithms is/are not an example of an ensemble method?

A. Learning to Rank
B. Random Forest
C. AdaBoosting
D. Decision Tree

A

A and D. Learning to Rank and a single Decision Tree are not ensemble methods; Random Forest and AdaBoost both are.

24
Q

Which of the following options is/are correct regarding the benefits of ensemble models?
1. Better performance
2. Generalized and robust models
3. Better explainability and interpretability

A. 1 and 3
B. 2 and 3
C. 1 and 2
D. 1, 2 and 3

A

C. 1 and 2. Ensembles improve performance, generalization, and robustness, but they typically reduce explainability and interpretability compared to a single model.

25
Q

True or False: Ensemble learning can only be applied to supervised learning methods.
A. True B. False

A

B. False

Ensemble ideas also apply to unsupervised learning, e.g., consensus clustering.
26
Q

True or False: Ensembles will yield bad results when there is significant diversity among the models. Note: all individual models have meaningful and good predictions.
A. True B. False

A

B. False

Diversity among good base models generally improves ensemble results, since it reduces the chance that the models make the same errors.
27
Q

True or False: An ensemble of classifiers may or may not be more accurate than any of its individual models.
A. True B. False

A

A. True
28
Q

In an election, N candidates are competing against each other and people are voting for one of the candidates. Voters don't communicate with each other while casting their votes. Which of the following ensemble methods works similarly to the above-described election procedure? Hint: persons are like the base models of an ensemble method.
A. Bagging B. Boosting C. A or B D. None of these

A

A. Bagging

Independent voters whose votes are aggregated correspond to independently trained base models whose predictions are aggregated.
29
Q

Suppose you are working on a binary classification problem, and there are 3 models, each with 70% accuracy. If you ensemble these models using majority voting, what is the minimum accuracy you can get?
A. Always greater than 70% B. Always greater than or equal to 70% C. It can be less than 70% D. None of these

A

C. It can be less than 70%

Each model is wrong on 30% of cases, and in the worst case the errors overlap so that two models are wrong at the same time on up to (3 x 30%) / 2 = 45% of cases, giving a majority-vote accuracy as low as 55%.
30
Q

Suppose that in a binary classification problem you are given the following predictions of three models (M1, M2, M3) for five observations of the test data set. What will be the output of the ensemble model if we use the majority voting method?

M1  M2  M3
1   1   0
0   1   0
0   1   1
1   0   1
1   1   1

A

[1, 0, 1, 1, 1]
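The same majority vote computed with numpy (rows = observations, columns = M1, M2, M3):

```python
import numpy as np

preds = np.array([[1, 1, 0],
                  [0, 1, 0],
                  [0, 1, 1],
                  [1, 0, 1],
                  [1, 1, 1]])
majority = (preds.sum(axis=1) >= 2).astype(int)  # 1 wins if at least 2 of 3 vote 1
print(majority)  # [1 0 1 1 1]
```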