Final Exam: Recommender Systems Flashcards

1
Q

What is a recommender system?

A

An information filtering system that provides suggestions for items that are most pertinent to a particular user. An important part of the information and e-commerce ecosystem.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the value of a recommender system?

A

An important part of the information and e-commerce ecosystem.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the types of recommender systems?

A
  1. Content-based filtering
  2. Collaborative filtering
  3. Hybrid approach
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the breakdown of collaborative filtering?

A
  1. Model-based filtering (Bayesian Networks and Neural Networks)
  2. Memory-base filtering (Item-based approach and User-based approach)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is an item profile?

A

A profile that contains the description or features of a collection of similar items.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a user profile?

A

A profile that contains all the item profiles whose items are rated positively by the user.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the drawbacks of content-based recommender systems?

A
  1. Require in-depth knowledge of the item features
  2. Nontrivial aggregation
  3. Limited capacity to expand on the users’ existing choices or interests
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the advantages of content-based recommender systems?

A
  1. Quickly adapting to user’s dynamic preferences
  2. Does not rely on other users’ data
  3. Ensure security and privacy of user data
  4. Overcome the cold-start problem existing in the collaborative algorithms
  5. Common in personalized news, publications, web page recommendations, etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is collaborative filtering-based recommender systems?

A

Use the similarity between users or the similarity between items. Items liked by similar users will be recommended to the user.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the drawbacks of collaborative filtering-based recommender systems?

A
  1. Nontrivial and time-consuming to find the neighborhood of a target user
  2. Rely on a LARGE data set of users’ activities
  3. Cold-start problem
  4. Privacy concerns
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the advantages of collaborative filtering-based recommender systems?

A
  1. Does not require item features
  2. Ability to expand on the user’s existing interests by discovering new items
  3. Flexible algorithms
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is memory-based recommendation?

A

A conversion from a utility matrix to a recommendation.
1. The target user’s profile must exist in the utility matrix
2. Item-based, user-based

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a model-based recommendation?

A

A recommendation utilized to predict the user’s rating for an unrated item.
1. The target user’s profile does not necessarily exist in the training process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is key across both user-based filtering and item-based collaborative filtering?

A

Similarity Computation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is an example of a similarity computation?

A

Pearson Correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a cold start?

A

When insufficient data is utilized to draw meaningful inference.
Remedy: Incentivize users to provide feedback; collect additional data about user and items

17
Q

What is a shilling attack?

A

When fake data is implemented into a database to increase or decrease some items’ popularity.
Remedy: Monitor abnormalities of user behavior

18
Q

What is a latency problem (lack of exploration)?

A

When new items are frequently inserted into the database, but the system fails to explore.
Remedy: Design algorithms to encourage the exploration of new items.

19
Q

What is the sparsity problem?

A

When there are too many NaNs in the input data.
Remedy: Design hybrid approaches, potentially model-based methods.

20
Q

What is scalability?

A

When some algorithms are hard to scale to large amounts of data.
Remedy: Dimension reduction; clusters of data.

21
Q

What is the formula for Pearson’s Correlation?

A

Pxy = corr(x,y) = E[(X-Mean of x)(Y-Mean of y)]/SDy x SDx