Quiz 8 Flashcards

1
Q
  1. Which of the following is NOT a characteristic of the random walk approach to PageRank?
    A. It is a non-deterministic algorithm
    B. It is a simulation of how users navigate the web
    C. It requires a large number of iterations to converge
    D. It produces accurate PageRank scores for all web pages
A

D. It produces accurate PageRank scores for all web pages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q
  1. What are the benefits of using a teleport set?
    A. It can improve the accuracy of PageRank scores for certain types of webpages
    B. It can make PageRank scores more robust to changes in the web graph
    C. It can make PageRank score more computationally efficient to calculate
    D. It can make PageRank score more scalable to large web graphs
A

A. It can improve the accuracy of PageRank scores for certain types of webpages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q
  1. How is the teleporting approach different between topic-specific page rank and random walk with restarts?
    A. Both algorithms teleport to the initial node of the graph
    B. Random walk with restarts teleports to a single node whereas topic specific page rank cannot teleport
    C. Random walk with restarts teleports to a single node whereas topic specific page rank teleport to multiple nodes
    D. Random walk with restarts teleports to multiple nodes of the graph whereas topic specific page rank can teleport to only a single node.
A

C. Random walk with restarts teleports to a single node whereas topic specific page rank teleport to multiple nodes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q
  1. How does Pinterest use PageRank to rank pins in search results?
    A. It assigns a PageRank score to each pin based on the number of repins it has
    B. It considers the quality of the pins that a user has visited in the past
    C. It analyzes the content of the pins, such as keywords and descriptions, to determine their relevance to a given query
    D. It analyzes the occurrence of search queries and calculates the page rank directly
A

B. It considers the quality of the pins that a user has visited in the past

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q
  1. True | False Topic-specific PageRank is primarily useful for search engines when ranking results for general queries.
A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
  1. True | False PageRank is a perfect measure of web page importance and cannot be manipulated by spam farms.
A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q
  1. True | False While using spam mass estimation technique to combat link spam, picking the seed set manually is an essential step of the process.
A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q
  1. True | False In the Pixie Random Walk, the early stopping criterion is based on the number of steps taken by the random walker.
A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q
  1. How do spammers use link farms to increase the page rank of other web pages?
A

Spammers use other webpages in which they also own to promote their target website.

In which sometimes other webpages that they don’t own end up promoting it as well.

Spammer gets as many links as possible from the accessible pages to grow.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q
  1. Consider the following Graph
    Compute the PageRank of each node in the figure above for 2 iterations, assuming b = 0.9. Assume only nodes 4 and 5 as trusted pages. (Please not that there is a b modifier)
A

Go over it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q
  1. Write a pseudocode for the Pixie Random Walk algorithm and describe how similarity is computed between the nodes of a graph.
A

node = query_node.sample()
for i in range (steps)
board_node = node.get_random_neighbor()
pin_node=board_node.get_random_neighbor()
pin_node.count += 1
if random() < alpha
pin_node = query_node.sample()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly