Sparrow (CC, Distributed Comptuing) Flashcards

(5 cards)

1
Q

What is the Background (Sparrow)?

A

Sparrow is a distributed, low-latency scheduling system. It’s designed as an alternative to centralized scheduling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the Problem (Sparrow)?

A
  • Traditional centralized schedulers struggle to provide millisecond-scale scheduling delay and support millions of task scheduling decisions per second.
  • Modifying today’s centralized schedulers to support sub-second parallel tasks presents a difficult engineering challenge.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the solution to the problem (Sparrow)?

A
  • Sparrow employs a decentralized architecture with many parallel schedulers that don’t maintain state information.
  • It utilizes batch sampling, extending it to prevent race conditions.
  • Sparrow incorporates techniques inspired by Tarcil, a shared-state scheduler.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the applications (Sparrow)?

A

The provided document does not explicitly detail real-life applications of Sparrow. However, it suggests applications in low-latency user-facing services like language translation or online search

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Strengths and Weaknesses (Sparrow)?

A

Advantages:
- Decentralization: Easy to add more schedulers.

  • Load Balancing: Uses batch sampling.
  • Fault Tolerance: Frameworks using Sparrow can automatically detect scheduler failures and failover to a backup scheduler in less than 120ms.
  • Scalability: Designed for scaling to large clusters. Simulation results suggest it performs well as cluster size increases, even with tens of thousands of cores.
  • Achieves median response times within 12% of ideal.
    It supports policies and constraints needed by analytics frameworks.

DisAdvantages:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly