Sparrow (CC, Distributed Computing) Flashcards
(5 cards)
1
Q
What is the Background (Sparrow)?
A
Sparrow is a distributed, low-latency scheduling system. It’s designed as an alternative to centralized scheduling.
2
Q
What is the Problem (Sparrow)?
A
- Traditional centralized schedulers struggle to provide millisecond-scale scheduling delay and support millions of task scheduling decisions per second.
- Modifying today’s centralized schedulers to support sub-second parallel tasks presents a difficult engineering challenge.
3
Q
What is the solution to the problem (Sparrow)?
A
- Sparrow employs a decentralized architecture with many parallel schedulers that don’t maintain state information.
- It uses batch sampling and extends it with late binding to avoid race conditions.
- Sparrow incorporates techniques inspired by Tarcil, a shared-state scheduler.
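As a rough illustration of the batch sampling idea above, here is a minimal Python sketch. It assumes a job of m tasks probes d*m randomly chosen workers and places the tasks on the m least-loaded of them. The function name `batch_sample`, the `probe_ratio` parameter, and the list of queue lengths are hypothetical and are not taken from Sparrow's code. The sketch also does not model the extension that prevents race conditions.

```python
import random


def batch_sample(queue_lengths, num_tasks, probe_ratio=2, rng=random):
    """Choose workers for one job's tasks with batch sampling (illustrative only).

    queue_lengths: mutable list, queue_lengths[w] = tasks queued on worker w
                   (a hypothetical stand-in for the replies to probes).
    num_tasks:     number of tasks (m) in the job.
    probe_ratio:   probes per task (d); d * m workers are probed in total.
    Returns the list of chosen worker indices, one per task.
    """
    num_workers = len(queue_lengths)
    if num_tasks > num_workers:
        raise ValueError("this sketch assumes at most one task per worker")

    # Probe d * m distinct workers chosen uniformly at random.
    num_probes = min(num_workers, probe_ratio * num_tasks)
    probed = rng.sample(range(num_workers), num_probes)

    # Share the probe results across the whole job: place the m tasks
    # on the m least-loaded workers among those probed.
    probed.sort(key=lambda w: queue_lengths[w])
    chosen = probed[:num_tasks]

    # Record the placement so later jobs see the updated load.
    for w in chosen:
        queue_lengths[w] += 1
    return chosen


if __name__ == "__main__":
    queues = [5, 0, 3, 0, 7, 1, 0, 2]
    print(batch_sample(queues, num_tasks=2, probe_ratio=2))
```

Pooling the probes for all tasks of a job, rather than sampling separately per task, means one unlucky set of probes for a single task is less likely to delay the whole job.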
4
Q
What are the applications (Sparrow)?
A
The source does not explicitly describe real-world deployments of Sparrow. However, it suggests applications in low-latency, user-facing services such as language translation or online search.
5
Q
What are the strengths and weaknesses (Sparrow)?
A
Advantages:
- Decentralization: Easy to add more schedulers.
- Load Balancing: Uses batch sampling.
- Fault Tolerance: Frameworks using Sparrow can automatically detect scheduler failures and fail over to a backup scheduler in less than 120 ms.
- Scalability: Designed for scaling to large clusters. Simulation results suggest it performs well as cluster size increases, even with tens of thousands of cores.
- Achieves median response times within 12% of ideal.
- It supports policies and constraints needed by analytics frameworks.
Disadvantages: