System Design Flashcards by Sonia Dillane

Delivery Framework

Requirements (5 min)
Core Entities (2 min)
API (5 min)
(Optional) Data Flow (5 min)
High Level Design (10 - 15 min)
Deep Dives (10 min)

Delivery Framework: Requirements

Functional requirements (prioritize ~3)
Non-functional
(Optional) capacity estimation

Non-functional requirements

CAP Theorem (consistency or availability)
Environmental constraints
Scalability (reads vs writes, hot spots)
Latency
Durability
Security
Fault Tolerance

Metrics units

Thousand = kilo
Million = mega
Billion = giga
Trillion = tera
Quadrillion = peta

Common latencies

Reading 1 MB sequentially from memory = 0.25 ms
Reading 1 MB sequentially from SSD = 1 ms
Reading 1 MB sequentially from spinning disk = 20 ms
Round trip network latency CA to Netherlands = 150 ms

Common storage

2-hour movie = 1 GB
Small book of plain text = 1 MB
High-resolution photo = 1 MB
Medium-resolution image or web graphic = 100 KB

Common domain estimations

DAUs on a social media network = 1 B
Hours of video streamed on Netflix/day = 100 M
Google searches/second = 100 K
Size of Wikipedia = 100 GB
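
The figures on these cards are meant to be combined in capacity estimates. A minimal sketch of that arithmetic follows; the pairing of the Netflix figure with the "2-hour movie = 1 GB" figure is an assumption made for illustration, and all variable names are hypothetical.

```python
# Back-of-envelope arithmetic using the rough figures from the cards.
KILO, MEGA, GIGA, TERA, PETA = 10**3, 10**6, 10**9, 10**12, 10**15

SECONDS_PER_DAY = 24 * 60 * 60  # 86,400

# Google searches: roughly 100 K per second.
searches_per_day = 100 * KILO * SECONDS_PER_DAY

# Netflix: roughly 100 M hours streamed per day.
# Assumption: 1 GB per 2-hour movie, so half a GB per streamed hour.
bytes_per_hour = GIGA // 2
streamed_bytes_per_day = 100 * MEGA * bytes_per_hour

print(f"searches/day ~ {searches_per_day / GIGA:.2f} billion")
print(f"streamed/day ~ {streamed_bytes_per_day / PETA:.0f} PB")
```

Under these assumptions the estimate works out to roughly 8.64 billion searches and 50 PB of video per day.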

2 types of Scaling

horizontally - adding more machines
vertically - adding more resources to a single machine

Requirements for horizontal scaling

load balancer
load balancer strategy (round robin, queuing system, least connections, utilization-based)
try to partition data such that a single node has all the data it needs
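
Two of the load balancer strategies named above, round robin and least connections, can be sketched in a few lines. This is an illustrative in-process sketch, not how any particular load balancer is implemented; the class and method names are hypothetical.

```python
import itertools


class RoundRobin:
    """Hand out servers in a fixed rotation."""

    def __init__(self, servers):
        self._cycle = itertools.cycle(servers)

    def pick(self):
        return next(self._cycle)


class LeastConnections:
    """Send each request to the server with the fewest active connections."""

    def __init__(self, servers):
        self.active = {server: 0 for server in servers}

    def pick(self):
        # Ties go to the server that was listed first.
        server = min(self.active, key=self.active.get)
        self.active[server] += 1
        return server

    def release(self, server):
        # Call when a request finishes so the count stays accurate.
        self.active[server] -= 1
```

Round robin needs no state beyond a position in the rotation; least connections needs the balancer to be told when requests finish.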

Specialized Indexes

Use ElasticSearch
Types - geospatial, vector (find image or document), full-text (search document)
Set up ElasticSearch to index most databases using Change Data Capture (CDC)
Drawbacks - new failure point, new source of latency, stale data

Communication Protocols

Internally

HTTP(S) or gRPC

Communication Protocols

With Client

REST (Request -> Response)
Long polling
SSE (Server-Sent Events)
Websockets (Bi-directional Channel)

Long polling

Use when you need to give clients near-realtime updates
Client makes a request and the server holds the request open until it has data
Once the client gets a response, it makes another request
Works with standard load balancers and firewalls
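
The request/hold/re-request cycle above can be simulated in-process. This is a sketch only: a `queue.Queue` stands in for the server's pending updates, plain function calls stand in for HTTP requests, and the names `handle_poll` and `client_loop` and the status codes chosen are assumptions.

```python
import queue
import threading

updates = queue.Queue()  # hypothetical mailbox of updates waiting for one client


def handle_poll(timeout_s):
    """Server side: hold the request open until data arrives or the timeout hits."""
    try:
        return {"status": 200, "data": updates.get(timeout=timeout_s)}
    except queue.Empty:
        return {"status": 204, "data": None}  # nothing new; client should poll again


def client_loop(max_polls, timeout_s=0.5):
    """Client side: after every response, issue the next request."""
    received = []
    for _ in range(max_polls):
        response = handle_poll(timeout_s)
        if response["status"] == 200:
            received.append(response["data"])
    return received


# Demo: an update shows up 0.1 s after the client starts polling.
threading.Timer(0.1, updates.put, args=["price changed"]).start()
received = client_loop(max_polls=2)
print(received)
```

The first poll returns as soon as the update arrives; the second is held until the timeout and comes back empty, which is the cost long polling pays when updates are infrequent.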

Websockets

Use when you need realtime, bidirectional communication
Challenge - must maintain many long-lived open connections
Common pattern - a message broker handles communication with clients, and backend services talk only to the broker (centralizes the client connections)

Server Sent Events (SSE)

Use when client needs multiple updates from server
Requires single long-lived HTTP connection
Requires less specialized infrastructure than websockets
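
On the wire, an SSE stream is plain text sent over that single HTTP response (served as `text/event-stream`): each event is a few `field: value` lines ended by a blank line. A minimal sketch of producing that format follows; the helper names `format_sse` and `event_stream` are hypothetical, and no web framework is assumed.

```python
def format_sse(data, event=None, event_id=None):
    """Serialize one event in the text/event-stream format."""
    lines = []
    if event_id is not None:
        lines.append(f"id: {event_id}")
    if event is not None:
        lines.append(f"event: {event}")
    # Multi-line payloads are sent as one "data:" line per payload line.
    for chunk in str(data).split("\n"):
        lines.append(f"data: {chunk}")
    return "\n".join(lines) + "\n\n"  # the blank line terminates the event


def event_stream(updates):
    """Yield serialized events; a server would write these to the open response."""
    for i, update in enumerate(updates):
        yield format_sse(update, event="update", event_id=i)
```

Because this is ordinary HTTP with a text body, it needs no protocol upgrade, which is why it asks less of the infrastructure than websockets.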

Security

API Gateway for Authentication/Authorization
Encryption
Don’t pass userId or similar identity fields through endpoint paths or request bodies; they should be in headers

Search Optimized Database

Allows for full text search using indexing, tokenization, stemming
Inverted Index - index from word to document
Can configure whether fuzzy search is allowed
ElasticSearch
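
A toy version of the tokenize, stem, and index pipeline shows what an inverted index holds. This is a sketch under simplifying assumptions: the "stemmer" only strips a trailing "s", multi-word queries are treated as AND, and none of it reflects how ElasticSearch is implemented internally.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def stem(token):
    """Crude stand-in for a real stemmer: strip a trailing plural 's'."""
    if token.endswith("s") and len(token) > 3:
        return token[:-1]
    return token


def build_index(docs):
    """Map each stemmed word to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[stem(token)].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents that contain every term in the query."""
    terms = [stem(t) for t in tokenize(query)]
    if not terms:
        return set()
    return set.intersection(*(index.get(t, set()) for t in terms))


docs = {1: "Redis locks expire", 2: "A lock on a cart item", 3: "Cart items expire"}
index = build_index(docs)
print(search(index, "locks"))
```

Lookups go from word to documents, so a query never scans document text; stemming is what lets "locks" match a document that only says "lock".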

API Gateway

Routes requests to correct microservice
Authentication
Rate limiting
Logging
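
The card does not say how a gateway enforces rate limits. One common approach is a token bucket, sketched here purely as an illustration; the class name, parameters, and the explicit `now` argument (used instead of a real clock so the behavior is deterministic) are all assumptions.

```python
class TokenBucket:
    """Allow bursts up to `capacity`, refilled at `refill_per_s` tokens per second."""

    def __init__(self, capacity, refill_per_s, now=0.0):
        self.capacity = capacity
        self.refill_per_s = refill_per_s
        self.tokens = float(capacity)  # start with a full bucket
        self.last = now

    def allow(self, now):
        """Return True if a request arriving at time `now` may proceed."""
        elapsed = now - self.last
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_s)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False
```

A gateway would typically keep one bucket per client or API key and reject requests for which `allow` returns False.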

Load Balancer

Need a load balancer whenever you have multiple machines capable of handling the same request
Can leave it out of the box-and-arrow diagram and just mention it
AWS Elastic Load Balancer

When to use a Queue

Buffer for bursty traffic
Distribute work across a system

If there are strict latency requirements (< 500 ms), a queue will probably exceed them

Queues

Message Ordering - typically FIFO but can be priority
Retry configurations
Dead letter queue for debugging/auditing
Scaling with partitions (requires partition key)
Backpressure to slow down producers

AWS SQS
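
How FIFO ordering, retries, and a dead letter queue fit together can be shown with an in-memory sketch. A `deque` stands in for the queue; `drain` and `max_attempts` are hypothetical names, and this is not the AWS SQS API.

```python
from collections import deque


def drain(messages, handler, max_attempts=3):
    """Process (message, attempts) pairs in FIFO order.

    Failed messages are re-queued until they have failed `max_attempts`
    times, then moved to the returned dead letter list.
    """
    dead_letters = []
    while messages:
        message, attempts = messages.popleft()
        try:
            handler(message)
        except Exception as exc:
            attempts += 1
            if attempts >= max_attempts:
                # Keep the message and the error for debugging/auditing.
                dead_letters.append((message, repr(exc)))
            else:
                messages.append((message, attempts))
    return dead_letters
```

Note that re-queuing a failed message at the back breaks strict ordering for that message, which is one reason retry settings and ordering guarantees are configured together.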

When to use a Stream

Process large amounts of data in real-time (think analytics dashboard)
Support complex processing scenarios like event sourcing (think transactions at a bank)
Support multiple consumers reading from the same stream (think chat room)

Streams

Scaling with Partitioning
Multiple consumers
Replication
Windowing

Kinesis
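
Windowing groups stream events by time so they can be aggregated. A minimal sketch of one variant, fixed-size non-overlapping (tumbling) windows, follows; the function name and the `(timestamp, payload)` event shape are assumptions, and this is not the Kinesis API.

```python
from collections import Counter


def tumbling_window_counts(events, window_s):
    """Count events per fixed window, keyed by the window's start time."""
    counts = Counter()
    for timestamp, _payload in events:
        # Round the timestamp down to the start of its window.
        window_start = (timestamp // window_s) * window_s
        counts[window_start] += 1
    return dict(sorted(counts.items()))


events = [(0, "a"), (3, "b"), (10, "c"), (19, "d"), (20, "e")]
print(tumbling_window_counts(events, window_s=10))
```

An analytics dashboard would run this kind of aggregation continuously per partition rather than over a finished list.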

Distributed Lock

Need to lock a resource for a period of time (maybe 10 min)
Use distributed key-value store like Redis to create a hash map of item -> lock.
Only one system or process can lock the particular item at a time
Can set an expiration on the lock so if process crashes, item doesn’t get stuck in locked state

Think: item in inventory while in cart, assignment of driver to rider
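
The item -> lock map with expiration can be sketched with an in-process dictionary. This stand-in is not distributed; with Redis the acquire step is commonly a single `SET key token NX EX ttl` command. The class name, the owner token, and the explicit `now` argument are assumptions made for illustration.

```python
import uuid


class FakeLockStore:
    """In-process stand-in for a shared key-value store. NOT distributed."""

    def __init__(self):
        self._locks = {}  # item -> (owner_token, expires_at)

    def acquire(self, item, ttl_s, now):
        """Return an owner token on success, or None if the item is locked."""
        holder = self._locks.get(item)
        if holder is not None and holder[1] > now:
            return None  # someone else holds an unexpired lock
        token = uuid.uuid4().hex
        self._locks[item] = (token, now + ttl_s)
        return token

    def release(self, item, token):
        """Release only if the caller still owns the lock."""
        holder = self._locks.get(item)
        if holder is not None and holder[0] == token:
            del self._locks[item]
            return True
        return False
```

The owner token matters: without it, a process whose lock already expired could release a lock that now belongs to someone else.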

Cache Eviction Policy

Least Recently Used
FIFO
Least Frequently Used
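
As an illustration of one of these policies, a least-recently-used cache can be sketched with Python's `OrderedDict`. The class name and the choice to return `None` on a miss are assumptions.

```python
from collections import OrderedDict


class LRUCache:
    """Evict the least recently used key once capacity is exceeded."""

    def __init__(self, capacity):
        self.capacity = capacity
        self._items = OrderedDict()  # oldest use first, newest use last

    def get(self, key):
        if key not in self._items:
            return None
        self._items.move_to_end(key)  # reading counts as a use
        return self._items[key]

    def put(self, key, value):
        self._items[key] = value
        self._items.move_to_end(key)
        if len(self._items) > self.capacity:
            self._items.popitem(last=False)  # drop the least recently used
```

FIFO would skip the `move_to_end` call in `get`, and LFU would track a use count per key instead of recency.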

Cache Write Strategy

Write-through cache - writes data to both cache and database simultaneously
Write-around cache - writes only to the database (cached on the next get)
Write-back cache - writes to cache and then asynchronously to the DB (may lose data)

Redis
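
The three write strategies differ only in where a write lands first. A minimal sketch, assuming plain dictionaries as stand-ins for the cache and the database and an explicit `flush` in place of a background writer; all names are hypothetical.

```python
class Store:
    """Toy cache + database pair showing the three write strategies."""

    def __init__(self):
        self.cache, self.db, self.pending = {}, {}, []

    def write_through(self, key, value):
        # Both copies are updated before the write is acknowledged.
        self.cache[key] = value
        self.db[key] = value

    def write_around(self, key, value):
        # Only the database is written; drop any stale cached copy.
        self.db[key] = value
        self.cache.pop(key, None)

    def write_back(self, key, value):
        # Only the cache is written now; the database catches up later.
        self.cache[key] = value
        self.pending.append((key, value))

    def flush(self):
        # Stand-in for the asynchronous writer. Anything still pending
        # when the cache is lost never reaches the database.
        while self.pending:
            key, value = self.pending.pop(0)
            self.db[key] = value

    def get(self, key):
        # A miss falls back to the database and fills the cache.
        if key not in self.cache and key in self.db:
            self.cache[key] = self.db[key]
        return self.cache.get(key)
```

The window between `write_back` and `flush` is where the "may lose data" caveat comes from.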

System Design Flashcards

(27 cards)