System Design Flashcards

Question 1

Q

What are the key components of a system design interview?

Answer

A

Key components include gathering requirements, designing high-level architecture, choosing technologies, addressing scalability, and handling trade-offs. Example: For a URL shortener, define inputs (long URLs), outputs (short URLs), and scale (millions of users).

Question 2

Q

How do you design a scalable REST API?

Answer

A

Use a layered architecture: Load balancer → API servers (e.g., Node.js) → Database (e.g., PostgreSQL). Implement caching (Redis), rate limiting, and horizontal scaling. Example: Route requests via NGINX, cache responses, shard database.

Question 3

Q

What is the difference between horizontal and vertical scaling?

Answer

A

Horizontal scaling adds more servers (e.g., more API instances); vertical scaling increases a server’s resources (e.g., more CPU). Horizontal is preferred for distributed systems due to flexibility.

Question 4

Q

What is a load balancer, and how does it work?

Answer

A

A load balancer distributes traffic across servers to ensure availability. Example: NGINX routes requests to multiple Node.js instances using round-robin or least connections, improving throughput.

Question 5

Q

How do you design a database schema for a social media app?

Answer

A

Tables: Users (id, name), Posts (id, user_id, content), Comments (id, post_id, user_id). Use foreign keys for relationships, index user_id for fast queries. Normalize for consistency, denormalize for read-heavy loads.

Question 6

Q

What is the CAP theorem, and how does it affect system design?

Answer

A

CAP theorem states a distributed system can only guarantee two of: Consistency, Availability, Partition Tolerance. Example: Choose high availability (MongoDB) for social apps or consistency (MySQL) for banking.

Question 7

Q

How do you handle high traffic in a system?

Answer

A

Use load balancers, caching (Redis), CDNs for static assets, and database sharding. Example: Cache user profiles in Redis to reduce database load, scale API servers horizontally.

Question 8

Q

What is caching, and how is it implemented in a full-stack app?

Answer

A

Caching stores frequently accessed data for fast retrieval. Example: Use Redis to cache API responses in a Node.js backend, reducing database queries (O(1) lookup).

Question 9

Q

How do you design a rate limiter for an API?

Answer

A

Implement in-memory storage (Redis) to track requests per user/IP in a time window. Example: Allow 100 requests/hour per IP, reject excess with HTTP 429. Use sliding window for accuracy.

Question 10

Q

What is sharding, and when is it used?

Answer

A

Sharding splits a database into smaller, independent pieces (shards) based on a key (e.g., user_id). Used for large-scale data to improve performance. Example: Shard users by region.

Question 11

Q

How do you design a URL shortener service?

Answer

A

Components: API (POST /shorten, GET /:id), database (short_id, long_url), hash function (e.g., base62). Scale with sharding, caching redirects in Redis. Handle collisions with unique IDs.

Question 12

Q

What is a CDN, and how does it improve performance?

Answer

A

A CDN (Content Delivery Network) caches static content (e.g., images, CSS) on edge servers near users. Example: Cloudflare serves React app assets, reducing latency.

Question 13

Q

How do you ensure data consistency in a distributed system?

Answer

A

Use strong consistency (e.g., distributed locks, Paxos) for critical data or eventual consistency (e.g., DynamoDB) for high availability. Example: Bank transactions need strong consistency.

Question 14

Q

What is the difference between SQL and NoSQL in system design?

Answer

A

SQL (e.g., PostgreSQL) is structured, relational, best for complex queries; NoSQL (e.g., MongoDB) is flexible, scalable, ideal for unstructured data or high write loads.

Question 15

Q

How do you design a notification system?

Answer

A

Components: Queue (Kafka) for events, workers to process notifications, database for user preferences. Use WebSockets for real-time or email/SMS APIs. Scale with partitioning.

Question 16

Q

What is eventual consistency, and where is it used?

Answer

Study These Flashcards

A

Eventual consistency means updates propagate over time, prioritizing availability. Used in NoSQL (e.g., Cassandra) for apps like social media where slight delays are acceptable.

Question 17

Q

How do you handle database migrations in a production system?

Answer

Study These Flashcards

A

Use tools like Flyway or Liquibase for versioned migrations. Apply changes incrementally, test in staging, and use backward-compatible schemas to avoid downtime.

Question 18

Q

What is a microservices architecture, and what are its trade-offs?

Answer

Study These Flashcards

A

Microservices split an app into small, independent services (e.g., user service, payment service). Pros: scalability, flexibility. Cons: complexity, inter-service communication overhead.

Question 19

Q

How do you design a chat application?

Answer

Study These Flashcards

A

Components: WebSocket servers for real-time messaging, database (MongoDB) for message history, Redis for user presence. Scale with sharded message storage and load-balanced servers.

Question 20

Q

How do you optimize a system for low latency?

Answer

Study These Flashcards

A

Use caching (Redis), CDNs, database indexing, and asynchronous processing (queues). Example: Cache API results, use in-memory DB for hot data, optimize queries with indexes.

System Design Flashcards

(20 cards)