Monitoring Flashcards
(8 cards)
1
Q
What should a monitoring setup include?
A
Metrics Collection, Logging, Tracing, Alerts
2
Q
Metrics Collection
A
Track key system metrics: request latency, error rates, queue depth, DB read/write ops, CPU/memory usage.
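A minimal sketch of recording two of these metrics (request latency and error rate) in process, using only the standard library. `MetricsRegistry` and `timed_call` are hypothetical names for illustration; a real service would normally export to a metrics backend rather than hold samples in memory.

```python
import time
from collections import defaultdict


class MetricsRegistry:
    """Hypothetical in-memory registry (illustration only, not thread-safe)."""

    def __init__(self):
        self.counters = defaultdict(int)        # request and error counts
        self.latencies_ms = defaultdict(list)   # raw latency samples per endpoint

    def record_request(self, endpoint, latency_ms, ok):
        self.counters[f"{endpoint}.requests"] += 1
        if not ok:
            self.counters[f"{endpoint}.errors"] += 1
        self.latencies_ms[endpoint].append(latency_ms)

    def error_rate(self, endpoint):
        total = self.counters[f"{endpoint}.requests"]
        if total == 0:
            return 0.0
        return self.counters[f"{endpoint}.errors"] / total


def timed_call(registry, endpoint, handler):
    """Run handler(), recording its latency and whether it raised."""
    start = time.perf_counter()
    ok = True
    try:
        return handler()
    except Exception:
        ok = False
        raise
    finally:
        elapsed_ms = (time.perf_counter() - start) * 1000.0
        registry.record_request(endpoint, elapsed_ms, ok)
```

Usage: wrap each request handler, e.g. `timed_call(registry, "/orders", handle_order)`, then read `registry.error_rate("/orders")`.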
3
Q
Metrics Collection Tools
A
Prometheus, Datadog, CloudWatch (AWS).
4
Q
Logging
A
Emit structured logs from each service, with fields such as request ID, user ID, and endpoint so entries can be searched and correlated.
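One way to produce such logs, sketched with Python's standard `logging` and `json` modules. The JSON-lines format, the `JsonFormatter` and `build_logger` names, and the field names `request_id`, `user_id`, and `endpoint` are assumptions chosen for illustration.

```python
import json
import logging


class JsonFormatter(logging.Formatter):
    """Render each log record as a single JSON object."""

    # Context fields from the card's example; the exact names are illustrative.
    CONTEXT_FIELDS = ("request_id", "user_id", "endpoint")

    def format(self, record):
        payload = {
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        # Values passed via `extra=` become attributes on the record.
        for field in self.CONTEXT_FIELDS:
            value = getattr(record, field, None)
            if value is not None:
                payload[field] = value
        return json.dumps(payload)


def build_logger(name="orders-service", stream=None):
    """Return a logger that writes JSON lines (stream=None means stderr)."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.handlers.clear()
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger.addHandler(handler)
    logger.propagate = False
    return logger
```

Usage: `build_logger().info("order created", extra={"request_id": "req-123", "user_id": 42, "endpoint": "/orders"})` writes one JSON line that a log tool can index by field.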
5
Q
Logging Tools
A
ELK stack, CloudWatch Logs, or Datadog Logs.
6
Q
Tracing
A
Follows each request as it moves through microservices and visualizes the path it took and the time spent in each service.
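A toy sketch of the idea behind tracing: every unit of work is a span, and spans belonging to one request share a trace ID and point at their parent. The `Tracer` class here is hypothetical and single-process only; it is not the API of any tracing tool, and real systems also pass the trace ID between services.

```python
import time
import uuid
from contextlib import contextmanager


class Tracer:
    """Hypothetical toy tracer: collects finished spans in a list (not thread-safe)."""

    def __init__(self):
        self.finished = []   # completed spans, innermost first
        self._stack = []     # currently open spans, innermost last

    @contextmanager
    def span(self, name):
        parent = self._stack[-1] if self._stack else None
        record = {
            "name": name,
            # Reuse the parent's trace ID so all spans of one request link up.
            "trace_id": parent["trace_id"] if parent else uuid.uuid4().hex,
            "span_id": uuid.uuid4().hex[:16],
            "parent_id": parent["span_id"] if parent else None,
        }
        self._stack.append(record)
        start = time.perf_counter()
        try:
            yield record
        finally:
            record["duration_ms"] = (time.perf_counter() - start) * 1000.0
            self._stack.pop()
            self.finished.append(record)


tracer = Tracer()
with tracer.span("api-gateway"):
    with tracer.span("orders-service"):
        with tracer.span("db-query"):
            pass  # the actual work would happen here
```

After this runs, `tracer.finished` holds three spans with the same `trace_id`, and the `parent_id` links are enough to draw the request as a tree with per-span durations.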
7
Q
Tracing Tools
A
OpenTelemetry, Jaeger, or AWS X-Ray for distributed tracing.
8
Q
Alerts
A
Set thresholds (e.g. 95th percentile latency > 500 ms) and get notified via Slack or email when they are crossed.
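A minimal sketch of the threshold check itself, assuming latency samples in milliseconds and the nearest-rank definition of a percentile (monitoring tools may interpolate differently). The function names are hypothetical, and delivering the message to Slack or email is left out.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile: smallest sample with at least pct% of values at or below it."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def check_latency_alert(samples_ms, threshold_ms=500.0, pct=95):
    """Return an alert message if the percentile exceeds the threshold, else None."""
    value = percentile(samples_ms, pct)
    if value > threshold_ms:
        return f"ALERT: p{pct} latency {value:.0f} ms exceeds {threshold_ms:.0f} ms"
    return None
```

Usage: call `check_latency_alert(recent_samples)` on a sliding window of recent requests and forward any non-`None` result to the notification channel.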