Applying site reliability engineering principles to a service Flashcards

1
Q

What is the primary goal of site reliability engineering (SRE)?

A

The primary goal of SRE is to ensure that a service is highly available, performant, and scalable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the difference between availability and reliability?

A

Availability refers to the percentage of time a service is operational, while reliability refers to the ability of a service to perform its intended function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the “four nines” standard for availability?

A

The “four nines” standard for availability is 99.99%, which means that a service should be operational 99.99% of the time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the difference between a service level objective (SLO) and a service level agreement (SLA)?

A

A SLO is a target availability level set by the SRE team, while an SLA is a contract with an external customer or service provider.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the three main components of an site reliability engineering (SRE) workflow?

A

The three main components of an SRE workflow are incident management, service improvement, and capacity planning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is incident management in SRE?

A

Incident management in SRE is the process of identifying, triaging, and resolving service disruptions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is service improvement in SRE?

A

Service improvement in SRE is the process of identifying and addressing service-related issues to improve overall performance and reliability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is capacity planning in SRE?

A

Capacity planning in SRE is the process of forecasting and managing the resources needed to support a service’s growth and performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between reactive and proactive incident management?

A

Reactive incident management is the process of addressing service disruptions after they occur, while proactive incident management is the process of identifying and addressing potential service disruptions before they occur.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a post-incident review?

A

A post-incident review is a process of analyzing the root cause of a service disruption and identifying ways to prevent it from happening again.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a service-level indicator (SLI)?

A

A service-level indicator (SLI) is a metric used to measure the performance and availability of a service.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a service-level objective (SLO)?

A

A service-level objective (SLO) is a target availability level set by the SRE team.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a service-level agreement (SLA)?

A

A service-level agreement (SLA) is a contract with an external customer or service provider.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the difference between a service-level indicator (SLI) and a service-level agreement (SLA)?

A

A SLI is a metric used to measure the performance and availability of a service, while an SLA is a contract with an external customer or service provider.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a service-level target (SLT)?

A

A service-level target (SLT) is a specific availability level that a service is expected to meet, as defined by the SLO.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a service-level agreement violation (SLAV)?

A

A service-level agreement violation (SLAV) is when a service fails to meet its SLA.

17
Q

What is a service-level objective violation (SLOV)?

A

A service-level objective violation (SLOV) is when a service fails to meet its SLO.

18
Q

What is the difference between a service-level agreement violation (SLAV) and a service-level objective violation (SLOV)?

A

A SLAV is when a service fails to meet its SLA, while a SLOV is when a service fails to meet its SLO.