Building and operationalizing storage systems Flashcards

Chapter 2

1
Q

What is a managed database?

A

A managed database is one that does not require as much administration and operational support as an unmanaged database because Google will tae care of core operational tasks, such as creating databases, performing backups and updating the operating systems of database instances. Google also manages sclaing disks, configuring for failover, monitoring and authorizing network connections.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Configuration process for Cloud SQL databases include the specification of:

A
  1. An instance ID
  2. A password
  3. A region and zone
  4. A database version
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a read replica?

A

A read replica is a copy of the primary instance’s data that is maintained in the same region as the primary instance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Cloud Spanner?

A

Is Google’s relational, horizontal scalable, global database. It is a relational database, so it supports fixed schemas and is ANSI SQL 2011 compliant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How does cloud spanner keep al replicas sycnrhonized?

A

To keep all replicas synchronized, Cloud Spanner uses a voting mechanism to determine writes. Cloud spanner uses a voting mechanism to determine the latest write-in case of a conflict value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the three types of replicas in cloud spanner?

A
  1. Read-write replicas
  2. read-only replicas
  3. witness replicas
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why is designing tables for bigtbal fundamentally different form designing them for relational databses?

A
  1. Bigtable table are denormalized
  2. no support for joins in bigtable
  3. Data stored in bigtable lexicographyically by row-key
  4. Keeping related data in adjacent rows can help make reads more efficient.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are three characteristics of a good row-key?

A
  1. Use prefix for multitenancy
  2. Columns taht are not frequently updated
  3. Nonsqeuential value in teh first part of hte row-key
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Cloud Firestore?

A

Cloud Firestore is the managed document database that is replacing cloud datastore.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

when are document databases used?

A

Document databases are used when the structure of data can vary from one record to another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are features that cloud firestore has that are not previously available in Cloud Datastore?

A
  1. strongly consistent storage layer
  2. real-time updates
  3. mobile and web client libraries
  4. A new collection of document data model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the modes that cloud firestore can operate in?

A

(1) Native mode and (2) cloud datastore mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is datasotre mode for firestore? and what is native mode?

A

In datastore mode, firestoer operates like datastpre but uses the firestore storage system. This provides for strong consistency instead of eventual consistency. The new data model, real-time updates, and mobile and web client library features are available only in Native mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Cloud firestore in datastore mode uses a data model taht consistes of:

A
  1. entities,
  2. entity groups,
  3. properties,
  4. keys
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the two kinds of indexes that cloud firestore uses?

A
  1. built-in
  2. composite
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a firestore built-in index?

A

A built-in index is created by default for each property in an entity

17
Q

what is a firestore compostie index?

A

Composite indexes index multiple values of an entity

18
Q

What are some common tasks of BigQuery?

A
  1. Interacting with data sets
  2. importing and exporting data
  3. Streaming inserts
  4. Monitoring and logging
  5. Managing costs
  6. Optimizing tables and queries
19
Q

What is stackdriver monitoring?

A

Stackdriver is used for monitoring and logging in BigQuery. Stackdriver Monitoring provides performance metrics, such query counts and time ro run queries.

20
Q

Stackdriver monitoring collects metrics on a range of operations, including:

A
  1. number of scanned bytes
  2. query time
  3. slots allocated
  4. slots available
  5. number of tables in a dataset
  6. uploaded rows
21
Q

stackdriver logging tracks log entries taht describe events. These include:

A
  1. inserting, updating, patching and deleting tables
  2. inserting jobs
  3. executing queries
22
Q

What is cloud memorystore?

A

is a managed redis serivce which is commonly used for caching.

23
Q

What is a bucket?

A

A bucket is a group of objects that share access controls at the bucket level. Individual objects within buckets can have their own access control as well.

24
Q

Four tiers or types of storage:

A
  1. Regional
  2. Multi-regional
  3. Nearline
  4. Coldline
25
Q

What is regional storage?

A

stores multiple copies of an object in multiple zones in one region

26
Q

what is multi-regional storage

A

mitigates the risk of regional outage by storing replicas of objects in multiple regions. Also known as geo-redundant storage.

27
Q

Google offers two network tiers:

A
  1. Standard
  2. Premium
28
Q

What is the standard network tier?

A

Data is routed between regions using public internet infrastructure and is subject to network conditions and routing decisions beyond google’s control

29
Q

What is the premium network tier?

A

routes data over google’s global high speed network. Users of premium tier networking can expect lower latencies

30
Q

What are retention policies?

A

A retention policy uses a bucket lock feature of cloud storage buckets to enforce object retention. By setting a retention policy, you ensure that any object int eh bucket or future objects in the bucket are not deleted until they reach the age specified in the retention policy. This feature is particularly useful for compliance with government or industry regulations.

31
Q

When you manage your own databases, you will be responsible for an array of database and system administration tasks, including:

A
  1. Updating and patching the operating system
  2. Updating and patching the database system
  3. Backing up and, if needed, recovering data
  4. Configuring network access
  5. Managing disk space
  6. Monitoring databse performance and resource utilization
  7. Configuring for high availibility and managing failovers
  8. Configuring and managing read replicas
32
Q
A