Data Storage Flashcards

1
Q

What are GCP’s 6 main data storage services?

A
  1. Cloud Storage
  2. Cloud SQL
  3. Cloud Spanner
  4. Firestore
  5. Bigtable
  6. BigQuery
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Cloud Storage?

A

Cloud Storage is a storage service for objects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Cloud Storage’s storage capacity?

A

Unlimited

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Cloud Storage object and how large can it be?

A

An object is an immutable unit of data that can be in any file format. Every object has 2 components: object data and object metadata. Objects are very similar to Avro objects. An object can be up to 5 TB in size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the identifier for a Cloud Storage object?

A

An object’s sole identifier is the combination of its unique name (which is generated by the owner) and its generation number which is created by Cloud Storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How are Cloud Storage objects normally accessed?

A

Through HTTP GET requests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is object composition in the context of using Cloud Storage?

A

Object composition is the process of creating small objects out of chunks of data in parrallel, combining them together to create a single object, and then deleting the small temporary objects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Are Cloud Storage operations strongly consistent?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is object versioning?

A

Cloud versioning is a Cloud Storage feature that allows a replaced or deleted object to be retained as a noncurrent version by enabling object versioning. This allows you to keep a historical record of object changes and to also restore an object to an older state.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe the IAM roles inheritance sequence between projects, buckets, and objects.

A

Generally speaking, Cloud Storage buckets inherit IAM roles from projects, and Cloud Storage objects inherit IAM roles from buckets. Please keep in mind, there are ways to set IAM Roles at the object level. Look at the docs for details.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a Cloud Storage bucket?

A

A bucket is a container for objects. All buckets are associated with a project.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Are Bucket names required to be globally unique?

A

Yes. Consider using UUIDs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Bucket and object names appear in URLs, do not use sensitive information to create names.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the 4 different Cloud Storage classes?

A
  1. Standard
  2. Nearline
  3. Coldline
  4. Archive

Standard does not have a minimum storage duration and has the lowest operation fees.

Nearline has a 30 day minimum storage duration and has operation fees.

Coldline has a 90 day minimunm storage duration and has operation fees.

Archive has a 365 day minimum storage duration and has the most expensive operation fees.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Wile IAM can be used to manage access to buckets and objects by inheritance, it is not enough if you need granular control over individual objects. If you need to control access to individual objects, you can use object ACLs or Signed URLs (can be shared with anyone). ACLs are specific to Cloud Storage and not any other service ACLs are a legacy feature. Google Recommends using IAM unless there is a dire need to implement ACLs.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a bucket-lock?

A

A bucket-lock is a feature that allows you to lock-in the data retention policy, permanently preventing the policy from being reduced or remove for its duration. You can always, however, increase the retention policy after a bucket-lock.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What service can be used to move large amounts of data into Cloud Storage?

A

Storage Transfer Service

18
Q

What is Cloud SQL?

A

Cloud SQL is a managed service for MySQL, PostgreSQL, and SQL Server.

19
Q

What is Cloud SQL’s storage capacity?

A

64 TB. This is because Cloud SQL does not natively support autoscaling.

20
Q

Compute Engine VMs come preloaded with SQL Server, and licensing from Microsoft is included automatically.

A
21
Q

What is Cloud Spanner?

A

Cloud Spanner is a SQL relational database service that provides strong consistency for transactions at scale. It is ideal for mission critical OLTP (online transaction processing) applications.

22
Q

What is Cloud Spanner’s storage capaticy?

A

Petabytes

23
Q

What is Firestore?

A

Firestore is a NoSQL document database service. It is very similar to MongoDB.

24
Q

What is Firestore’s storage capacity?

A

Petabytes

25
Q

How large can a Firestore document be?

A

1 MB

26
Q

What are the 4 main differences between Firestore Native and Firestore Datastore?

A
  1. Native uses a document and collection data model, Datastore uses entities and kinds
  2. Native supports Firestore client libraries, Datastore supports Datastore client libraries
  3. Native supports offline data persistence (client libraries cache data and re-syncs when Firestore is back online), Datastore does not
  4. Native supports the ability to listen to documents for real-time updates, Datastore does not
27
Q

When does Google recommend using Firestore Native and when does it recommend using Firestore Datastore?

A

The information on the developer path recommends using Firestore Native when integrating with mobile/web client applications directly and Firestore Datastore when integrating with back-end servers directly. However, Cloud Console recommends using Fire Store Native for all new applications.

28
Q

Are Firestore Native and Firestore Datastore operations strongly consistent?

A

Yes

29
Q

What is Firestore Native and Firestore Datastore’s billing structure?

A

The same:

  1. Billed for each document read, write, and delete
  2. Billed for the amout of storage used
  3. Has a free daily quota for “default” database
30
Q

How important are indexes when using Firestore?

A

Extremely important. In fact, queries are not supported by indexes will fail. By default, all properties in a document are indexed. However, if you have a query that uses multiple property values, you will most likely need to creater a composite index to support it.

31
Q

What is Bigtable?

A

Bigtable is a NoSQL tabular database that can support billions of rows and thousands of columns.

32
Q

What is Bigtable’s storage capacity?

A

Petabytes

33
Q

How large can a Bigtable cell and row be?

A

Cell: 10 MB
Row: 100 MB

34
Q

What is BigQuery?

A

BigQuery is a data warehouse solution for analytics. It has built-in features like machine-learning, geospatial analysis, and business intelligence. It is ideal for OLAP (online analytical processing) workloads.

35
Q

What is Memorystore?

A

Memorystore is a distributed cache solution offered by Google. Memorystore uses Redis or Memcahced as its underlying technology.

36
Q

What is Google’s take on storing images in a database?

A

Google believes a database is normally not the right location to store images. Google suggests storing images and the like in Cloud Storage, which resembles a remote file system.

37
Q

What is the difference between Cloud Storage Uniform and Fine-grained access controls?

A

Uniform uses IAM policies at the bucket level, causing all objects to have the same access policies. Fine-grained uses ACLs which allows objects to have their own access policies.

38
Q

What is the purpose of Cloud SQL Auth Proxy and how does it work?

A

The purpose of Cloud SQL Auth Proxy is to provide a way to connection your application to Cloud SQL in a secure manner without having to configure IP addresses and SSL certificates. Essentially, you configure your application to connect to a local Auth Proxy and in turn, the Auth Proxy connects to Cloud SQL.

39
Q

What is AlloyDB?

A

AlloyDB is a fully-managed PostgreSQL database solution offered by Google. This service supports horizontal scaling.

Google claims AlloyDB is 4 times faster than standard PostgreSQL for transactional workloads and 100 times faster than standard PostgreSQL for analytical queries.

40
Q

What is the AlloyDB’s storage capacity?

A

For most regions: 32 TB
For some regions: 64 TB

41
Q

What is Memorystore’s storage capacity?

A

1 TB *

42
Q

What are the 2 available location modes for Spanner and what is their tradeoff?

A
  1. Single-region
  2. Multi-region