GCP Datastores Flashcards

1
Q

What is MemoryStore for Redis?

A

Google Cloud’s fully managed Redis service.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Cloud Storage?

A

Cloud Storage is a managed service for storing unstructured data as Objects/Blobs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the four storage options for Cloud Storage?

A

Standard - frequently accessed data

Nearline - infrequently accessed data (30d or less)

Coldline - infrequently accessed data (90d)

Archive - Rarely accessed data (1y)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Object Lifecycle Management?

A

A Cloud Storage feature that allows you to configure how long a bucket keeps data in a certain storage option.

Example: Changing from Standard to Coldline after 2 weeks of inactivity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a retention policy?

A

A retention policy is a feature that specifics how long data must be unchanged for.

It is applied retroactively on existing objects in a bucket.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a retention policy lock?

A

A guard that prevents a retention policy from being removed.

Once applied you cannot reduce or remove it, but you can increase it.

Furthermore, you cannot delete a locked bucket if it has objects that haven’t met the retention period.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Redis?

A

An open-source, in-memory data store, typically used for caching data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is BigQuery?

A

BigQuery is a serverless data warehouse with built-on ML/AI and BI.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is Object Versioning in Cloud Storage?

A

Object versioning is a Cloud Storage feature that tracks changes to objects within a bucket and allows admins to manage (restore, delete, etc) them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Firestore?

A

A serverless NoSQL database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Cloud Spanner?

A

A managed globally available, SQL/relationship database with 99.999% availability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Cloud SQL?

A

A fully managed, regionally available SQL database with support for MySQL, PostgreSQL and Microsoft SQL.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Filestore?

A

A fully managed shared filesystem from Google Cloud.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Persistent Disk?

A

Block storage for VM instances. They act as the instance’s data drive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How are objects stored in Cloud Storage?

A

They are stored in buckets as immutable objects.

A ‘change’ to an object just creates a completely new one.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the difference between Cloud SQL and Cloud Spanner?

A

Cloud Spanner is much more expensive as it offers high availability globally and massive amounts of data (PBs).

Cloud SQL is only available regionally and stores much less data (TB)

17
Q

What is Datastore?

A

The previous version of Firestore. It’s suitable for storing semi-structured data and has high consistency, but lacks Firestore’s real-time and offline capabilities.

18
Q

What is BigTable?

A

BigTable is a high performance, NoSQL database in the form of a sparsely populated table can scale up to millions of rows and thousands of columns.

It can store up to PBs of data.

19
Q

How can an application connect to a Cloud SQL instance?

A

Private IP
Cloud SQL Auth Proxy
Public IP (Cloud SQL Auth Proxy is required when using public IP)

20
Q

What is Cloud SQL Auth Proxy?

A

Cloud SQL Auth Proxy authorises and secures Cloud SQL connections by using IAM Permissions.

It validates IAM service accounts/users and secures the connection with a SSL/TLS layer. It is also the recommended way of connecting to a Cloud SQL Database because of the above.

21
Q

How you authorise to Cloud SQL?

A

Cloud SQL Language Connectors - Client libraries to connect a SQL DB

Cloud SQL Auth Proxy - Service to authorise and authenticate connections

Self-managed SSL/TLS certificates - User-issued certificates for data encryption

Authorized Networks - Add authorised IPs. (use with self-managed certs as a best practice)

22
Q

How much data does Cloud SQL store?

A

Upto 30TB of data.

23
Q

When should you use Cloud SQL?

A

When you want to:

  • Migrate local MySQL, PostgreSQL, and SQL Server databases.
  • To reduce maintenances costs for a pre-existing database
24
Q

When should you use Cloud Spanner?

A

When you want to:

  • Store massive amounts of data
  • Scale your database infinitely
  • Support global expansion
  • You need higher availability than Cloud SQL (three 9s vs .95)
25
Q

In Cloud SQL, when should you use Private IP connection over Public?

A

Private IP is the recommended choice as it’s the most secure (only clients on the same VPC can connect to it), but if the database consumers aren’t on a VPC, then use a public IP.

26
Q

In Cloud SQL, what do you need to configure in order to use Private IP?

A

Private Services Access as Cloud SQL cannot use Private Google Access.

27
Q

What is an ACL?

A

An Access Control List is a mechanism to define fine-grained access to Objects.
It is a legacy mechanism kept for interoperability with AWS, and uniform bucket-access (IAM) is recommended over it.

28
Q

What is Dataflow?

A

Dataflow is a fully-managed service for transforming and enriching data in stream (real-time) and batch modes. It provides a simplified pipeline development environment that uses the Apache Beam SDK to transform incoming data and then output the transformed data.

Source - https://cloud.google.com/dataflow/docs/tutorials/dataflow-stream-to-bigquery