DB and Storage Flashcards

1
Q

What is an example of unstructured data that organizations can capture from social media?
A. Post comments
B. Tagging
C. Profile picture
D. Location

A

A. Post comments

Post comments, like reviews and descriptions, are free-form text with no fixed structure. A suitable option for such data is Cloud Bigtable.

Photographs and files should be stored in Cloud Storage.

Regular table-type structured data should go to Cloud SQL or Cloud Spanner.

2
Q

An organization wants a cost-effective relational database.
Which Google Cloud service should the organization use?

A. Cloud Storage
B. BigQuery
C. Cloud SQL
D. Dataflow

A

C. Cloud SQL

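GCP has two options for a relational DB: Cloud SQL and Cloud Spanner.

Cloud SQL runs MySQL, PostgreSQL, and SQL Server databases, but its storage per instance is capped (10 TB in these notes; current limits are higher but still finite).
If you have a DB running on-premises, you can migrate it to Cloud SQL.

The other option is Cloud Spanner, which gives you much better performance and near-unlimited storage (beyond 10 TB). It costs more, so Cloud SQL is the cost-effective choice here.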

3
Q

Your organization is developing an application that will capture a large amount of data from millions of different sensor devices spread all around the world. Your organization needs a database that is suitable for worldwide, high-speed data storage of a large amount of unstructured data.
Which Google Cloud product should your organization choose?

A. Firestore
B. Cloud Data Fusion
C. Cloud SQL
D. Cloud Bigtable

A

D. Cloud Bigtable

The need is for unstructured data, so Cloud SQL and Cloud Spanner cannot be options.

The Q is about sensor/IoT data, and
Bigtable is designed for high-speed streaming data such as IoT.

4
Q

Your organization needs to analyze data in order to gather insights into its daily operations. You only want to pay for the data you store and the queries you perform. Which Google Cloud product should your organization choose for its data analytics warehouse?

A. Cloud SQL
B. Dataproc
C. Cloud Spanner
D. BigQuery

A

D. BigQuery

BigQuery is used for
- Data analysis of structured and unstructured data
- As a Data Warehouse
- As a Data Lake

You can also use Looker to build visualizations on the BigQuery data.

5
Q

Your organization is developing an application that will manage payments and online bank accounts located around the world. The most critical requirement for your database is that each transaction is handled consistently. Your organization anticipates almost unlimited growth in the amount of data stored.
Which Google Cloud product should your organization choose?

A. Cloud SQL
B. Cloud Storage
C. Firestore
D. Cloud Spanner

A

D. Cloud Spanner

The Q is asking to store bank account data, which is structured data and needs a SQL, relational DB.

So the options are Cloud SQL and Cloud Spanner.

Cloud SQL has a fixed per-instance storage cap (10 TB in these notes), so it cannot give you near-unlimited storage.

Hence the answer should be Cloud Spanner.
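
The elimination above can be sketched as a small rule-of-thumb function. This is only an illustration of the flashcard's reasoning, not a sizing guide: the function name, its parameters, and the default 10 TB cap (taken from the note above) are assumptions.

```python
def pick_relational_db(expected_tb: float, needs_global_scale: bool,
                       cloud_sql_cap_tb: float = 10.0) -> str:
    """Return the relational service suggested by the flashcard's rule of thumb.

    cloud_sql_cap_tb is the storage cap assumed in these notes, not an
    authoritative product limit.
    """
    if needs_global_scale or expected_tb > cloud_sql_cap_tb:
        return "Cloud Spanner"
    return "Cloud SQL"


# A small regional app with a modest database.
print(pick_relational_db(expected_tb=2, needs_global_scale=False))

# A worldwide payments system with almost unlimited growth.
print(pick_relational_db(expected_tb=500, needs_global_scale=True))
```

The first call falls under the cap and stays on Cloud SQL; the second exceeds it and needs global scale, so it lands on Cloud Spanner.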

6
Q

Your organization wants an economical solution to store data such as files, graphical images, and videos and to access and share them securely.
Which Google Cloud product or service should your organization use?

A. Cloud Storage
B. Cloud SQL
C. Cloud Spanner
D. BigQuery

A

A. Cloud Storage

The need is to store PDFs, Word documents, images, videos, etc. The best option for such storage is Cloud Storage. Think of it as being like Google Drive.

7
Q

How would a global organization benefit from managing their data with Cloud Spanner?

A. Cloud Spanner is optimized for cold storage

B. Cloud Spanner replicates data across regions in real time

C. Cloud Spanner is optimized to ingest unstructured data

D. Cloud Spanner visualizes and analyzes data in real time

A

B. Cloud Spanner replicates data across regions in real time

A: Cold storage is for CloudStorage
C: This is Cloud BigTable
D: This is Looker

8
Q

Your organization wants to migrate its data management solutions to Google Cloud because it needs to dynamically scale up or down and to run transactional SQL queries against historical data at scale. Which Google Cloud product or service should your organization use?

A. BigQuery
B. Cloud Bigtable
C. Pub/Sub
D. Cloud Spanner

A

D. Cloud Spanner

We are talking about TRANSACTIONAL SQL queries which means a relational DB like CloudSQL or CloudSpanner

Cloud SQL is not elastic and will not dynamically scale up or down. It runs a fixed number of servers.

BigQuery is for Analysis DB like Data Warehouse or Data Lake. It will also not provide transactional DB.

BigTable is for NoSQL, Unstructured DB

Pub/Sub is a messaging service and not a DB

9
Q

Your organization wants to predict the behavior of visitors to its public website. To do that, you have decided to build a machine learning model. Your team has database-related skills but only basic machine learning skills, and would like to use those database skills.
Which Google Cloud product or feature should your organization choose?

A. BigQuery ML
B. LookML
C. TensorFlow
D. Cloud SQL

A

A. BigQuery ML

The important point is that your team knows databases but not ML.

BigQuery ML lets you create and run machine learning models in BigQuery using standard SQL queries.

There is no such ready-made ML facility for Cloud SQL.

Your team would need ML knowledge to use TensorFlow.
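
To make "ML using standard SQL" concrete, here is a minimal sketch that only builds a BigQuery ML `CREATE MODEL` statement as a string. The dataset, table, and column names are hypothetical, and logistic regression is just one model type chosen for illustration. Nothing is sent to BigQuery here; the printed statement would be run like any other query.

```python
def build_create_model_sql(model: str, source_table: str, label_col: str) -> str:
    """Build a BigQuery ML CREATE MODEL statement as a plain SQL string."""
    return (
        f"CREATE OR REPLACE MODEL `{model}`\n"
        f"OPTIONS (model_type = 'logistic_reg', "
        f"input_label_cols = ['{label_col}']) AS\n"
        f"SELECT * FROM `{source_table}`"
    )


sql = build_create_model_sql(
    model="web_analytics.visitor_model",   # hypothetical dataset.model
    source_table="web_analytics.visits",   # hypothetical training table
    label_col="made_purchase",             # hypothetical label column
)
print(sql)
```

The whole model definition is a single SQL statement, which is why a team with database skills can use it without writing ML code.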

10
Q

Which Google Cloud service or feature lets you build machine learning models using Standard SQL and data in a data warehouse?

A. BigQuery ML
B. TensorFlow
C. AutoML Tables
D. Cloud Bigtable ML

A

A. BigQuery ML

BigQuery ML lets you create and run machine learning models in BigQuery using standard SQL queries.

There is no such ready-made ML facility for Cloud SQL.

TensorFlow requires ML knowledge rather than SQL.

11
Q

A global organization is developing an application to manage payments and online bank accounts in multiple regions. Each transaction must be handled consistently in their database, and they anticipate almost unlimited growth in the amount of data stored.
Which Google Cloud product should the organization choose?

A. Cloud SQL
B. Cloud Spanner
C. Cloud Storage
D. BigQuery

A

B. Cloud Spanner

The keywords are bank-account-type data and unlimited DB size.

Bank account data needs a relational DB, which means Cloud SQL or Cloud Spanner.

Cloud SQL has a fixed storage cap (10 TB in these notes), so it can’t be used when you need unlimited storage.

12
Q

An organization wants to use BigQuery data analytics to understand their website performance, but wants to move only some data into the cloud.
Which environment should the organization use?

A. Private cloud
B. On-premises
C. Multi-cloud
D. Hybrid cloud

A

D. Hybrid cloud

The Q says that you need to move only part of the data from on-premises to the cloud.
That means some data stays on-premises, so you need a setup that connects on-premises and cloud.

Hybrid cloud lets you establish a “connection” between cloud and on-premises.

Multi-cloud always includes more than one public cloud service. It can also include private clouds.

13
Q

An organization has created an ecommerce website. What data on this website would be considered structured data?

A) Product ratings score
B) Product reviews
C) Product photographs
D) Product descriptions

A

A) Product ratings score

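A ratings score is a number with a fixed type and range, so it fits neatly into a table column.

Reviews and descriptions are free-form text with no fixed structure. A suitable option for such data is Cloud Bigtable.

Photographs and files should be stored in Cloud Storage.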
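
A minimal sketch of the difference, using made-up sample data: a rating has a fixed schema that can be aggregated directly, while a review is free text with no fields to query. The class and field names are illustrative assumptions.

```python
from dataclasses import dataclass


@dataclass
class ProductRating:
    """Structured: every record has the same typed fields."""
    product_id: int
    score: int  # fixed type and range, e.g. 1 to 5


ratings = [ProductRating(101, 5), ProductRating(101, 3), ProductRating(101, 4)]

# Structured data can be aggregated directly, like a SQL AVG() over a column.
average = sum(r.score for r in ratings) / len(ratings)
print(average)  # 4.0

# Unstructured: free-form text with no fields; it needs text processing
# before anything can be computed from it.
review = "Arrived late, but the quality is great and I would buy it again."
print(len(review.split()), "words, no schema")
```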

14
Q

What is the difference between Standard and Coldline storage?

A. Coldline storage is for data for which a slow transfer rate is acceptable.

B. Standard and Coldline storage have different durability guarantees.

C. Standard and Coldline storage use different APIs.

D. Coldline storage is for infrequently accessed data.

A

D. Coldline storage is for infrequently accessed data.

If you need to store data that is accessed very frequently, Standard storage will be cheaper because it has no retrieval fee.

The 4 categories are:
Standard (no minimum storage duration)
Nearline (charged for min 30 days)
Coldline (charged for min 90 days)
Archive (charged for min 365 days. The data is still accessible online, but think of this as the replacement for data on tape kept in a safe box)

The cost of storage decreases from Standard to Archive, while the cost of accessing the data increases.
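
As a rough sketch of how the classes line up with access frequency, the helper below picks the coldest class whose minimum storage duration fits the gap between accesses. The minimum durations are the documented Cloud Storage values; the function name and the "days between accesses" heuristic are illustrative assumptions, and real choices should also weigh retrieval fees.

```python
# Minimum storage duration in days, ordered from hottest to coldest class.
MIN_STORAGE_DAYS = {
    "Standard": 0,
    "Nearline": 30,
    "Coldline": 90,
    "Archive": 365,
}


def suggest_storage_class(days_between_accesses: int) -> str:
    """Pick the coldest class whose minimum duration fits the access gap."""
    choice = "Standard"
    for name, min_days in MIN_STORAGE_DAYS.items():
        if days_between_accesses >= min_days:
            choice = name
    return choice


for gap in (7, 45, 120, 400):
    print(gap, "days ->", suggest_storage_class(gap))
```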

15
Q

Your company’s development team is building an application that will be deployed on Cloud Run. You are designing a CI/CD pipeline so that any new version of the application can be deployed in the fewest number of steps possible using the CI/CD pipeline you are designing. You need to select a storage location for the images of the application after the CI part of your pipeline has built them.
What should you do?

A. Create a Compute Engine image containing the application

B. Store the images in Container Registry

C. Store the images in Cloud Storage

D. Create a Compute Engine disk containing the application

A

B. Store the images in Container Registry

The “images” here are container images, not pictures. Cloud Run deploys containers, so the CI step should push the built image to a container registry, and Cloud Run can then deploy it directly from there.

Cloud Storage is for storing documents, pictures, PDFs, etc.

Compute Engine images and disks are for virtual machines, not for Cloud Run.

16
Q

Your organization is migrating to Google Cloud. As part of that effort, it needs to move terabytes of data from on-premises file servers to Cloud Storage. Your organization wants the migration process to be automated and to be managed by Google. Your organization has an existing Dedicated Interconnect connection that it wants to use. Which Google Cloud product or feature should your organization use?

A. Storage Transfer Service
B. Migrate for Anthos
C. BigQuery Data Transfer Service
D. Transfer Appliance

A

A. Storage Transfer Service

The Q asks for a transfer that is automated, managed by Google, and uses the existing Dedicated Interconnect connection. Storage Transfer Service moves data over the network, including from on-premises file servers, so it fits.

Transfer Appliance is a physical device, like a very large USB drive: you load your data onto it, and Google then uploads it to your cloud. It does not use the Interconnect, so it is meant for cases where the network is too slow (for example, the transfer would take more than a week).

If you need to transfer data between clouds, Storage Transfer Service is also the better option.

17
Q

Your organization is migrating to Google Cloud. As part of that effort, it needs to move terabytes of data from on-premises file servers to Cloud Storage. Your organization has calculated that with the current internet speed, it will take them 3 weeks to transfer the data but they want to transfer it within a week. Which Google Cloud product or feature should your organization use?

A. Storage Transfer Service
B. Migrate for Anthos
C. BigQuery Data Transfer Service
D. Transfer Appliance

A

D. Transfer Appliance

If you have more than 10 TB of data, or it will take more than a week to transfer, you can use Transfer Appliance. This is like a very large USB drive onto which you load your data. Google then uploads it to your cloud.

If you have less than 10 TB, or you think the data can be transferred in a few hours or days, go with Storage Transfer Service, which is like doing FTP to the cloud.

If you need to transfer data between clouds, Storage Transfer Service is also the better option.
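
The "how long will it take" estimate behind this rule of thumb is simple arithmetic: data size in bits divided by link speed. The sketch below assumes the link is fully available for the transfer; the function names, the 22 TB and 100 Mbps figures, and the one-week deadline are made-up illustration values.

```python
def transfer_days(data_tb: float, bandwidth_mbps: float) -> float:
    """Estimate days to move data_tb terabytes over a bandwidth_mbps link."""
    bits = data_tb * 1e12 * 8                 # decimal terabytes to bits
    seconds = bits / (bandwidth_mbps * 1e6)   # megabits per second to bits/s
    return seconds / 86_400                   # seconds per day


def suggest_transfer_option(data_tb: float, bandwidth_mbps: float,
                            deadline_days: float) -> str:
    """Apply the flashcard's rule: too slow over the network means appliance."""
    if transfer_days(data_tb, bandwidth_mbps) > deadline_days:
        return "Transfer Appliance"
    return "Storage Transfer Service"


# About 3 weeks over the wire, but the deadline is 1 week.
print(round(transfer_days(22, 100), 1), "days")
print(suggest_transfer_option(22, 100, deadline_days=7))
```

With these numbers the online transfer takes roughly 20 days, which misses the one-week deadline, so the sketch lands on Transfer Appliance, matching the answer to this card.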

18
Q

Your organization is migrating to Google Cloud. As part of that effort, it needs to move terabytes of data from Amazon S3, which is a cloud storage service, to Google Cloud Storage. Which Google Cloud product or feature should your organization use?

A. Storage Transfer Service
B. Migrate for Anthos
C. BigQuery Data Transfer Service
D. Transfer Appliance

A

A. Storage Transfer Service

If you have more than 10 TB of data, or it will take more than a week to transfer, you can use Transfer Appliance. This is like a very large USB drive onto which you load your data. Google then uploads it to your cloud.

If you have less than 10 TB, or you think the data can be transferred in a few hours or days, go with Storage Transfer Service, which is like doing FTP to the cloud.

If you need to transfer data between clouds, as in this Q, Storage Transfer Service is the better option.

19
Q

Your team is publishing research results and needs to make large amounts of data available to other researchers within the professional community and the public at minimum cost.
How should you host the data?

A. Use a Cloud Storage bucket and enable “Requester Pays”

B. Use a Cloud Storage bucket and provide Signed URLs for the data files.

C. Use a Cloud Storage bucket and set up a Cloud Interconnect connection to allow access to the data.

D. Host the data on-premises, and set up a Cloud Interconnect connection to allow access to the data.

A

A. Use a Cloud Storage bucket and enable “Requester Pays”

The Q is really about who pays the network cost for the data that people will download.

Cloud Storage has an option that makes the people who download your data pay for it. This is called the “Requester Pays” option.

20
Q

How would an organization benefit from using Looker?

A. Optimal identity and access management

B. Leading serverless warehousing technology

C. Robust data roll-back accuracy

D. Advanced business intelligence and analytics

A

D. Advanced business intelligence and analytics

Looker is a BI application. You can use it to make reports, graphs on data from BigQuery or any other Database

21
Q

An organization wants to transform multiple types of structured and unstructured data in the cloud from various sources. The data must be readily accessible for analysis and insights.
Which cloud data storage system should the organization use?

A. Relational database
B. Private data center
C. Data field
D. Data warehouse

A

D. Data warehouse

A Data Warehouse or Data Lake can take data from various sources (DBs, files, storage, etc.) and analyse it.
You can use structured as well as unstructured data.

BigQuery is the service you can use for this. You can also use Looker to create graphs and reports on this data.

22
Q

An organization recently launched a virtual customer support agent, generating vast amounts of text and speech data.
Why should they use a cloud data warehouse to interpret this data?

A. To natively visualize both types of data using a dashboard in real time

B. To ingest and analyze structured and unstructured data at scale, in real time

C. To secure data transmission between cloud and on-premises environments

D. To transform data from structured to unstructured

A

B. To ingest and analyze structured and unstructured data at scale, in real time

A is Looker, a BI solution

23
Q

An organization needs to store structured, semi-structured, and unstructured data in its raw, native format in the same repository.
Which cloud data management solution should the organization use?
A. Data field
B. Data lake
C. Database
D. Data warehouse

A

B. Data lake

24
Q

You have many terabytes of data residing within BigQuery. Your organization has asked you to analyze the data using machine learning.

Which choice would be ideal for this scenario?

A. Dataproc

B. DataFusion

C. BigQuery ML

D. Vertex AI

A

C. BigQuery ML