GOOGLE CLOUD PLATFORM DATA ENGINEER Flashcards

(30 cards)

1
Q

What is the definition of “Cloud”?

a) Place in the cloud where data is stored.

b) Hardware component inside the computer.

c) Type of malware.

d) Ability to store data and access programs over the internet rather than on
local devices.

A

D

Ability to store data and access programs over the internet rather than on
local devices.

2
Q

What type of data does BigQuery store?

a) Flat files.
b) Machine learning models.
c) Application logs.
d) Structured tabular data.

A

D

Structured tabular data.

3
Q

Which tool helps orchestrate data pipelines on Google Cloud?

a) Cloud Composer
b) Data Catalog
c) Looker
d) Firestore

A

A

Cloud Composer
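Cloud Composer is a managed Apache Airflow service, and Airflow expresses a pipeline as a directed acyclic graph (DAG) of tasks. The sketch below is not Airflow code; it only illustrates the idea of deriving a run order from task dependencies using the Python standard library. The task names are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each task maps to the set of tasks it depends on.
pipeline = {
    "extract": set(),
    "transform": {"extract"},
    "load": {"transform"},
    "report": {"load"},
}


def run_order(dag):
    """Return one execution order that respects every dependency."""
    return list(TopologicalSorter(dag).static_order())


order = run_order(pipeline)
print(order)
```

An orchestrator adds scheduling, retries and monitoring on top of this ordering, which is the part a managed service takes off your hands.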

4
Q

What type of cloud service allows users to have a complete development
environment to deploy and manage applications without worrying about the
underlying infrastructure?

a) IaaS (Infrastructure as a Service).
b) PaaS (Platform as a Service).
c) SaaS (Software as a Service).
d) DaaS (Data as a Service).

A

B

PaaS (Platform as a Service).

5
Q

What is the main library for model training in Google Cloud?

a) TensorFlow
b) Pandas
c) Scikit-learn
d) Numpy

A

A

TensorFlow

6
Q

What is the main goal of data engineering on Google Cloud?

a) Building mobile apps
b) Training machine learning models
c) Creating UI components
d) Designing and managing scalable data processing systems

A

D

Designing and managing scalable data processing systems

7
Q

Which Google Cloud service allows you to create analytical dashboards?

a) Looker Studio.
b) BigQuery.
c) Cloud Functions.
d) AI Platform.

A

A

Looker Studio.

8
Q

Which service is used to implement data pipelines in Google Cloud?

a) Cloud Functions.
b) BigQuery ML.
c) Cloud Dataflow.
d) AI Platform Training.

A

C

Cloud Dataflow.

9
Q

What does Cloud Pub/Sub do?

a) Processes batch data
b) Provides SQL-like querying
c) Ingests and delivers event streams
d) Hosts relational databases

A

C

Ingests and delivers event streams
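To make the publish/subscribe idea concrete, here is a toy in-memory model, assuming nothing about the real Pub/Sub client library. The `Topic` class and the subscription names are hypothetical; the point is only that publishers and subscribers are decoupled and that every subscription gets its own copy of each message.

```python
from collections import deque


class Topic:
    """Toy in-memory topic: every subscription receives every message."""

    def __init__(self):
        self.subscriptions = {}

    def subscribe(self, name):
        # Each subscription keeps its own independent queue.
        self.subscriptions[name] = deque()

    def publish(self, message):
        # Fan the message out to all current subscriptions.
        for queue in self.subscriptions.values():
            queue.append(message)

    def pull(self, name):
        # Return the oldest undelivered message, or None if the queue is empty.
        queue = self.subscriptions[name]
        return queue.popleft() if queue else None


topic = Topic()
topic.subscribe("dashboard")
topic.subscribe("archiver")
topic.publish({"sensor": "s1", "temp_c": 21.5})
```

Pulling from `"dashboard"` does not consume the message held for `"archiver"`, which is what lets several downstream systems read the same event stream independently.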

10
Q

An organization has servers running critical workloads in its facilities around
the world. It wants to be able to manage these workloads in a uniform,
centralized manner, with basic infrastructure management.
What should the organization do?

a) Migrate the workloads to a central office building.
b) Migrate the workloads to multiple joint local facilities.
c) Migrate the workloads to a public cloud.
d) Migrate the workloads to multiple local private clouds.

A

C

Migrate the workloads to a public cloud.

11
Q

What does the term “multitenancy” mean in cloud computing?

a) The installation of multiple operating systems on a single server.

b) The creation of multiple backup copies of a file.

c) The ability to access the cloud from multiple devices at the same time.

d) The ability of a cloud service to serve multiple users or clients independently within a single infrastructure.

A

D

The ability of a cloud service to serve multiple users or clients independently within a single infrastructure.

12
Q

What are the cloud categories according to the role and control exercised by
user and provider?

a) Private, Public and Hybrid.
b) IaaS - PaaS - SaaS.
c) Compute Engine, App Engine and Kubernetes Engine.
d) Agile, DevOps and Containers.

A

A

Private, Public and Hybrid.

13
Q

In the context of IaaS (Infrastructure as a Service), what responsibility
typically falls on the customer?

a) Physical hardware maintenance.
b) Network and virtual server management.
c) Configuration and management of the operating system and applications.
d) Network and storage infrastructure.

A

C

Configuration and management of the operating system and applications.

14
Q

What format is recommended for storing large datasets in Google Cloud?

a) Parquet
b) JSON
c) CSV
d) TXT

A

A

Parquet

15
Q

Which Google Cloud resource is included in the IaaS (Infrastructure as a
Service) model?

a) Google Cloud Functions.
b) Google Compute Engine.
c) Google Dataflow.
d) Google App Engine.

A

B

Google Compute Engine.

16
Q

Which storage class in Cloud Storage is best for long-term, infrequently
accessed data?

a) Nearline
b) Standard
c) Archive
d) Coldline

A

C

Archive

Nearline > 1 month
Coldline > 3 months
Archive > 1 year
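The rule of thumb in the note on this card can be sketched as a small helper. The function name is hypothetical, and the day counts (30, 90, 365) are an assumed translation of "1 month", "3 months" and "1 year"; treat it as a study aid, not as pricing guidance.

```python
def pick_storage_class(days_between_accesses):
    """Suggest a Cloud Storage class from how rarely the data is read.

    Thresholds follow the card's note; the exact day counts are assumptions.
    """
    if days_between_accesses > 365:
        return "ARCHIVE"
    if days_between_accesses > 90:
        return "COLDLINE"
    if days_between_accesses > 30:
        return "NEARLINE"
    return "STANDARD"


for days in (7, 45, 120, 400):
    print(days, pick_storage_class(days))
```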

17
Q

Which service is best for running Apache Spark or Hadoop jobs on Google
Cloud?

a) Dataproc
b) Cloud SQL
c) BigQuery
d) Cloud Run

A

A

Dataproc

18
Q

What is Google Cloud’s operations suite (Google Cloud Observability)?

a) A massive data processing platform.
b) A cloud application monitoring and diagnostics service.
c) A real-time messaging system.
d) A relational database service.

A

B

A cloud application monitoring and diagnostics service.

19
Q

Which service allows scaling deep learning models in Google Cloud?

a) AI Platform Training.
b) BigQuery.
c) Vertex AI.
d) Cloud Run.

20
Q

Which of the following is a benefit of using serverless technologies in data
engineering?

a) More manual control.
b) Lower developer productivity.
c) Automatic scaling and reduced ops overhead.
d) Increased hardware costs.

A

C

Automatic scaling and reduced ops overhead.

21
Q

What is digital transformation?

a) The process of converting physical files to digital.

b) A type of photo editing software.

c) The integration of digital technologies to improve processes, increase
efficiency and offer new value propositions to customers.

d) The creation of mobile applications for all companies.

A

C

The integration of digital technologies to improve processes, increase
efficiency and offer new value propositions to customers.

22
Q

In one organization, updates to virtual machine-based applications take a
long time to complete due to operating system boot times.
What should the organization do to speed up its application upgrades?

a) Migrate the virtual machines to the cloud and add more resources to them.

b) Increase virtual machine resources.

c) Automate application update deployments.

d) Convert the applications in the virtual machines to container-based
applications.

A

D

Convert the applications in the virtual machines to container-based
applications.

23
Q

You are deploying 10,000 new Internet of Things (IoT) devices to collect
temperature data in your warehouses globally. You need to process, store
and analyze these very large datasets in real time.
What should you do?

a) Send the data to Google Cloud Pub/Sub, stream Cloud Pub/Sub to Google
Cloud Dataflow, and store the data in Google BigQuery.

b) Send the data to Google Cloud Datastore and then export to BigQuery.

c) Send the data to Cloud Storage and then spin up an Apache Hadoop cluster as needed in Google Cloud Dataproc whenever analysis is required.

d) Export logs in batch to Google Cloud Storage and then spin up a Google
Cloud SQL instance, import the data from Cloud Storage, and run an
analysis as needed.

A

A

Send the data to Google Cloud Pub/Sub, stream Cloud Pub/Sub to Google
Cloud Dataflow, and store the data in Google BigQuery.

24
Q

An organization is developing an application that will capture a large amount
of data from millions of sensors distributed around the world.
The organization needs a database that is suitable for high-speed storage of
unstructured data. Which Google Cloud product should this organization
choose?

a) Cloud Firestore
b) Cloud Bigtable
c) Cloud Data Fusion
d) Cloud SQL

A

B

Cloud Bigtable

25
Q

What does “scalability” mean in the context of cloud computing?

a) The ability to measure server utilization.
b) The ability to increase or decrease computing resources according to the needs of running processes.
c) A mechanism to increase the security of the networks used in the platform.
d) A tool for debugging container-based applications.

A

B

The ability to increase or decrease computing resources according to the needs of running processes.

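As a rough illustration of scaling resources up and down with demand, the sketch below computes how many instances a workload would need. The function, its parameters and the default limits are all hypothetical; real autoscalers use richer signals, but the arithmetic is the same idea.

```python
import math


def desired_instances(requests_per_second, capacity_per_instance,
                      min_instances=1, max_instances=10):
    """Instances needed to serve the load, clamped to the allowed range."""
    needed = math.ceil(requests_per_second / capacity_per_instance)
    return max(min_instances, min(max_instances, needed))


# Load rises and falls; the instance count follows it within the limits.
for load in (20, 450, 5000):
    print(load, desired_instances(load, capacity_per_instance=100))
```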
26
Q

When we talk about PaaS...

a) ... we are referring to cloud services with a focus on hardware virtualization.
b) ... we are referring to cloud services that integrate applications with the infrastructure layer.
c) ... is not part of Cloud Computing.
d) ... we are referring to cloud services that abstract the applications from the infrastructure layer.

A

D

... we are referring to cloud services that abstract the applications from the infrastructure layer.

27
Q

What is the role of Apache Beam in Cloud Dataflow?

a) It provides a programming model for building portable data pipelines
b) It manages Pub/Sub message flow
c) It clusters datasets in BigQuery
d) It secures access to Dataflow jobs

A

A

It provides a programming model for building portable data pipelines

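"Portable" here means the pipeline is defined once and can be executed by different runners. The sketch below is plain Python, not the Apache Beam API: `PIPELINE`, `run_eager` and `run_lazy` are hypothetical stand-ins that show one pipeline definition executed by two different engines with the same result.

```python
def to_celsius(reading):
    """Add a Celsius field computed from the Fahrenheit one."""
    return {**reading, "temp_c": round((reading["temp_f"] - 32) * 5 / 9, 1)}


def is_hot(reading):
    return reading["temp_c"] > 30


# The pipeline is only a description of steps; it does not say how to run them.
PIPELINE = [("map", to_celsius), ("filter", is_hot)]


def run_eager(pipeline, records):
    """Hypothetical batch runner: materializes a full list after each step."""
    data = list(records)
    for kind, fn in pipeline:
        if kind == "map":
            data = [fn(r) for r in data]
        else:
            data = [r for r in data if fn(r)]
    return data


def run_lazy(pipeline, records):
    """Hypothetical streaming-style runner: processes one record at a time."""
    stream = iter(records)
    for kind, fn in pipeline:
        stream = map(fn, stream) if kind == "map" else filter(fn, stream)
    return stream


readings = [{"id": "a", "temp_f": 68.0}, {"id": "b", "temp_f": 95.0}]
batch_result = run_eager(PIPELINE, readings)
stream_result = list(run_lazy(PIPELINE, readings))
print(batch_result == stream_result)
```

In Beam the same separation holds: the pipeline code stays the same, and Dataflow is one of the runners that can execute it.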
28
Q

What are the cloud categories according to the type of services offered?

a) IaaS (Innovation) - PaaS (Processes) - SaaS (Solutions).
b) Compute Engine, App Engine and Kubernetes Engine.
c) IaaS (Infrastructure) - PaaS (Platform) - SaaS (Software).
d) Private, Public and Hybrid.

A

C

IaaS (Infrastructure) - PaaS (Platform) - SaaS (Software).

29
Q

Which storage system provides strong consistency and supports SQL queries across globally distributed data?

a) BigQuery.
b) Cloud Spanner.
c) Bigtable.
d) Firestore.

A

B

Cloud Spanner.

30
Q

What is the primary purpose of Dataform in Google Cloud?

a) Orchestrating machine learning models.
b) Building SQL-based data transformation pipelines in BigQuery.
c) Managing object storage policies.
d) Monitoring server uptime.

A

B

Building SQL-based data transformation pipelines in BigQuery.