Products Flashcards
(18 cards)
What Is
Dataproc
Cloud Dataproc is a managed Hadoop and Spark service where a preconfigured cluster
can be created with one command line or console operation.
When working with Cloud Dataproc, you must know how to manage data storage, configure a cluster, and submit jobs.
What is
Dataflow
Cloud Dataflow is a managed stream and batch processing service. It is a core component for building pipelines that collect, transform, and output data.
Cloud Dataflow does not require you to configure instances or clusters—it is a no-ops service. Cloud Dataflow pipelines are run within a region. It directly integrates with Cloud Pub/Sub, BigQuery, and the Cloud ML Engine. Cloud Dataflow integrates with Bigtable and Apache Kafka.
What is
Pub/Sub
Cloud Pub/Sub is a real-time messaging service that supports both push and pull subscription models. It is a managed service, and it requires no provisioning of servers or clusters. Cloud Pub/Sub will automatically scale and partition load as needed.
What is
Compute Engine
VM running in google infrastructure
What is
App Engine
Plarform as a service, for building and hosting web apps.
What is
Google Kubernetes Engine (GKE)
Kubernetes Service
What is
Cloud Run
Platform for running serverless apps in containers.
What is
Cloud Functions
Functions Handling
What is
Cloud Storage
Cloud Storage is a managed service for storing unstructured data.
What is
Cloud SQL
Relational database service for PostgreSQL, MySQL, and SQL Server.
Key features:
- regional database
- limit 30 TB (if u need more move to cloud spanner)
What is
Big Query
Data Werehouse
What is
Spanner
Globally distributed, werticly scalled database.
- SQL
- Creating globally synchronous multiple replicas of each rows in multiple locations
What is
Cloud Bigtable
Low-latency, Cassandra, and HBase-compatible NoSQL database service.
Key Features:
- NoSQL
- Wide-column type
- High-volume
- low latency (miliseconds)
Cloud Firestore
NoSQL Document database.
Replacement of Cloud Datastore
Cloud Memorystore
Redis (REmote DIctionary Server) service.
Commonly used for caching.
Persistent Disk
Discs for compute engine
Cloud Datastore
NoSQL document database.
What is
Cloud SQL Proxy
Tool(?) which provides secure access to second generation instances without having to create allow lists or to configure SSL.
The proxy manages authentication and automatically encrypts data.