Features and Architecture Flashcards
(46 cards)
What is the purpose of the cloud-native data platform offered as a service?
Purpose-built for the cloud
Which cloud providers does the infrastructure run on?
AWS, GCP, Azure
What characteristics of the cloud does the data platform utilize?
- Elasticity
- Scalability
- High availability
- Cost-efficiency
- Durability
What are the main features of a data warehouse?
- Structured & relational data
- ANSI Standard SQL
- ACID compliant Transactions
- Data stored in databases, schemas & tables
What is a data lake?
Scalable storage & compute where schema doesn’t need to be defined upfront
What are the features of data engineering in the platform?
- COPY INTO
- Snowpipe
- Separate compute clusters
- Tasks and Streams
How is data secured in the platform?
All data is encrypted at rest and in transit
What does data science in the platform aim to achieve?
Remove data management roadblocks with centralized storage
What are the components of the Snowflake partner ecosystem?
- Data Sharing
- Data Marketplace
- Data Exchange
- BI tools
What are the benefits of using Software as a Service (SaaS) in this platform?
- No management of hardware
- Transparent updates and patches
- Subscription payment model
- Ease of access through cloud UI
- Automatic optimization
What are the two traditional distributed architecture options mentioned?
- Shared-Disk
- Shared Nothing
What is the advantage of the Shared-Disk architecture?
Simple to manage and single source of truth
What is a disadvantage of the Shared-Disk architecture?
Single point of failure and limited scalability
What is an advantage of the Shared Nothing architecture?
- Co-locating compute & storage avoids network latency
- Generally cheaper
- Improved scaling
What is a disadvantage of the Shared Nothing architecture?
- Scaling still limited
- Storage and compute tightly coupled
- Tendency to overprovision
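The overprovisioning point can be made concrete with a little arithmetic. The sketch below is a toy model, not a description of any real system: it assumes every node bundles a fixed amount of storage and compute, and the function name, parameters, and workload numbers are all hypothetical.

```python
import math

def nodes_needed(storage_tb, compute_units, tb_per_node, units_per_node):
    """Nodes required when each node bundles fixed storage and compute (assumed model)."""
    for_storage = math.ceil(storage_tb / tb_per_node)
    for_compute = math.ceil(compute_units / units_per_node)
    # Coupling means the cluster must satisfy the larger of the two demands.
    return max(for_storage, for_compute)

# Hypothetical workload: a lot of data, very little query load.
nodes = nodes_needed(storage_tb=100, compute_units=8, tb_per_node=10, units_per_node=4)
idle_compute = nodes * 4 - 8  # compute units paid for but not needed
print(nodes, idle_compute)
```

In this example the storage demand alone dictates the cluster size, so most of the compute that comes with those nodes sits unused.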
What is the focus of the Snowflake Multi-Cluster Shared Data Architecture?
Snowflake is built specifically for the cloud
What are the three scalable layers of Snowflake’s architecture?
- Cloud Services Layer
- Query Processing Layer
- Data Storage Layer
What does the Cloud Services Layer handle?
- Authentication
- Managing cloud infrastructure
- Query optimization
- Security
- Metadata
What is the role of the Query Processing Layer?
Manages compute to process queries, using separate compute clusters called virtual warehouses
What is a virtual warehouse in Snowflake?
Named abstraction for a cluster of cloud-based compute instances
How does the Data Storage Layer organize data?
Stores table data organized into databases
What is a micro-partition?
A small storage partition into which loaded or inserted data is automatically divided, optimized for query performance
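One way to picture why partitioned storage helps queries is range-based pruning: if each partition records the minimum and maximum value of a column, a filter can skip partitions that cannot contain matching rows. The sketch below is a simplified illustration of that idea, not Snowflake's implementation; the class, field, and function names are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class MicroPartition:
    # Hypothetical per-partition metadata for a single column.
    name: str
    min_value: int
    max_value: int

def partitions_to_scan(partitions, low, high):
    """Keep only partitions whose [min, max] range overlaps the filter range [low, high]."""
    return [p.name for p in partitions if p.max_value >= low and p.min_value <= high]

partitions = [
    MicroPartition("p1", 1, 100),
    MicroPartition("p2", 101, 200),
    MicroPartition("p3", 201, 300),
]

# A filter such as "value BETWEEN 150 AND 210" only needs p2 and p3.
scanned = partitions_to_scan(partitions, 150, 210)
print(scanned)
```

Partition p1 is never read, because its recorded range ends below the filter's lower bound.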
What is the billing model for Snowflake storage?
Flat rate per TB, calculated monthly
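A flat per-TB rate calculated monthly reduces to simple arithmetic. The sketch below assumes the monthly figure is based on the average of daily storage amounts; the rate and the daily volumes are made-up numbers for illustration, and the function name is hypothetical.

```python
def monthly_storage_cost(daily_tb, rate_per_tb):
    """Average the daily storage (in TB) over the month, then apply the flat rate."""
    average_tb = sum(daily_tb) / len(daily_tb)
    return average_tb * rate_per_tb

# Hypothetical month: 2 TB stored for 15 days, then 4 TB for 15 days.
daily_tb = [2.0] * 15 + [4.0] * 15
cost = monthly_storage_cost(daily_tb, rate_per_tb=25.0)  # rate is an assumed value
print(cost)
```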
What is the default encryption used for data?
AES-256 strong encryption