Featues And Architecture Flashcards

(46 cards)

1
Q

What is the purpose of the cloud native data platform offered as a service?

A

Purpose built for Cloud

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which cloud providers does the infrastructure run on?

A

AWS, GCP, Azure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What characteristics of the cloud does the data platform utilize?

A
  • Elasticity
  • Scalability
  • High availability
  • Cost-efficiency
  • Durability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the main features of a data warehouse?

A
  • Structured & relational data
  • ANSI Standard SQL
  • ACID compliant Transactions
  • Data stored in dbs, schemas & tables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a data lake?

A

Scalable storage & compute where schema doesn’t need to be defined upfront

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the features of data engineering in the platform?

A
  • COPY INTO
  • Snowpipe
  • Separate compute clusters
  • Tasks and Streams
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is data secured in the platform?

A

All data is encrypted at rest and in transit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does data science in the platform aim to achieve?

A

Remove data management roadblocks with centralized storage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the components of the Snowflake partner ecosystem?

A
  • Data Sharing
  • Data Marketplace
  • Data Exchange
  • BI tools
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the benefits of using Software as a Service (SaaS) in this platform?

A
  • No management of hardware
  • Transparent updates and patches
  • Subscription payment mode
  • Ease of access through cloud UI
  • Automatic optimization
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the two traditional distributed architecture options mentioned?

A
  • Shared-Disk
  • Shared Nothing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the advantage of the Shared-Disk architecture?

A

Simple to manage and single source of truth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a disadvantage of the Shared-Disk architecture?

A

Single point of failure and limited scalability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is an advantage of the Shared Nothing architecture?

A
  • Co-locating compute & storage avoids network latency
  • Generally cheaper
  • Improved scaling
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a disadvantage of the Shared Nothing architecture?

A
  • Scaling still limited
  • Storage and compute tightly coupled
  • Tendency to overprovision
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the focus of the Snowflake Multi-Cluster Data Architecture?

A

Snowflake is built specifically for Cloud

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the three scalable layers of Snowflake’s architecture?

A
  • Cloud Services Layer
  • Query Processing Layer
  • Data Storage Layer
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What does the Cloud Services Layer handle?

A
  • Authentication
  • Managing cloud infrastructure
  • Query optimization
  • Security
  • Metadata
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the role of the Query Processing Layer?

A

Manage compute to process queries using separate compute clusters called virtual warehouses

20
Q

What is a virtual warehouse in Snowflake?

A

Named abstraction for a cluster of cloud-based compute instances

21
Q

How does the Data Storage Layer organize data?

A

Stores table data organized into databases

22
Q

What is a micro-partition?

A

Data loaded or inserted into partitions optimized for query results

23
Q

What is the billing model for Snowflake?

A

Flat rate by TB calculated monthly

24
Q

What is the default encryption used for data?

A

AES256 strong encryption

25
What is the purpose of User Defined Functions (UDFs)?
Allow users to write their own functions in various programming languages
26
What types of objects are included in the Snowflake Object Model?
* Database * Schema * Table * View * Stage * Pipe * Procedure * Function * Task * Stream
27
What is the difference between a permanent table and a temporary table?
Permanent tables are default, while temporary tables persist only for the session
28
What is a materialized view?
Stores results of a query definition and periodically refreshes it
29
What are tasks in Snowflake?
Objects used to schedule execution of SQL or stored procedures
30
What is a stream in Snowflake?
Schema level object to view and track DML changes to source tables
31
What are the two billing plans available in Snowflake?
* On Demand * Capacity
32
What is SnowSQL?
Allows running commands outside the UI
33
What is Snowpark?
An API accessed outside of Snowflake supporting Java, Scala, and Python
34
What is the maximum number of child tasks in a Directed Acyclic Graph (DAG)?
1000 child tasks max
35
What is Snowflake Scripting?
Extension to Snowflake SQL
36
What can be written using Snowflake Scripting?
Stored procedures and procedural code outside of stored procedures
37
Where can variables be used in Snowflake Scripting?
Only within block
38
How can variables be assigned in Snowflake Scripting?
Using LET keyword in the BEGIN
39
What are the looping constructs available in Snowflake Scripting?
For Loop and Cursor loops over records in table
40
What is RESULTSET in Snowflake Scripting?
Executed when assigned
41
What is Snowpark?
An API accessed outside of the Snowflake
42
Which programming languages does Snowpark support?
* Java * Scala * Python
43
What type of abstraction does Snowpark provide?
In a dataframe
44
What is the execution model of Snowpark?
Executions are lazy
45
What model does Snowpark work on?
Push out model
46
Why do developers prefer using Snowpark?
So developers can use Snowflake in preferred language