snowflake Trial Edition Flashcards

(154 cards)

1
Q

What are the main pushbuttons in Snowflake main screen

A

Projects
Data
Data products
AI & ML
Monitoring
Admin

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the options under projects

A

Worksheets
Streamlit
Dashboards
App packages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is VPS

A

Virtual private Snowflake

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the main cloud hosting snowflake

A

AWS, Azure and GCP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Each snowflake account is hosted in a single region

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the unique layers of Snowflake

A

Storage layer
Compute layer
Cloud service layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is share disc architecture

A

one data base with several services

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is share nothing architecture

A

several dbs in paralel with many services

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the characteristics of Data Storage layer

A

Underlying cloud layer
Virtually infinite storage
Compressed & Encrypted
Cloud Data Redundancy
pay only for stored data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the characteristics of Compute & Processing Layer

A

Query engine or virtual warehouse
Underlying virtual machines
Scale up and down as needed
different size to serve different loads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the characteristics of Cloud Service Layer (Brain of snowflake)

A

Authentication and Authorisation
User and session management
Query compilation, Optimization & Data Caching
virtual Warehouse Management
Metadata management

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Can each layer be scaled independently

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does snowflake do to the loaded data

A

It compresses and store data in columnar form.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What can we do in the compute layer

A

Select queries
join queries
data loading

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is VWH

A

Virtual warehouse and before any query is executed they need to be provisioned.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Which layer takes care of the authentication 7 authorisation

A

Cloud service layer for all incoming request from:
Web UI
SnowSQL
JDBC
ODBC

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what is another name of Query in snowflake

A

Work Load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What attributes makes Snowflake a true SAAS Solution

A

No hardware to purchase or configured
No maintenance upgrades or patches
Transparent releases don’t required user intervention.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Which instalment options are available

A

AWS, Azure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Three terms are used to described the same compute layer

A

Compute Layer
Virtual Warehouse Layer
Query processing layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Two terms are used to described the same Cloud Services Layer

A

Cloud Services Layer
Virtual Warehouse Layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Two of these terms are used to described the same data layer

A

Data layer
Storage layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Which tasks are performed by the Cloud Services Layer

A

Metadata management
User authentication
Metadata storage
Data security

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is DML

A

Data Manipulation language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is DDL
Data Definition Language
26
What are the method types for loading data
Bulk loading and continuous loading
27
what the stages types
External and Internal
28
produce a syntax of an inner join
select A.C_CUSTOMER_SK, A.C_CUSTOMER_ID, A.C_CURRENT_CDEMO_SK, B.CA_STREET_NUMBER, B.CA_STREET_NAME from SNOWFLAKE_SAMPLE_DATA.TPCDS_SF100TCL_OLD.CUSTOMER as A inner join SNOWFLAKE_SAMPLE_DATA.TPCDS_SF100TCL_OLD.CUSTOMER_ADDRESS as B on A.C_CUSTOMER_SK = B.CA_ADDRESS_SK;
29
what did you get as soon as Database is created in snowflake
Information Schema and Public
30
What are the options of the submenu for creating a table
Standard, As select External From file
31
What is the qualification when creating tables
DB.Schema.Table name
32
What is the acceptable syntax for naming surrogate key
_PK, _FK
33
What are the benefits of surrogate keys
performance improvement handle dummy values integrate multiple systems easier admin and update useful when there are no natural keys available
34
What is type 0
Retain original
35
What is SCD type 1
The values are overwritten and no history is maintain.
36
What is SCD type 2
We maintain a complete history of changes. Every time, there is a change a new row will be added to the table. A new column is added to the table to indicate witch record is active. This column(s) could be a flag (true or false) or version number (0,1,2, etc), or time columns to indicate from/to.
37
What is SCD type 3
A partial history is maintained and not a complete history. an extra row is added to indicate the previous state.
38
What is SCD type 4
The dim table changes frequently. There is a need for creating a new table or redesign existing model.
39
What is SCD type 6
This is a combination of 1,2,3. Along with the addition of a new row, we also update the latest value in all the rows.
40
What are the stages types
External and Internal
41
Where is the data flowing to Snowflake to form External Stage
S3, External cloud Provider, Google Cloud, Microsoft Azure
42
What is the copy syntax
copy into from externalStage files = ('file_name1','file_name2') file_format = copyoptions
43
Where can I find Snowflake documentation
docs.snowflake.com
44
What is the syntax for creating a schema
create or replace schema qualified address + name
45
What is the syntax for creating a file format
create or replace file format qualified address + name
46
How to determine a file definition
by using desc file format qualified address + name
47
What is the syntax for changing a file properties
alter file format qualified address + name for example SKIP_HEADER = 1
48
Alter cannot be used for changing properties that are not inherently to its native format.
True
49
What is the syntax for setting up csv file properties
create or replace file format qualified address + name TYPE = CSV, FIELD_DELIMITER = "," SKIP_HEADER = 1;
50
What is the syntax for loading data into a table
copy into DB.Schema.Table name from @qualified name file_format = (format_name = qualified name + csv file format) files = ('file name.csv')
51
What does the truncate command does
Delete the content of a table
52
How to get rid of double quotes in Json file - column
:: String
53
What command is used to select one row from a Json file
raw_file:
54
How to use raw_file for extracting a Json object
raw_file object name.first property, raw_file object name.second property,
55
What is the syntax for creating a role
create role grant usage on to role
56
How can you delete an user
drop user
57
How long does the cash storage last
24 hrs or an soon the data is changed
58
Is clustering for all tables
False
59
What are the most suitable field for clustering
Date or those fields used in the where clause or in joins
60
What is the syntax for creating clusters in tables
create table name ..... cluster by (column1, column2 ...column) or expression <>
61
What is the syntax for changing clusters in tables
alter < table name> cluster by (expression 1, ... expression n) alter
drop clustering key
62
What is the process for building Snowpipe from S3
1.Create Stage 2. Test copy command 3. Create pipe 4. S3 notification
63
What is Snowpipe
It enables loading once a new data is storing in a third party source bucket.
64
What happens when auto_infest = True
Data will be push from source system bucket automatically to Snowflake.
65
What is the syntax for creating a pipe
create or replace pipe auto_ingest=true as copy into our_first from @manage_DB_external
66
Mentioned the time travel type
Standard Enterprise Business Critical Virtual private
67
What is the retention period - default
Data can be recovered up to 1 day only
68
Who can perform the fail safe process
Restoring only by Snowflake support
69
How many methods are there to recover time travel data
2
70
Can undrop work schemas and tables
True
71
What are the table types
Permanent Transient Temporary
72
what does the field kind contain
Table type
73
Can the Transient schema pass on the same properties to created tables under this schema
True
74
is cloned object independent from original
True
75
What is the syntax for creating a clone object
create table clone before (timestamp => (timestamp))
76
Is data sharing involve the recipient with the right to update the data in the original table
False
77
What is the syntax for sharing data on a database
grant usage on database to share orders_share;
78
what objects can you data share
Database Schema Table
79
What is behind_by
it is the time difference between a query and an materialised view.
80
What does masking policy do
hire some field content to unauthorised users
81
What is a classic console
This is the old UI
82
Where can you get more information of how to do things in Snowflake
docs.snowflake.com
83
Where can you get more information about courses
learn.snowflake.com
84
What is identity all about
identifying the person
85
What is authentication
What the user can do
86
what is the role with more power
accountadmin
87
What is the role given for all account trial users as default
SYSADMIN
88
In which direction the inpersonation flows
Downward
89
If you had been awarded SYSADMIN directly, you would not be able to impersonate ACCOUNTADMIN.
True
90
What is DAC
Discrecionary Access Control: you created you owned it
91
After creating a database, what are the default schemas provided
Information schema and public
92
What are databases created for
Databases are used to group datasets (tables) together.
93
Can an information schema be dropped
False
94
What is the content of information schemas
The INFORMATION_SCHEMA schema holds a collection of views.
95
How many menus are included in worksheet
Role Warehouse Database Schema
96
what does Running a SHOW DATABASES command do
It is just like being at the first level of an Object Picker (but with more details, and the ability to cut and paste the info into a spreadsheet).
97
Tsai points out that every worksheet has four configuration settings that are saved with the worksheet. How are they known as
Context settings, 2 provide shot cuts for the data storage location. The other are for the user role and warehouse.
98
What is the warehouse
The machine that crunches data (computer power).
99
What is the idle time allows for the machine to turn of wait for any code running
10 minutes
100
Can Select statement run without role or datawarehouse
False
101
Is the user role and warehouse mandatory
True
102
What can you do with Worksheets
Write code Run code Save code Share
103
Up to what cluster can a warehouse stretch
10
104
Team is also known as
Clusters
105
Team members are also known as
Servers
106
Warehouses can be manually scaled UP or DOWN. They can be set up so that they automatically scale OUT. What did the video call the opposite of SCALING OUT?
Snapping back.
107
What is the meaning of cluster
Group of servers
108
XS not scaling how many servers does it have
1 cluster
109
M not scaling how many servers does it have
1 cluster
110
Is the number of servers different based on XS, S, M, etc
True
111
A XS warehouse when it is scale out would it has more than 1 cluster
True
112
A M warehouse when it is scale out would it has more than 1 cluster
True
113
do clusters hold more than 1 server
True
114
Has any worksheet include a default role
True
115
So what is a stage?
A stage or staging area is a place we put things temporarily with the intention to later move them to a more stable, longer term location.
116
What are the Stages type
Internal and External (Cloud storage services)
117
What are the requirments for creating an external AWS Cloud services
Storage location - S3 bucket Cloud storage access AWS IAM user and policy and In snowflake define stage object with reference to the AWS S3 bucket
118
does snowflake care about capitalization
False
119
snowflake will type table names in uppercase if the name is not in double quotes
True
120
What the tools used for loading data into AWS S3 buckets
Amazon transfer for AFP Amazon CLI Web browser interface Python Java REST API
121
Is there any load wizard for loading data from S3 to Snowflake
False
122
To run the COPY INTO command, you need to have three database objects defined.
Table Stage File format
123
snowflake tools allow you to import 5 popular semi structure data formats like
Json xml Parquet Avro Orc
124
A Snowflake tool used for ingesting semi structure data format is known as
Snowflake variant data type
125
When loading semi structure data, what is the data type of the column holding the upload data
Variant
126
What are the data type trend
Unstructured Quasi structured Semi structured Structured
127
What are some examples of Unstructured
Images, PDF, videos
128
What are some examples of Quasi structured
ClikStream
129
What are some examples of Semi structured
XML, JSON, Parquet, AVRO, ORC
130
What are some examples of Structured
RDBMS AND CSV files
131
What the flatten command do
It flatten for example list arrays in a Json file. This is similar to the Russian nested dolls.
132
Usually how are the twit entities called
Statuses
133
What are the layers of Snowflake architecture
Storage Query Processing cloud Services
134
What is the query processing
It is the muscle of the system and performs massive parallel system
135
Which layer is the brain of the system
Cloud Services: access control, security, optimizer, metadata, manage the infrastructure.
136
What is Data Warehouse
DB that is used for reporting and data analysis.
137
VALIDATION_MODE = RETURN_ERRORS; What is this
When loading the file will be checked for errors.
138
VALIDATION_MODE = RETURN_5_ROWS ; What is this
When loading the file; the first 5 lines will be checked for errors.
139
SIZE_LIMIT = 2000; What is this
The system will load up to 20000 mb
140
RETURN_FAILED_ONLY = TRUE; What is this
The system will indicate the files containing errors.
141
TRUNCATE COLUMNS = TRUE; What is this
Strings are automatically truncated to the target column length
142
TRUNCATE COLUMNS = FALSE; What is this
Copy pro9duces an error if a loaded string exceed the target column length
143
FORCE=TRUE; What is this
Indicates tot he system to load the files(s) regardless of whether the file has been loaded previously and have not change since loaded previously. It has the potential to duplicate the data. Default is FALSE.
144
What is Loading History
Enables you to retrieve the history of data loaded into tables using the COPY INTO
command
145
We have 3 files in our external stage with 20MB (~20 000 000 bytes) each. How many files will be loaded if we set SIZE_LIMIT = 30 000 000 ?
2
146
how do you change from variant to string
SELECT RAW_FILE:first_name::string as first_name FROM OUR_FIRST_DB.PUBLIC.JSON_RAW;
147
How to create a table by using select statement
148
How to format a date column - parquet data to readable date
DATE($1:date::int ) as Date
149
What is this statement METADATA$FILENAME AS FILENAME do in an sql query
Indicate the file name loaded to a stage or table
150
What is this statement TO_TIMESTAMP_NTZ(CURRENT_TIMESTAMP) AS LOAD_DATE,
151
W
152
What measures can be taken to improve performance
1. Create virtual warehouses 2. Scale up during season of high demand 3. Scale out to unknow patters of workload 4. Maximise cached usage 5. Cluster keys - large tables
153
What is a cluster
Subset of rows t olocate the data in micro partitions. For large tables improves the table scan.
154