Azure Data Lists Flashcards

1
Q

Data architectures

A
  • Lambda architecture
  • Kappa architecture
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Lambda architecture layers

A

Batch layer
Speed layer
Serving layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data warehouse workload types

A
  • Relational
  • Non-relational
  • Batch
  • Streaming
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Main phases of a data stream flow

A
  • Production
  • Acquisition
  • Aggregation and transformation
  • Storage
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Time window aggregation types

A
  • Tumbling window
  • Hopping window
  • Sliding window
  • Session window
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data stream concepts

A
  • Watermarks
  • Consumer groups
  • Time window aggregations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Batch processing scenarios

A
  • Data set transformation and preparation
  • ETL and ELT workloads
  • Machine learning model training
  • Applying machine learning models on data sets for scoring
  • Report generation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Azure batch Processing Services

A
  • Azure Synapse Analytics
  • Azure Data Lake Analytics
  • Azure HDInsight
  • Azure Databricks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Batch processing tools

A
  • Azure Synapse Analytics
  • Azure Data Lake Analytics
  • Azure HDInsight
  • Azure Databricks
  • Apache Hive
  • Apache Pig
  • Apache Spark
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Analytical data stores

A
  • Azure Synapse Analytics
  • Spark SQL
  • HBase
  • Apache Hive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Five V’s of big data

A
  • Volume
  • Velocity
  • Variety
  • Veracity
  • Value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Analytics techniques

A
  • Descriptive analysis
  • Diagnostic analysis
  • Predictive analysis
  • Prescriptive analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

TDSP phases

A
  1. Business needs
  2. Data discovery and acquisition
  3. Model development
  4. Model deployment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Common TDSP roles

A
  • Subject matter expert
  • Data engineer
  • Data scientist
  • Application developer
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

MLOps best practices

A
  • Exploratory data analysis (EDA)
  • Data Prep and Feature Engineering
  • Model training and tuning
  • Model review and governance
  • Model inference and serving
  • Model deployment and monitoring
  • Automated model retraining
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Azure Data Factory runtime types

A
  • Azure
  • Self-hosted
  • SSIS (SQL Server Integration Services)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Azure Data Factory transformation types

A
  • External services
  • Mapping data flows (uses Apache Spark code, run on Azure Databricks)
  • Wrangling data flows (Power Query editor in Microsoft Power BI)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Azure Data Factory external services for transformations

A
  • Azure SQL Database
  • Azure Synapse Analytics
  • Azure Databricks
  • Azure HDInsight
  • Azure Functions
  • SQL Server Integration Services (SSIS)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Azure Stream Analytics features

A
  • Provisioned or on-demand SQL Server pools
  • Provisioned or on-demand Spark pools
  • Stream processing capabalitiies through window aggregations
  • ML models aggregation through the PREDICT statement
  • Azure DevOps integration
  • Data Factory-like pipelines development experience
  • Power BI report editor integration
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Macro-layers for analytics

A
  • Analytical access
  • Reporting access
  • Dashboarding access
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Azure SQL Database purchasing models

A
  • vCore-based
  • DTU-based
22
Q

Services needed to run SQL Server on an Azure VM

A
  • Azure Storage to contain the virtual disk(s).
  • Azure Virtual Network
  • Azure Compute Service to run the VM
23
Q

Extra PostgreSQL data types

A
  • Document
  • Geometry
  • JSON
  • Composite
  • Custom
24
Q

Azure MariaDB and MySQL pricing tiers

A
  • Basic
  • General Purpose
  • Memory Optimized
25
Q

Azure Database Migration Service pricing tiers

A
  • Premium for continuous migration
  • Standard (free) for offline migration
26
Q

Azure Database Migration Service service tiers

A
  • General Purpose
  • Business Critical
27
Q

Data security layers for Azure SQL Database (outside-in)

A
  • Network security
  • Access management
  • Threat protection
  • Information protection
  • Database
28
Q

Information protection layer methods

A
  • Physical encryption
  • Transparent data encryption (TDE)
  • Always encrypted (column-level)
  • Dynamic data masking
29
Q

Threat protection layer methods

A
  • Azure Monitor logs and Event Hubs Audit
  • Advanced Threat Protection
30
Q

Access management layer methods

A
  • Authentication
  • Authorization
31
Q

Network security layer methods

A
  • Firewall
  • Virtual networks
32
Q

SQL Server authentication methods

A
  • SQL Authentication
  • Active Directory - Universal with MFA
  • Active Directory - Password
  • Active Directory - Integrated
33
Q

Azure SQL Database query tools

A
  • Query Editor
  • Sqlcmd utility
  • Azure Data Studio
  • SQL Server Management Studio
  • Visual Studio Code
34
Q

NoSQL storage types

A
  • Key-value store
  • Document store
  • Columnar data store
  • Graph store
35
Q

Document types in document databases

A
  • XML
  • YAML
  • JSON
  • BSON
36
Q

Azure non-relational storage services

A
  • Azure Cosmos DB
  • Azure Table Storage
  • Azure Blob Storage
  • Azure Files
37
Q

Non-relational storage types

A
  • Key-value store
  • Document store
  • Columnar data store
  • Graph store
  • Time series store
  • Object data store
  • External index data store
38
Q

Cosmos DB structure from the top down

A
  • CosmosDB account
  • Databases
  • Containers
  • Logical partitions
  • Physical partitions
39
Q

Cosmos DB consistency levels

A
  • Strong
  • Bounded staleness
  • Session
  • Consistent prefix
  • Eventual
40
Q

Azure Storage performance levels

A
  • Standard
  • Premium (SSD)
41
Q

Azure Table Storage requirements

A
  • Must have a partition key, row key, and timestamp
  • No more than 255 properties (key/value pairs)
  • No more than 1 MB, or 2MB if using the Table API in Cosmos DB.
42
Q

Types of Azure Blob Storage content

A
  • Page blob
  • Block blob
  • Append blob
43
Q

Methods for accessing Azure Blob Storage

A
  • Azure Storage Explorer
  • Azure Blob API, aka Blob service REST API.
  • Azure PowerShell
  • Azure Command-Line Interface (CLI)
  • Azure.Storage .NET client library
44
Q

Azure Files authentication methods

A
  • Active Directory Domain Services (AD DS) for on-premises Active Directory
  • Azure Active Directory Domain Services (AD DS)
  • Storage account access key (one of the two API keys generated for the account)
45
Q

RBAC basic levels

A
  • Reader
  • Contribitor
  • Elevated contributor
46
Q

Azure non-relational storage security components

A
  • Firewall rules
  • Secure transfer using Transport Layer Security (TLS)
  • Storage data encryption
47
Q

Azure non-relational storage data policies

A
  • Time period retention
  • Append enabled (part of time period retention)
  • Legal hold
48
Q

Azure non-relational storage authentication methods

A
  • Shared key
  • Shared access signature (SAS)
  • Azure Active Directory (Azure AD)
  • Azure Active Directory Domain Services (Azure AD DS) for file shares
49
Q

Shared access signature (SAS) configuration options

A
  • Allowed services
  • Allowed resource types
  • Allowed permissions
  • Option to allow deleting versions of objects
  • Start and end date/time
  • Allowed IP addresses
  • Allowed protocols
  • Preferred routing tier
50
Q
A
51
Q

Tools for diagnosing connection problems

A
  • Telerik Fiddler
  • Microsoft Network Monitor (NetMon)
  • Wireshark
52
Q

Management tools for Azure NoSQL/non-relational data

A
    1. Azure Portal
    1. Azure Data Explorer
    1. AzCopy
    1. Cosmos Explorer
    1. Visual Studio Cloud Explorer