Azure Data Lists Flashcards

(52 cards)

1
Q

Data architectures

A
  • Lambda architecture
  • Kappa architecture
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Lambda architecture layers

A

Batch layer
Speed layer
Serving layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data warehouse workload types

A
  • Relational
  • Non-relational
  • Batch
  • Streaming
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Main phases of a data stream flow

A
  • Production
  • Acquisition
  • Aggregation and transformation
  • Storage
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Time window aggregation types

A
  • Tumbling window
  • Hopping window
  • Sliding window
  • Session window
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data stream concepts

A
  • Watermarks
  • Consumer groups
  • Time window aggregations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Batch processing scenarios

A
  • Data set transformation and preparation
  • ETL and ELT workloads
  • Machine learning model training
  • Applying machine learning models on data sets for scoring
  • Report generation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Azure batch Processing Services

A
  • Azure Synapse Analytics
  • Azure Data Lake Analytics
  • Azure HDInsight
  • Azure Databricks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Batch processing tools

A
  • Azure Synapse Analytics
  • Azure Data Lake Analytics
  • Azure HDInsight
  • Azure Databricks
  • Apache Hive
  • Apache Pig
  • Apache Spark
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Analytical data stores

A
  • Azure Synapse Analytics
  • Spark SQL
  • HBase
  • Apache Hive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Five V’s of big data

A
  • Volume
  • Velocity
  • Variety
  • Veracity
  • Value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Analytics techniques

A
  • Descriptive analysis
  • Diagnostic analysis
  • Predictive analysis
  • Prescriptive analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

TDSP phases

A
  1. Business needs
  2. Data discovery and acquisition
  3. Model development
  4. Model deployment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Common TDSP roles

A
  • Subject matter expert
  • Data engineer
  • Data scientist
  • Application developer
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

MLOps best practices

A
  • Exploratory data analysis (EDA)
  • Data Prep and Feature Engineering
  • Model training and tuning
  • Model review and governance
  • Model inference and serving
  • Model deployment and monitoring
  • Automated model retraining
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Azure Data Factory runtime types

A
  • Azure
  • Self-hosted
  • SSIS (SQL Server Integration Services)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Azure Data Factory transformation types

A
  • External services
  • Mapping data flows (uses Apache Spark code, run on Azure Databricks)
  • Wrangling data flows (Power Query editor in Microsoft Power BI)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Azure Data Factory external services for transformations

A
  • Azure SQL Database
  • Azure Synapse Analytics
  • Azure Databricks
  • Azure HDInsight
  • Azure Functions
  • SQL Server Integration Services (SSIS)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Azure Stream Analytics features

A
  • Provisioned or on-demand SQL Server pools
  • Provisioned or on-demand Spark pools
  • Stream processing capabalitiies through window aggregations
  • ML models aggregation through the PREDICT statement
  • Azure DevOps integration
  • Data Factory-like pipelines development experience
  • Power BI report editor integration
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Macro-layers for analytics

A
  • Analytical access
  • Reporting access
  • Dashboarding access
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Azure SQL Database purchasing models

A
  • vCore-based
  • DTU-based
22
Q

Services needed to run SQL Server on an Azure VM

A
  • Azure Storage to contain the virtual disk(s).
  • Azure Virtual Network
  • Azure Compute Service to run the VM
23
Q

Extra PostgreSQL data types

A
  • Document
  • Geometry
  • JSON
  • Composite
  • Custom
24
Q

Azure MariaDB and MySQL pricing tiers

A
  • Basic
  • General Purpose
  • Memory Optimized
25
Azure Database Migration Service pricing tiers
* Premium for continuous migration * Standard (free) for offline migration
26
Azure Database Migration Service service tiers
* General Purpose * Business Critical
27
Data security layers for Azure SQL Database (outside-in)
* Network security * Access management * Threat protection * Information protection * Database
28
Information protection layer methods
* Physical encryption * Transparent data encryption (TDE) * Always encrypted (column-level) * Dynamic data masking
29
Threat protection layer methods
* Azure Monitor logs and Event Hubs Audit * Advanced Threat Protection
30
Access management layer methods
* Authentication * Authorization
31
Network security layer methods
* Firewall * Virtual networks
32
SQL Server authentication methods
* SQL Authentication * Active Directory - Universal with MFA * Active Directory - Password * Active Directory - Integrated
33
Azure SQL Database query tools
* Query Editor * Sqlcmd utility * Azure Data Studio * SQL Server Management Studio * Visual Studio Code
34
NoSQL storage types
* Key-value store * Document store * Columnar data store * Graph store
35
Document types in document databases
* XML * YAML * JSON * BSON
36
Azure non-relational storage services
* Azure Cosmos DB * Azure Table Storage * Azure Blob Storage * Azure Files
37
Non-relational storage types
* Key-value store * Document store * Columnar data store * Graph store * Time series store * Object data store * External index data store
38
Cosmos DB structure from the top down
* CosmosDB account * Databases * Containers * Logical partitions * Physical partitions
39
Cosmos DB consistency levels
* Strong * Bounded staleness * Session * Consistent prefix * Eventual
40
Azure Storage performance levels
* Standard * Premium (SSD)
41
Azure Table Storage requirements
* Must have a partition key, row key, and timestamp * No more than 255 properties (key/value pairs) * No more than 1 MB, or 2MB if using the Table API in Cosmos DB.
42
Types of Azure Blob Storage content
* Page blob * Block blob * Append blob
43
Methods for accessing Azure Blob Storage
* Azure Storage Explorer * Azure Blob API, aka Blob service REST API. * Azure PowerShell * Azure Command-Line Interface (CLI) * Azure.Storage .NET client library
44
Azure Files authentication methods
* Active Directory Domain Services (AD DS) for on-premises Active Directory * Azure Active Directory Domain Services (AD DS) * Storage account access key (one of the two API keys generated for the account)
45
RBAC basic levels
* Reader * Contribitor * Elevated contributor
46
Azure non-relational storage security components
* Firewall rules * Secure transfer using Transport Layer Security (TLS) * Storage data encryption
47
Azure non-relational storage data policies
* Time period retention * Append enabled (part of time period retention) * Legal hold
48
Azure non-relational storage authentication methods
* Shared key * Shared access signature (SAS) * Azure Active Directory (Azure AD) * Azure Active Directory Domain Services (Azure AD DS) for file shares
49
Shared access signature (SAS) configuration options
* Allowed services * Allowed resource types * Allowed permissions * Option to allow deleting versions of objects * Start and end date/time * Allowed IP addresses * Allowed protocols * Preferred routing tier
50
51
Tools for diagnosing connection problems
* Telerik Fiddler * Microsoft Network Monitor (NetMon) * Wireshark
52
Management tools for Azure NoSQL/non-relational data
* 1. Azure Portal * 1. Azure Data Explorer * 1. AzCopy * 1. Cosmos Explorer * 1. Visual Studio Cloud Explorer