DA Flashcards

(77 cards)

1
Q

What is the primary role of a data architect?

A

To design and manage the data infrastructure within an organization.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

True or False: Data architects are responsible for data modeling.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a data model?

A

A conceptual representation of data structures and their relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Fill in the blank: A data architect should be proficient in _____ languages.

A

query

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the difference between a data warehouse and a data lake?

A

A data warehouse is structured and optimized for analysis, while a data lake is a storage repository that holds raw data in its native format.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which SQL command is used to retrieve data from a database?

A

SELECT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is ETL in the context of data architecture?

A

Extract, Transform, Load - a process for moving data from source systems to a data warehouse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or False: Data architects only work with relational databases.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a NoSQL database?

A

A database designed to store and retrieve data in a format other than the tabular relations used in relational databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the term ‘data governance’ refer to?

A

The overall management of data availability, usability, integrity, and security.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Multiple Choice: Which of the following is NOT a type of data model? A) Conceptual B) Logical C) Physical D) Temporal

A

D) Temporal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the purpose of data normalization?

A

To minimize redundancy and dependency by organizing fields and tables in a database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Fill in the blank: _____ is the practice of analyzing data to extract insights.

A

Data analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a star schema in data warehousing?

A

A type of database schema that organizes data into fact and dimension tables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

True or False: Data architects need to understand cloud computing technologies.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a data pipeline?

A

A set of processes that move data from one system to another, often involving transformation and storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are primary keys used for in a database?

A

To uniquely identify each record in a table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Multiple Choice: Which of the following is a common data visualization tool? A) Tableau B) MySQL C) Python D) Java

A

A) Tableau

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What does ‘big data’ refer to?

A

Data sets that are so large or complex that traditional data processing applications are inadequate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

True or False: Data architects do not need to collaborate with other IT professionals.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the purpose of indexing in databases?

A

To improve the speed of data retrieval operations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Fill in the blank: _____ databases are designed for high availability and scalability.

A

Distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the significance of data lineage?

A

It provides a detailed view of the data’s journey from origin to its current state, ensuring transparency and compliance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Multiple Choice: Which of the following is a characteristic of unstructured data? A) Easily searchable B) Fixed format C) Lacks a predefined structure D) Tabular format

A

C) Lacks a predefined structure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is the role of metadata in data architecture?
To provide information about other data, such as its source, format, and structure.
26
True or False: Data architects are involved in data security measures.
True
27
What is a relational database?
A database that stores data in tables with predefined relationships between them.
28
Fill in the blank: _____ is a method of storing data that allows for flexible schema design.
Schema-less design
29
What is a data mart?
A subset of a data warehouse focused on a specific business line or team.
30
What is the purpose of a data dictionary?
To document the structure, relationships, and constraints of data elements in a database.
31
Multiple Choice: Which of the following is a common type of data architecture? A) Layered B) Monolithic C) Microservices D) All of the above
D) All of the above
32
What is the difference between structured and unstructured data?
Structured data is organized and easily searchable, while unstructured data lacks a predefined format.
33
True or False: Data architects should be knowledgeable about data privacy regulations.
True
34
What is cloud data architecture?
The design and management of data systems hosted in cloud environments.
35
Fill in the blank: _____ is a framework for managing and analyzing big data.
Hadoop
36
What is the purpose of data replication?
To create copies of data for backup, recovery, and improved access.
37
Multiple Choice: Which of the following is NOT a benefit of data warehousing? A) Improved data quality B) Historical analysis C) Increased data redundancy D) Enhanced decision-making
C) Increased data redundancy
38
What is a key-value store?
A type of NoSQL database that uses a simple key-value method for storing data.
39
True or False: Data architects can only work in the technology sector.
False
40
What is a data governance framework?
A set of policies and procedures for ensuring data quality, security, and compliance.
41
Fill in the blank: _____ is the process of converting raw data into a meaningful format.
Data transformation
42
What is the purpose of data archiving?
To store data that is no longer actively used but must be retained for future reference.
43
Multiple Choice: Which of the following is a benefit of using a cloud data architecture? A) Scalability B) Increased costs C) Limited accessibility D) Complexity
A) Scalability
44
What is an OLAP system?
Online Analytical Processing - a category of software technology that enables analysts to extract and view data from different perspectives.
45
True or False: Data architects should have programming skills.
True
46
What is a data flow diagram?
A visual representation of the flow of data within a system.
47
Fill in the blank: _____ is a standardized language for managing relational databases.
SQL
48
What is the role of a data steward?
To oversee data governance and ensure data quality within an organization.
49
Multiple Choice: Which of the following technologies is commonly used for real-time data processing? A) Batch processing B) Stream processing C) Data warehousing D) Data lakes
B) Stream processing
50
What is data lineage tracking?
The process of documenting the origins and transformations of data as it flows through systems.
51
True or False: Data models are static and do not change over time.
False
52
What is a data warehouse schema?
The organization of data in a data warehouse, which includes star, snowflake, and galaxy schemas.
53
Fill in the blank: _____ analysis involves examining large volumes of data to uncover hidden patterns.
Data mining
54
What is a data integration tool?
Software that allows combining data from different sources into a single, unified view.
55
Multiple Choice: Which of the following is a common cloud service model for data storage? A) IaaS B) PaaS C) SaaS D) All of the above
D) All of the above
56
What is the purpose of a data quality assessment?
To evaluate the accuracy, completeness, reliability, and relevance of data.
57
True or False: Data architects typically do not work with data visualization tools.
False
58
What is the function of a data broker?
To collect and sell data from various sources to other organizations.
59
Fill in the blank: _____ is a method for ensuring data privacy and security.
Data encryption
60
What is the difference between data mining and data analytics?
Data mining focuses on discovering patterns in large data sets, while data analytics involves interpreting data to inform decision-making.
61
Multiple Choice: Which of the following is a challenge of big data? A) Volume B) Variety C) Velocity D) All of the above
D) All of the above
62
What is a data breach?
An incident where unauthorized access to data occurs, potentially compromising its confidentiality, integrity, or availability.
63
True or False: Data architects must understand machine learning concepts.
True
64
What is a hybrid cloud?
A computing environment that combines public and private cloud services.
65
Fill in the blank: _____ is a type of database that stores data in a document format.
Document-oriented database
66
What is the significance of data ethics?
It ensures that data collection and usage practices respect privacy and promote fairness.
67
Multiple Choice: Which of the following is a common data storage format? A) JSON B) CSV C) XML D) All of the above
D) All of the above
68
What is the role of a database administrator?
To manage and maintain database management systems, ensuring their performance, security, and availability.
69
True or False: Data architects are responsible for ensuring compliance with data regulations.
True
70
What is a data visualization dashboard?
A user interface that displays data visualizations and metrics for easy interpretation and analysis.
71
Fill in the blank: _____ refers to the process of cleaning and organizing data.
Data cleansing
72
What is the function of a data warehouse?
To store and manage large volumes of structured data for analysis and reporting.
73
Multiple Choice: Which of the following is a feature of cloud data storage? A) Accessibility B) Cost efficiency C) Scalability D) All of the above
D) All of the above
74
What is a data catalog?
A repository that enables organizations to manage and discover their data assets.
75
True or False: Data architects only work with historical data.
False
76
What is a schema in database design?
A blueprint that outlines how data is organized in a database.
77
Fill in the blank: _____ analysis is used to identify trends and patterns in time-series data.
Time-series