S2-m6 Flashcards

1
Q

Operational Data Storage (ODS)

A

A repository of transactional data from multiple sources and is often an interim area between a data source and data warehouses. Ex) captured transactional data an be related to operational activities such as customer order, sales, or vendor payments
-smaller and are frequently overwritten as transactions are modified

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data Warehouse

A

Very large data repositories that are centralized and used for reporting and analysis rather than for transactional purposes
-pulls data directly from enterprise systems or from a ODS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data Mart

A

Like a warehouse but is more focused on a specific purpose such as a marketing or logistics and is often a subset of a warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data Lake

A

A repository similar to a warehouse, but it contains both structured and unstructured data, with data being intis natural or raw format
-does not have a predefined structure or schema

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Relational Database

A

The most efficient and effective method to store data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Benefits of a Relational database

A

storing data in a normalized, relational database helps to reasonably assure:
-Completeness
-No redundancy
-Business rules enforcement
-Communication and integration of Business Processes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Tables

A

Tables are organized structures that establish columns and rows to store specific types of data records. Each represents and object

The existence of more than one table in a relational database is the first differentiator between flat files and relational databases; relational databases are made up of at least two tables that are related.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Attributes

A

Columns in a relational database Describe the characteristics or properties desired to be known about each entity. Must be unique to that table. Three types of columns: primary keys, foreign keys, and descriptive attributes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Records

A

Rows contains info about one entity within the table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Fields

A

A space created at the intersection of a column and row in a table in which data is entered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Data types

A

Each attribute in a table has a designated data type. Specify how the data is stored and indicate how the data in a given attribute can be analyzed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Primary Key

A

A primary key attribute is required in every table. It typically made up of one column. The purpose of the primary key is to help solidify that each row in the table is unique
-rarely descriptive; instead, a collection of letters or sequential numbers
-like id numbers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Composite Primary Key

A

When more than one attribute is necessary to function as a unique identifier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Foreign Key

A

Are attributes in one table that are also primary keys in another table
-same customer ID may appear multiple times in the sales orders table because a single customer can place more than one order
- link between a primary key in one table and a foreign key in another table is what creates a relationship between tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Data Dictionaries

A

A type of metadata - data about data
Summaries info about the data in a database to make it easier to work with the data and understand how it can be used

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Normalization

A

a database design technique that reduces data redundancy and eliminated undesirable characteristics. Normalization rules divide larger tables into smaller tables and link them using relationships. Purpose is to eliminate redundancy

17
Q

First Normal Form 1NF Criteria

A
  1. Each cell in a table must contain only once piece of information. One value may be in a column
  2. Each record in every table must be uniquely identified. Accomplished with a primary key,
18
Q

Second Normal Form 2NF

A

Follows once a table is in 1NF. Requires all non-key attributes in a table to depend on the entire primary key. 2nF is particularly meaningful for tables that have composite primary keys.

19
Q

Third Normal Form 3NF

A

Once the table is in 1NF and 2NF the next step is to ascertain that each column n a table describes only the PK. 3NF wants to establish that non of the none key attributes depend on other non key attributes

20
Q

What is a database schema?

A

a set of instructions to tell the database engine how to organize data to be in compliance with the data models. defines the actual structure of the database, including the tables, columns, and relationships between the data entities. Actual implementation and execution of the design

21
Q

What is a data model?

A

Conceptual representations of the data structures in an information system and are not restricted to relational databases only. A high level of design of the data structures in an info system

22
Q

A conceptual data model

A

a high level big picture representation of the data structure in an info system

23
Q

A logical data model

A

more detailed representation of the data structures in an info system at the level of the data itself, providing more detail than a conceptual model.

24
Q

A physical data model

A

most detailed representation. Specify how the data will be stored in the database.

25
Q

Fact Table

A

contain measures or metrics, which are referred to as facts. these facts measure the business, such as sales, cost of goods, or profit. Do not contain descriptive elements about the business but do contain foreign keys

26
Q

Dimension Table

A

Contain descriptive or contextual data for measures such as dates, product names and customer names

27
Q

Star Schema

A

Most common schema for dimensional modeling and the simplest. Fact table in the center and the dimension tables arranged around it

28
Q

Snowflake Schema

A

Dimension tables further normalized. The dimension tables are broken down to related tables, rather than a single table. More complex and flexible. A balance between a normalized schema and a star schema