Exam 1 Flashcards

To cover Data Fundamentals

1
Q

What are the steps to make data useful?

A

Data Acquisition, Data Modeling, Extraction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Name and Describe Data Provisioning

A

Data Provisioning is the process of providing users and systems with access to data. This includes the security authorizations to limit access to only those data which the user or system is officially permitted to view

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Replication

A

Data is copied from the source and transferred to the analysis system. This is done to keep the data intact. It is done in time or in batches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Structured Data

A

Structured data is computer readable and usable. EX(databases, spreadsheets, flat files) Specific data types. Metadata is data about the data (meaning, context, purpose)

Create, Read, Update and Delete (CRUD)
Tables: Columns and Rows

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Unstructured Data

A

Unstructured data does not conform to a data model and or has associated metadata such as pictures, audio, video, tweet, reviews

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Relational Databases

A

Relationships between tables
Each row has a unique id called a primary key.

Connect liked tables with the primary key in another table called a foreign key.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

CRUD Anomalies

A

Read - Does not create an anomaly
Create Anomalies by repeating data already stored, combining data, possibly creating unstructured data.
Update anomalies- storing the same data in many different places
Delete anomalies- delete a row if data which affects another tables data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe Normalization

A

Normalization is the process of decomposing a database into more tables until the database is not longer susceptible to anomalies
Most common forms of anomalies
-First normal form(1NF)
-Second Normal form (2NF)
-Third Normal form(3NF) (Industry standard)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

1NF

A

Each table cell should contain a singe value

Each record needs to be unique

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

2NF

A

Rule 1- Be in 1NF

Rule 2- Single Column primary key

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

3NF

A

Rule 1- Be in 2NF

Rule 2- Has no transitive functional dependencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a transitive functional dependencies

A

A transitive functional dependency is when changing a non-key column, might cause any of the other non-key columns to change

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are some examples of tagged data?

A

XML and HTML and JSON are examples of tagged data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is AI?

A

The Turing test is a test of a machines ability to exhibit intelligent behavior equivalent to, or extinguishable from that of a human

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define Natural Language processing

A

NLP translates human voice and language into computer readable text using programming languages.

Examples:

  • Speech recognition
  • Sentiment Analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Describe Transactional systems (TLP)

A

TLP store and process business data required for each of the business transaction cycles (OLTP)

  • Sales
  • Inventory

Designed to process transactions quickly, reliable and accurately

17
Q

OLTP

A

Three tiered architecture

  • User interface
  • Business Logic
  • Data Services
18
Q

Informatational Systems

A

Informational Systems provide a place for data to be stored and prepared for analytical purposes.

  • Data driven decisions
  • Read only
  • Data extracted from OLTP systems
  • Referred to as Online Analytical Processing(OLAP
19
Q

Compare OLTP and OLAP

A
OLTP
Level of Detail – Very Detailed
Updatable - Yes
Speed – Quick on writing
Current - Must
Requirements -  Must be know
OLAP
Level of Detail – summarized data
Updatable – Read Only
Speed – Quick on reading
Current – Depends on extract
Requirements - Ambiguous
20
Q

Compare OLTP and OLAP Continued

A

OLTP
Historical Data – not normally kept
Data – sometimes compartmentalized
Availability – Needs to be 100%

OLAP
Historical Data – kept for analysis
Information – shared normally across company
Availability – Needs to be 100%

21
Q

What is an inner join?

A

The INNER JOIN selects all rows from both participating tables as long as there is a match between the columns

22
Q

What is an outer join?

A

The SQL OUTER JOIN returns all rows from both the participating tables which satisfy the join condition along with rows which do not satisfy the join condition.

23
Q

Pivot Tables and Terms related

A
Dashboards
Aggregate
Conditional Formatting
Sorting
-ABC
-Numerical 
-Chronological

Filtering

  • Label
  • Value

Ranking

  • Top N or Bottom N
  • Top % or Bottom %
  • Combined

Calculations