Integration of Data Sources Flashcards

(7 cards)

1
Q

What does ETL stand for?

A

Extract, Transformation, Loading

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are some advantages of using flat files?

A

No overhead of maintaining data as DBMS does.
Sorting, merging, deleting, replacing and other data migration functions are much faster outside the DBMS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the disadvantages of using Flat Files?

A
  1. No concept of updating.
  2. Queries and random access lookups are not well supported by the system.
  3. Flat files cannot be indexed for fast lookups.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name 4 purposes flat files should be used for?

A
  1. Staging source data for safekeeping and recovery
  2. Sorting data
  3. Filtering
  4. Replacing Text Strings
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Log Scraping?

A

Involves taking a snapshot of the database redo log at a certain time and finds the transactions affecting the tables that ETL is interested in.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Log Sniffiing?

A

Involves pooling the redo log at small time granularity and capturing the transactions on the fly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the 4 basic steps needed for conceptual schema integration?

A
  1. Pre-integration analysis
  2. Comparison of schemas
  3. Conformation of schemas
  4. Merging and restructuring
How well did you know this?
1
Not at all
2
3
4
5
Perfectly