Unit 11: Behind the Scenes: Databases and Information Systems Flashcards

1
Q

data warehouse

A

large-scale collection of data that contains and organizes in one place all the data form an organizations’ multiple databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

3 sources for data warehouses

A
  1. internal sources - sales, billing, inventory, and customer databases
  2. external sources - vendors and suppliers
  3. clickstream data - software used on company websites to capture info about each click user makes as they navigate through the site
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

time-variant data?

A

data that doesn’t all pertain to one period of time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

data staging?

A

an intermediate storage area used for data processing during the ETL proecess

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

ETL?

A

extract, transform, and load - the process fo formatting/cleansing data to enable data of different sources and types to comingle for analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

OLAP?

A

online analytical processing - software that provides standardized tools for viewing and manipulating data in a data warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

data mart?

A

a related set of data that is grouped together and separated tout form the main body of data in the data warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

data mining?

A

the process by which a great amount of data are analyzed and investigated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Hadoop?

A

an open-source platform that makes complex unsaturated data easier to manage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how Hadoop stores files and processes data

A
  1. file storage - break data into chunks and is then distributed across many servers to be stored
  2. data processing - uses MapReduce
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

MapReduce?

A

sends code to each of the servers storing the data and uses multiple processors to process its set of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

6 data-mining techniques

A
  1. anomaly detection - identify outliers
  2. association/affinity group - determines which data goes together
  3. classification - defines data classes to spot trends
  4. clustering - organize data into smaller subgroups
  5. estimation/regression - assign value to data based on certain criteria
  6. visualization - a visual representation of data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

information system?

A

a software-based solution used to gather and analyze data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

TPS?

A

transaction-processing system - an operational-level system that keeps track of everyday business transactions/activity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

real-time processing?

A

the database is updated while the transaction is taking place

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

OLTP?

A

online transaction processing - ensure that the data in the TPS is current

17
Q

Batch processing?

A

transaction data is accumulated until a certain point is read then several transactions are processed at once

18
Q

What is the ACID test?

A

a test to ensure transactions are processed accurately

19
Q

ACID?

A
  1. A - atomicity - all components of the transaction are treated as on or the transaction is not completed
  2. C - consistency - at the end of a transaction a new piece of data is completed or the transaction remains incomplete
  3. I - isolation - during process each transaction is tread separately from other transactions
  4. D - durability - complete transactions are saved by TPS - cannot be undone
20
Q

MIS?

A

management information system - provides timely and accurate info that enable managers to make critical business decisions

21
Q

What types of reports are generated by MISs? (3)

A
  1. detail report - list of transactions that occurred over a certain period of time
  2. summary report - provides a consolidated picture of detailed data
  3. exception report - shows conditions that are unusual or that need attention by the system users
22
Q

consolidation?

A

the act of unifying something

23
Q

DSS?

A

decision support system - type of business info system designed to help a manager develop solutions for specific problems

24
Q

model management system?

A

software that assists in building management models in a DSS

25
Q

BI?

A

business intelligence - the ability to improve business decision making with databases and other fact-based support systems

26
Q

business intelligence system?

A

a form of business info system that is used at the executive level

27
Q

ERP system?

A

enterprise resource planning system - a software system that accumulates all info relevant to running a business and makes it available to whom ever needs it

28
Q

knowledge-based system?

A

a system that provides intelligence that supplements the user’s own intellect

29
Q

expert system?

A

tries to replicate the decision making of a human expert in order to solve problems

30
Q

NLP system?

A

natural language processing system - enables users to communicate with computer systems using a natural spoken/written language instead of a programming language

31
Q

AI?

A

artificial intelligence - attempts to create computers that think like humans

32
Q

data redundancy?

A

Unnecessary duplication of data