05 - DB Sys, DCntrs, Bus. Intel Flashcards

1
Q

Database

A

An organized collection of data.

2
Q

Database Management System

DBMS

A

A group of programs that manipulate the database and provide an interface between the database and the user of the database and other application programs.

Users -> Applications -> DBMS -> DB

3
Q

The Digital Universe

A
1.8 zettabytes

1.8 trillion gigabytes
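
The two figures are the same quantity in different units. A minimal check, assuming decimal (SI) prefixes where 1 zettabyte is 10^21 bytes and 1 gigabyte is 10^9 bytes (the variable names are illustrative only):

```python
# Assumption: SI (decimal) prefixes. Binary prefixes would give a slightly different figure.
BYTES_PER_ZETTABYTE = 10**21
BYTES_PER_GIGABYTE = 10**9

zettabytes_in_tenths = 18  # 1.8 ZB written in tenths so the integer arithmetic stays exact
gigabytes = zettabytes_in_tenths * BYTES_PER_ZETTABYTE // (10 * BYTES_PER_GIGABYTE)
trillions_of_gigabytes = gigabytes / 10**12

print(gigabytes, trillions_of_gigabytes)
```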

4
Q

Character

A

A basic building block of most information, consisting of upper and lower case letters, numeric digits and special symbols.

5
Q

Field

A

Typically a name, number, or combination of characters that describes an aspect of a business object or activity.

6
Q

Record

A

A collection of data fields all related to one object, activity, or individual.

7
Q

File

A

A collection of related records.

Databases are a collection of integrated files.

8
Q

Hierarchy of Data

A
Bits
Characters - Bytes
Fields - keyed and computed types
Records
Files
Database
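
One way to picture the hierarchy is to build it from the top down in code. This is only a sketch: the EmployeeRecord class, its field names, and the sample values are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class EmployeeRecord:
    """Record: a collection of fields about one employee (hypothetical layout)."""
    employee_number: int  # field that can serve as a key
    last_name: str        # field made up of characters
    hire_date: str        # field kept as text for simplicity


# File: a collection of related records.
employee_file = [
    EmployeeRecord(1001, "Nguyen", "2019-04-01"),
    EmployeeRecord(1002, "Okafor", "2021-09-15"),
]

# Database: a collection of integrated files.
database = {"employees": employee_file}

# Walk back down the hierarchy: record -> field -> character -> bits.
first_character = database["employees"][0].last_name[0]
bits = format(ord(first_character), "08b")
print(first_character, bits)
```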
9
Q

Data Scientist

A

Helps analyze what is stored in vast corporate databases.

10
Q

Back End Interaction

A

Entering metadata.

For example - entering survey responses.

11
Q

Entity

A

A general class of people, places or things for which data is collected, stored and maintained.

Employees
Inventory
Customers

Records contain the data items pertinent to an entity.

12
Q

Attributes

A

A characteristic of an entity.

Employee number
Last name
Hire date

The records contain fields to hold the data points for the attributes.

13
Q

Data Item

A

The specific value of an attribute. Found in the fields of the record describing an entity.

Data items are entered into fields.

14
Q

Key

A

A field or set of fields in a record that is used to identify a record.

15
Q

Primary Key

A

A field or set of fields that UNIQUELY identifies the record.

No two records can share a primary key.
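
The uniqueness rule can be seen in action with Python's built-in sqlite3 module. This is a sketch under assumptions: the employee table and its values are hypothetical, and other DBMSs report the violation in their own way.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee (employee_number INTEGER PRIMARY KEY, last_name TEXT)"
)
conn.execute("INSERT INTO employee VALUES (1001, 'Nguyen')")

# A second record with the same primary key value should be refused.
try:
    conn.execute("INSERT INTO employee VALUES (1001, 'Okafor')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

row_count = conn.execute("SELECT COUNT(*) FROM employee").fetchone()[0]
print(duplicate_rejected, row_count)
```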

16
Q

Traditional Approach to Data Management

A

Where each distinct operational system uses data files dedicated to that system.

A spreadsheet for each data set.

17
Q

Database Approach to data management

A

Where multiple information systems share a pool of related data.

Requires a DBMS so a record may only be manipulated by one application program at a time.

18
Q

Data Modeling Considerations

A

Content - what data should be collected and at what cost.

Access - what data should be provided to which users and when.

Logical structure - how data should be arranged so that it makes sense to users.

Physical organization - where data is physically located.

19
Q

Data Center

and

Modular Data Center

and

green Data Centers

A

A climate-controlled building or set of buildings that houses database servers and the systems that deliver mission-critical information and services.

Modular data centers, such as the HP EcoPOD, are built inside shipping containers. Example: a 700,000 sq ft modular data center in Northlake, IL, about the size of 16 football fields, holding 220 shipping containers.

North Carolina - home to data centers for Apple, Google and Facebook.

20
Q

De-duplication

A

Eliminating undesired data redundancy.

Only about 1/3 of information is secure.
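
A minimal sketch of one common de-duplication idea: store each distinct piece of content once and keep references to it. The deduplicate function and the sample blocks are hypothetical, and real storage systems work at a much larger scale with more safeguards.

```python
import hashlib


def deduplicate(blocks):
    """Keep one stored copy per distinct block plus a reference list."""
    store = {}       # digest -> the single stored copy
    references = []  # one digest per original block, in order
    for block in blocks:
        digest = hashlib.sha256(block).hexdigest()
        store.setdefault(digest, block)  # only the first copy is stored
        references.append(digest)
    return store, references


blocks = [b"invoice-2024", b"logo image bytes", b"invoice-2024"]
store, references = deduplicate(blocks)

# The original sequence can still be rebuilt from the references.
restored = [store[digest] for digest in references]
print(len(blocks), len(store))
```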

21
Q

Data Model

A

A diagram of data entities and their relationships.

Enterprise data modeling is done at the level of the entire enterprise.

22
Q

Entity-relationship (ER) diagrams are models that use basic graphical symbols to show the organization of and relationships between data.

A

Development of ER diagrams helps ensure that the logical structure of application programs is consistent with the data relationships in the database.

23
Q

Database Models

A

Flat Files (spreadsheets)
Hierarchical
Network Models
Relational

The relational model has become the most popular and is normally easier for managers to understand.

24
Q

Relational Database Model logic

A

All data elements are placed in two-dimensional tables, or relations. As long as they share at least one common element, these relations can be linked to output useful information.

25
Q

Relational Model

A

A database model that describes data in which all data elements are placed in two-dimensional tables called relations, which are the logical equivalent of files.

IBM DB2
Oracle - leader with over 1/2 of market
Sybase
MS SQL Server, MS Access and MySQL

26
Q

Manipulating Data

A

Basic database manipulations include:

Selecting
Projecting
Joining

27
Q

Domain

A

Allowable values for data attributes.
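
A domain can be enforced by the DBMS itself. The sketch below uses a CHECK constraint in SQLite through Python's sqlite3 module. The pay_type column and its two allowed values are assumptions chosen for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE employee (
        employee_number INTEGER PRIMARY KEY,
        pay_type TEXT CHECK (pay_type IN ('hourly', 'salaried'))
    )
    """
)
conn.execute("INSERT INTO employee VALUES (1, 'hourly')")  # inside the domain

# A value outside the domain should be refused.
try:
    conn.execute("INSERT INTO employee VALUES (2, 'weekly')")
    out_of_domain_rejected = False
except sqlite3.IntegrityError:
    out_of_domain_rejected = True

print(out_of_domain_rejected)
```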

28
Q

Selecting

A

Manipulating data to eliminate rows according to certain criteria.

Isolating a record in a table.

29
Q

Projecting

A

Manipulating data to eliminate columns in a table.

30
Q

Joining

A

Manipulating data to combine two or more tables.

31
Q

Linking

A

The ability to combine two or more tables through common data attributes to form a new table with only the unique data attributes.

Linking ability is a primary advantage of the relational database model.
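
Selecting, projecting and joining can be tried side by side in SQL. This is a sketch using Python's sqlite3 module. The department and employee tables, their columns, and the sample rows are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE department (dept_id INTEGER PRIMARY KEY, dept_name TEXT);
    CREATE TABLE employee (
        employee_number INTEGER PRIMARY KEY, last_name TEXT, dept_id INTEGER
    );
    INSERT INTO department VALUES (10, 'Accounting'), (20, 'Shipping');
    INSERT INTO employee VALUES
        (1001, 'Nguyen', 10), (1002, 'Okafor', 20), (1003, 'Silva', 10);
    """
)

# Selecting: eliminate rows that do not meet the criterion.
selected = conn.execute(
    "SELECT * FROM employee WHERE dept_id = 10 ORDER BY employee_number"
).fetchall()

# Projecting: eliminate columns.
projected = conn.execute(
    "SELECT last_name FROM employee ORDER BY employee_number"
).fetchall()

# Joining: link the two tables through the attribute they share (dept_id).
joined = conn.execute(
    """
    SELECT e.last_name, d.dept_name
    FROM employee AS e
    JOIN department AS d ON e.dept_id = d.dept_id
    ORDER BY e.employee_number
    """
).fetchall()

print(selected, projected, joined, sep="\n")
```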

32
Q

Data Cleanup

A

The process of looking for and fixing inconsistencies to ensure that data is accurate and complete.

Valuable data - accurate, complete, economical, flexible, reliable, relevant, simple, timely, verifiable, accessible and secure. Cleanup along with proper design helps develop data with these characteristics.
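
A small cleanup pass might look like the sketch below. The clean function, the column names, and the rules (trim spaces, standardize capitalization, set aside incomplete rows, drop exact repeats) are assumptions for illustration, not a complete cleanup procedure.

```python
def clean(rows):
    """Standardize formatting, set aside incomplete rows, drop exact repeats."""
    cleaned, incomplete, seen = [], [], set()
    for row in rows:
        fixed = {
            "customer": row["customer"].strip().title(),
            "state": row["state"].strip().upper(),
        }
        if not all(fixed.values()):      # a required value is missing
            incomplete.append(fixed)
            continue
        key = (fixed["customer"], fixed["state"])
        if key not in seen:              # keep only the first copy
            seen.add(key)
            cleaned.append(fixed)
    return cleaned, incomplete


raw = [
    {"customer": " acme corp ", "state": "il"},
    {"customer": "Acme Corp", "state": "IL"},
    {"customer": "Globex", "state": ""},
]
cleaned, incomplete = clean(raw)
print(cleaned, incomplete)
```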

Database Normalization

33
Q

MS OneNote

A

Store of random notes that are accessible from other applications like word processors and spreadsheets.

EverNote is a freeware alternative that can store photos, voice and handwritten notes.

34
Q

Database Types

A

Flat File
Some spreadsheet and word-processing apps; records are unrelated
MS OneNote, EverNote

Single User
MS Access and FileMaker Pro

Multiple User
Oracle, MS, Sybase and IBM
Some single-user solutions can be deployed for multi-user access over a network but usually have limitations.

35
Q

Schema

A

A description of the entire database. It is accessed by the DBMS to find where to access required data in relation to the other data.

Used to define the tables and other database features associated with a group of users.

A description that involves “telling” the DBMS the logical and physical structure of the data and the relationships among the data for each user.

36
Q

Data Definition Language

A

A collection of instructions and commands used to define and describe data and relationships in a specific database.

Used to enter and tie schemas together.

Describes logical access paths
File, area, record and set description are terms the DDL defines and uses.
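
In SQL-based systems, DDL commands look like the CREATE TABLE statements below. This is a sketch run through Python's sqlite3 module. The table and column names are hypothetical, and sqlite_master is SQLite's own catalog table, so other DBMSs expose their schema differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# DDL: define the data and the relationship between the two tables.
conn.executescript(
    """
    CREATE TABLE department (
        dept_id   INTEGER PRIMARY KEY,
        dept_name TEXT NOT NULL
    );
    CREATE TABLE employee (
        employee_number INTEGER PRIMARY KEY,
        last_name       TEXT NOT NULL,
        hire_date       TEXT,
        dept_id         INTEGER REFERENCES department(dept_id)
    );
    """
)

# The DBMS keeps the resulting definitions and can list them back.
tables = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
]
print(tables)
```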

37
Q

Data Dictionary

A

A detailed description of all the data used in the database.

Name, alias, value range, type (alpha or numeric), storage required, creator and user and access dates and information. Also who created, who is responsible and who can access the data. Also lists what reports can make use of the data.

Helps improve information reliability and reduces redundancy.
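
A very small data dictionary can be read straight from the DBMS. The sketch below uses SQLite's PRAGMA table_info, which covers only name, type, required flag and key flag. The employee table is hypothetical, and a full data dictionary records far more (owner, aliases, access rights, reports).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE employee (
        employee_number INTEGER PRIMARY KEY,
        last_name       TEXT NOT NULL,
        hire_date       TEXT
    )
    """
)

# PRAGMA table_info yields one row per column:
# (cid, name, type, notnull, dflt_value, pk)
dictionary = [
    {"name": name, "type": col_type, "required": bool(notnull), "key": bool(pk)}
    for _cid, name, col_type, notnull, _default, pk in conn.execute(
        "PRAGMA table_info(employee)"
    )
]
for entry in dictionary:
    print(entry)
```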

38
Q

Logical Data Access Path

LAP

A

The path applications use to locate data when accessing a DBMS.

39
Q

Physical Data Access Path

PAP

A

The path DBMS uses to locate data on a storage device.

40
Q

Concurrency Control

A

A method of dealing with a situation in which two or more users or applications need to access the same record at the same time.
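
Locking is one common form of concurrency control. The sketch below imitates it with Python threads, where a lock lets only one "application" change the shared record at a time. The record, the add_stock function, and the counts are hypothetical, and a real DBMS handles this internally with its own locks and transactions.

```python
import threading

record = {"on_hand": 0}         # stands in for one shared record
record_lock = threading.Lock()  # only one holder at a time


def add_stock(times):
    for _ in range(times):
        with record_lock:       # wait for exclusive access, then update
            record["on_hand"] += 1


threads = [threading.Thread(target=add_stock, args=(10_000,)) for _ in range(4)]
for thread in threads:
    thread.start()
for thread in threads:
    thread.join()

print(record["on_hand"])
```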

41
Q

Query by Example (QBE)

A

A visual approach to developing database queries or requests. This feature provides a menu and graphical method to perform database manipulation and report setup.

42
Q

Data Manipulation Language

DML

A

A specific language, provided with a DBMS, which allows users to access and modify the data, to make queries and to generate reports.

In the 1970s, D. D. Chamberlin and others at IBM developed Structured Query Language (SQL). ANSI adopted it in 1986.
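
The basic SQL data manipulation statements are INSERT, UPDATE, DELETE and SELECT. A minimal sketch using Python's sqlite3 module follows. The product table and its rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE product (sku TEXT PRIMARY KEY, name TEXT, price REAL)")

# Add data.
conn.executemany(
    "INSERT INTO product VALUES (?, ?, ?)",
    [("A-1", "Widget", 10.0), ("B-2", "Gadget", 20.0), ("C-3", "Gizmo", 5.0)],
)

# Modify and remove data.
conn.execute("UPDATE product SET price = ? WHERE sku = ?", (12.5, "A-1"))
conn.execute("DELETE FROM product WHERE sku = ?", ("C-3",))

# Query the data for a simple report.
report = conn.execute("SELECT sku, price FROM product ORDER BY sku").fetchall()
print(report)
```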

43
Q

Structured Query Language (SQL)

A

Industry-leading data manipulation language.

44
Q

Database Admin (DBA)

A

A position that typically requires a degree in computer science or IS plus work experience. Helps users decide on the optimal design and attributes for the desired entities.

45
Q

Data Administrator

A

A nontechnical position responsible for defining and implementing consistent principles for a wide variety of data issues.

Sets standards for consistent nomenclature, attribute meaning and security. Usually a high level position.

In larger firms, DBAs report to the DA. The DA reports to the CIO or CTO.

46
Q

Open Source DB

A
PostgreSQL
MySQL
CouchDB (Couchbase)
Apache Hadoop
- can manage unstructured and relational DBs
47
Q

Database as a Service (DaaS)

Database 2.0

A

When the database and data are stored on equipment and managed off site.

Emerging solutions.
Administration is provided by service provider.
Database is stored at the provider's site.

48
Q

DB Virtualization

A

Uses virtual servers and operating systems to allow two or more database systems, including servers and DBMSs, to act like a single, unified DB system.

Allows more efficient use of computing resources, reduces costs and provides better access to critical information.

49
Q

Special-Purpose Databases

A

Offer ability to store forms of data such as music and images that do not fit well in conventional tables.

NoSQL Not Only SQL
Examples:
Hadoop
Cassandra
Hypertable
50
Q

Front End Interaction

A

Data query activity based on key word search through the front end application.

DBMS can act as Front End or Back End applications.

51
Q

Back End Application

A

Indirectly interacts with users. Frequently the database that feeds information to the front end application.

DBMS can act as Front End or Back End applications.

52
Q

Applications

A

Database Applications manipulate the content of a database to produce useful information.

Common manipulations:
Searching, filtering, synthesizing and assimilating data.

53
Q

Big Data

A

Large amounts of unstructured data (of various types) that are difficult or impossible to capture, store and manipulate using traditional database management systems.

Hadoop - open source
Oracle - Big Data Appliance
SAS
IBM InfoSphere BigInsights based on Hadoop
IBM BigSheets
54
Q

Semantic Web

A

Developing a seamless integration of a database with the Internet.

A Semantic Web captures metadata with all Web content using technology called the Resource Description Framework (RDF)

This has helped the entire Web develop into a giant database.

Heightened by the increasing use of smartphones and tablet computers to connect to DBs.

55
Q

Data Warehouse

A

A large database that collects business information from many sources in the enterprise (relational databases, flat files, spreadsheets), covering all aspects of the company’s processes, products and customers in support of management decision making.

Oracle’s warehouse management can accept RFID signals as data.

56
Q

Data Mart

A

A subset of a data warehouse that is used by small to medium sized businesses and departments within large corporations to support decision making.

57
Q

Data Mining

A

An information-analysis tool that involves the automated discovery of patterns and relationships in a data warehouse.

Methods and tools to support bottom-up, discovery driven analysis. Requires no assumptions but identifies facts and conclusions based on patterns discovered.

58
Q

Predictive Analysis

aka - Business Analytics

A

A form of data mining that combines historical data with assumptions about future conditions to predict outcomes of events, such as future product sales or the probability that a customer will default on a loan.

Used to upgrade occasional customers into frequent purchasers.

Used to predict future sales up to a year in the future.

59
Q

Business Intelligence (BI)

A

The process of gathering enough of the right information in a timely manner and usable form, and analyzing it so that it has a positive impact on business strategy, tactics or operations.

Gathering via Data Mining

60
Q

Competitive Intelligence

A

One aspect of business intelligence limited to information about competitors and the ways that knowledge affects strategy, tactics and operations.

All legal tactics to create whole picture from bits of freely available information.

61
Q

Counterintelligence

A

The steps an organization takes to protect information sought by “hostile” intelligence gatherers.

Define and manage “Trade Secret” intelligence assets.

62
Q

Online Analytical Processing (OLAP)

A

Software that allows users to explore data from a number of perspectives.

The tools that support top-down, query driven analysis. Requires repetitive testing of user-originated theories.
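
Exploring the same data from several perspectives amounts to summarizing it along different dimensions. A minimal sketch follows. The summarize function, the dimension names, and the sales figures are made up for illustration, and real OLAP tools work on far larger multidimensional data sets.

```python
from collections import defaultdict

# Hypothetical sales facts with three dimensions and one measure.
sales = [
    {"region": "East", "product": "Widget", "quarter": "Q1", "amount": 100},
    {"region": "East", "product": "Gadget", "quarter": "Q2", "amount": 150},
    {"region": "West", "product": "Widget", "quarter": "Q1", "amount": 200},
    {"region": "West", "product": "Widget", "quarter": "Q2", "amount": 50},
]


def summarize(rows, dimension):
    """Total the amount along one chosen dimension."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[dimension]] += row["amount"]
    return dict(totals)


# The same data viewed from three perspectives.
by_region = summarize(sales, "region")
by_product = summarize(sales, "product")
by_quarter = summarize(sales, "quarter")
print(by_region, by_product, by_quarter, sep="\n")
```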

63
Q

Data Loss Prevention (DLP)

A

Systems designed to lock down, identify, monitor and protect data within an organization.

Supports counterintelligence efforts.

A necessity in complying with government regulations that require companies to safeguard private customer data.

64
Q

Distributed Database

A

A database in which the data can be spread across several smaller databases connected through telecommunications devices.

65
Q

Replicated Database

A

A DB that holds a duplicate set of frequently used data.

Changes made in satellite DBs are written back to the master DB through data synchronization.

66
Q

Object-Oriented DB

Method - a procedure to compute some example.

Message - a request to execute or run a method.

A

A DB that stores both data and its processing instructions.

Object-Oriented DB Management System (OODBMS): A group of programs that manipulate an object-oriented DB and provide a user interface and connections to other application programs.

Object-Relational DB Management System (ORDBMS): A DBMS capable of manipulating audio, video and graphical data. Allows 3rd parties to add data.

67
Q

Spatial Data Technology

A

Using a DB to store and access data according to the locations it describes and permit spatial queries and analysis.
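
A simple spatial query asks which stored locations fall inside a given area. The sketch below uses a rectangular bounding box. The store names, the coordinates, and the within_box function are hypothetical, and real spatial databases use dedicated indexes and geometry types.

```python
# Hypothetical store locations (latitude and longitude in decimal degrees).
stores = [
    {"name": "Downtown", "lat": 41.88, "lon": -87.63},
    {"name": "Airport", "lat": 41.98, "lon": -87.90},
    {"name": "Suburb", "lat": 42.30, "lon": -88.20},
]


def within_box(points, south, west, north, east):
    """Return the names of points that lie inside the bounding box."""
    return [
        point["name"]
        for point in points
        if south <= point["lat"] <= north and west <= point["lon"] <= east
    ]


nearby = within_box(stores, south=41.80, west=-88.00, north=42.00, east=-87.50)
print(nearby)
```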

68
Q

Tools that database designers use to show the logical relationships among data:

A

Data Models

Enterprise Data Modeling

Entity-Relationship (ER) Diagrams

Relational Models