DATABASES 1 INF2603 CH 1 Flashcards

CHAPTER 1 AND INTRO

1
Q

what is data?

A

The word raw indicates that the facts have not yet been processed to reveal their meaning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is Information?

A

is the result of processing raw data to reveal its meaning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

knowledge (data)

A

-body of information and facts about a specific subject.
-familiarity, awareness, and understanding of information as it applies to an environment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data management

acronym(g,s,r)

A

proper generation, storage, and retrieval of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Data quality (6 characteristics )
acronym(A,R,C,T,U,U)

A

■ Accuracy: Is the data accurate and has it been obtained from a verifiable source?

■ Relevance: Is the data relevant to the organization?

■ Completeness: Is the required data being stored?

■ Timeliness: Is the data updated frequently to meet the business requirements?

■ Uniqueness: Is the data unique and without redundancy?

■ Unambiguous: Is the meaning of the data clear?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

explain Data governance.

A
  • a strategy or methodology defined by an organization to safeguard data quality.

-Each company makes its own policies and procedures for managing the availability, usability, quality, integrity, and security of data state

-who owns the data, is authorized to create, update, and delete new records in the database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Master Data Management (MDM)

A
  • technology of data governance strategy
  • consistent and accurate data
  • auditing, reporting, and compliance of data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

A DATABASE

A

is a shared, integrated computer structure that stores a collection of:

■ end-user data, or raw facts of interest to the end user

■ metadata, or data about data, through which the end-user data are integrated and managed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

metadata

A
  • provide a description of the data characteristics and the set of relationships that link the data found within the database.
    -which essentially encapsulates the different properties, history, origin, versions, and other information
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

DATABASE MANAGEMENT SYSTEM (DBMS)

A

is a collection of programs that manages the database structure and controls access to the data stored in the database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Advantages of DBMS 6

A

■ IMPROVED DATA SHARING.
■ BETTER DATA INTEGRATION.
■ MINIMISED DATA INCONSISTENCY.
■ IMPROVED DATA ACCESS.(queries)
■ IMPROVED DECISION MAKING.
■ INCREASED END-USER PRODUCTIVITY.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

QUERY (DBMS)

A

-is a specific request for data manipulation (for example, to read or update the data) issued to the DBMS.
- Simply put, a query is a question,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

AD HOC QUERY(DBMS)

A

is a spur-of-the-moment question

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

QUERY RESULT SET(DBMS)

A

-The DBMS sends back an answer to the application. For example, end users, when dealing with large amounts of sales data, might want quick answers to questions (ad hoc queries) such as:

  • What was the volume of sales by product during the past six months?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

5 ways in which data can be classified in DBMS

A
  1. NUMBER OF USERS SUPPORTED,
  2. WHERE THE DATA ARE LOCATED,
  3. THE TYPE OF DATA STORED
  4. THE INTENDED DATA USAGE
  5. DEGREE TO WHICH THE DATA ARE STRUCTURED.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

SINGLE-USER DATABASE

A

supports only one user at a time. if user A is using the database, users B and C must wait until user A is done.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

DESKTOP DATABASE.

A

A single-user database that runs on a personal computer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

MULTI-USER DATABASE

A

multiple users at the same time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

WORKGROUP DATABASE

A

When the multi-user database supports (usually fewer than 50) or a specific department within an organization,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

ENTERPRISE DATABASE

A

When the database is used by the entire organization and supports many users (more than 50, usually hundreds) across many departments,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

CENTRALISED DATABASE

A

a database that supports data located at a single site .

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

DISTRIBUTED DATABASE

A

supports data distributed across several different sites.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

The most popular way of classifying databases today?

A

how they will be used and on the time sensitivity of the information gathered from them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Other names for OPERATIONAL DATABASEs

A

ONLINE TRANSACTION PROCESSING (OLTP), TRANSACTIONAL OR PRODUCTION DATABASE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

OPERATIONAL DATABASE, also called an ONLINE TRANSACTION PROCESSING (OLTP), TRANSACTIONAL OR PRODUCTION DATABASE.

A

support a company’s daily operations, product or service sales, payments and supply purchases reflect critical day-to-day operations andmust be recorded accurately and immediately.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Typically, analytical databases comprise of which two main components?

A

-A DATA WAREHOUSE
-ONLINE ANALYTICAL PROCESSING (OLAP) front end.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

THE DATA WAREHOUSE

A

is a specialized database that stores data in a format optimized for decision support that contains historical data obtained from operational databases and other external sources.
-tactical and decision support

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

ONLINE ANALYTICAL PROCESSING(OLAP)

A

-is a set of tools that work together to provide an advanced data analysis environment for
- retrieving, processing, and modeling data from the data warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

UNSTRUCTURED DATA

A
  • data that exists in their original (raw) state, in the format in which they were collected.
    -Therefore, unstructured data exist in a format that does not lend itself to the processing that yields information.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

STRUCTURED DATA

A

are the result of formatting unstructured data to facilitate its storage and use, and the generation of information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Semi-structured data

A

have already been processed to some extent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

XML

A

Extensible Markup Language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Extensible Markup Language (XML)

A

is a special language used to represent and manipulate data elements in a textual format.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

An XML database

A

supports the storage and management of semi structured XML data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Data Massaging’ (Data Manipulation)

A

to extract information to formulate pricing decisions, sales forecasts, market positioning, etc. to make tactical or strategic decisions from Analytical databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

OPEN SOURCE

A

database system which allows users to build and modify a database of their choice, distribute the database, and improve the actual product

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

‘LAMP’

A

most popular open source software, namely: Linux, Apache Web server, MySQL DBMS and the Perl PHP/Python development languages.

38
Q

Social media

A

refers to Web and mobile technologies that enable ‘anywhere, anytime, always on’ human interactions.

39
Q

NoSQL database.

A

The term NoSQL (Not only SQL) is generally used to describe a new generation of database management systems that is not based on the traditional relational database model.

40
Q

DATABASE DESIGN

A

refers to the activities that focus on the design of the database structure that will be used to store and manage end-user data

41
Q

data

A

Raw facts, such as a telephone number, a birth date, a customer name and a year-to-date (YTD) sales value.
Data have little meaning unless they have been organized in some logical manner. The smallest piece of data that can be recognized by the computer is a single
character, such as the letter A, the number 5 or a symbol such as /.
-A single character requires 1 byte of computer storage.

42
Q

Field

A

A character or group of characters (alphabetic or numeric) that has a specific meaning. A field is used to define and store data.

43
Q

File

A

A collection of related records. For example, a file might contain data about vendors of ROBCOR
Company, or a file might contain the records for the students currently enrolled at Gigantic University

44
Q

record

A

A logically connected set of one or more fields that describes a person, place or thing. For example,
the fields that constitute a record for a customer named J. D. Rudd might consist of J. D. Rudd’s
name, address, phone number, date of birth, credit limit and unpaid balance.

45
Q

STRUCTURAL DEPENDENCE

A

that is, access to a file,is dependent on its structure.

46
Q

STRUCTURAL INDEPENDENCE

A

exists when it is possible to make changes in the file structure without affecting the application program’s ability to access the data.

47
Q

DATA DEPENDENCE

A

Because all data access programs are subject to change when any of the file’s data storage characteristics change (that is, changing the data type), the file system is said to exhibit

48
Q

DATA INDEPENDENCE

A

exists when it is possible to make changes in the data storage characteristics without affecting the application program’s ability to access the data.

49
Q

the practical significance of data dependence is the difference between ?

A

-LOGICAL DATA FORMAT (how the human being views the data) and the

-PHYSICAL DATA FORMAT (how the computer ‘sees’ the data)

50
Q

islands of information

A

The organisational structure promotes the storage of the same basic data in different locations. Database professionals use the term for such scattered data locations.
-As it is unlikely that data stored in different locations will always be updated consistently, the islands of information often contain different versions of the same data

51
Q

Data redundancy

A

exists when the same data are stored unnecessarily at different places.

52
Q

Uncontrolled data redundancy

A

DATA INCONSISTENCY.

POOR DATA SECURITY.
DATA ANOMALIES.

53
Q

DATA INCONSISTENCY.

A

Data inconsistency exists when different and conflicting versions of the same data appear in different places.

54
Q

POOR DATA SECURITY.

A

Having multiple copies of data increases the chances of a copy of the data being susceptible to unauthorised access

55
Q

DATA ANOMALIES.

A

The dictionary defines anomaly as ‘an abnormality’. Ideally, a field value change should be made in only a single place.

Update anomalies-u updating number in different fields

Insertion anomalies: adding errors

Deletion anomalies

56
Q

DATA INTEGRITY

A

is defined as the condition in which all of the data in the database are consistent with the real-world events and conditions.

57
Q

database system

A

refers to an organization of components that define and regulate the collection, storage, management, and use of data within a database environment.

58
Q

five major parts: database system

A

HARDWARE, SOFTWARE, PEOPLE, PROCEDURES, and DATA.

59
Q

Hardware.

A

-Hardware refers to all of the system’s physical devices
-example, computers (microcomputers, mainframes, workstations, and servers), storage devices, printers, network devices (hubs, switches, routers, and fiber optics), and other devices (automated teller machines, ID readers, etc.).

60
Q

■ SOFTWARE.

the DBMS itself, functions fully, three types of software are needed

A

-OPERATING SYSTEM SOFTWARE,
-DBMS SOFTWARE, AND
-APPLICATION PROGRAMS AND UTILITIES:

61
Q

OPERATING SYSTEM SOFTWARE DBMS

A

manages all hardware components and makes it possible for all other software to run on the computers. Examples of operating system software include Microsoft Windows, Linux, Mac OS, UNIX and MVS.

62
Q

what does DBMS SOFTWARE do?

A

manages the database within the database system. Some examples of DBMS software include Microsoft Access and SQL Server, Oracle Corporation’s Oracle and IBM’s DB2.

63
Q

APPLICATION PROGRAMS AND UTILITY SOFTWARE dbms

A

are used to access and manipulate data in the DBMS and to manage the computer environment in which data access and manipulation take place.

64
Q

graphical user interfaces (GUIs)

A

For example, all of the major DBMS vendors now provide graphical user interfaces (GUIs) to help create database structures, control database access, and monitor database operations.

65
Q

(GUIs)

A

graphical user interfaces

66
Q

PEOPLE (DBMS)

A

This component includes all users of the database system.

67
Q

five types of users can be identified in a database system:

A

-systems administrators
-database administrators,
-database designers,
-systems analysts and programmers,
-end users.

68
Q

Systems administrators

A

oversee the database system’s general operations.

69
Q

Database administrators, also known as DBAs,

A

manage the DBMS and ensure that the database is functioning properly

70
Q

Database designers

A

-Design the database structure.
-They are, in effect, the database architects.

71
Q

Systems analysts and programmers

A

-design and implement the application programs.

-They design and create the data entry screens, reports and procedures through which end users access and manipulate the database’s data.

72
Q

End users

A

are the people who use the application programs to run the organisation’s daily operations

73
Q

Procedures.

A

Procedures are the instructions and rules that govern the design and use of the database system.

74
Q

Data.

A

The word data covers the collection of facts stored in the database. Since data are the raw material from which information is generated,

75
Q

DBMS performs several important functions that guarantee the integrity and consistency of the data in the database.

There is 9

A
  1. data dictionary management,
  2. data storage management,
  3. data transformation and presentation,
  4. security management,
  5. multi-user access control,
  6. backup and recovery management
  7. data integrity management
  8. database access languages and application programming interfaces
  9. database communication interfaces
76
Q

Data dictionary management.

A

stores definitions of the data elements and their relationships (metadata) in a data dictionary.

uses the data dictionary to look up the required data component structures and relationships, no need to code

Changes stored no need to modify programs

provides data abstraction and it removes structural and data dependency from the system.

77
Q

Performance tuning

A

relates to the activities that make the database perform more efficiently in terms of storage and access speed.

78
Q

Data storage management.

A

The DBMS creates and manages the complex structures required for data storage, thus relieving you of the difficult task of defining and programming the physical data characteristics.

A modern DBMS system provides storage not only for the data, but also for related data entry forms or screen definitions, report definitions, data validation rules, procedural code, structures to handle video and picture formats, etc.

Data storage management is also important for database performance tuning

Although the user sees the database as a single data storage unit, the DBMS actually stores the database in multiple physical data files (see Figure 1.9).

Such data files may even be stored on different storage media.

Therefore, the DBMS doesn’t have to wait for one disk request to finish before the next one starts. In other words, the DBMS can fulfil database requests concurrently.

79
Q

Data transformation and presentation.

A

The DBMS transforms entered data to conform to required data structures.

The DBMS relieves you of the chore of making a distinction between the logical data format and the physical data format.

That is, the DBMS formats the physically retrieved data to make it conform to the user’s logical expectations.

For example, imagine an enterprise database used by a multinational company. An end user in South Africa would expect to enter data such as 11 July 2020 as ‘11/07/2020’. In contrast, the same date would be entered in the United States as ‘07/11/2020’. Regardless of the data presentation format, the DBMS must manage the date in the proper format for each country

80
Q

Security management.

A

The DBMS creates a security system that enforces user security and data privacy. Security rules determine which users can access the database, which data items each user can access and which data operations (read, add, delete or modify) the user can perform. This is especially important in multi-user database systems where many users access the database simultaneously. All database users may be authenticated to the DBMS through a username an

81
Q

Multi-user access control.

A

To provide data integrity and data consistency, the DBMS uses sophisticated algorithms to ensure that multiple users can access the database concurrently without compromising the integrity of the database. Chapter 12, Managing Transactions and Concurrency, covers the details of the multi-user access control.

82
Q

Backup and recovery management

A

The DBMS provides backup and data recovery to ensure data safety and integrity. Current DBMS systems provide special utilities that allow the DBA to perform routine and special backup and restore procedures. Recovery management deals with the recovery of the database after a failure, such as a bad sector in the disk or a power failure. Such capability is critical to preserving the database’s integrity. Appendix K, Database Administration, covers backup and recovery issues (see online platform).

83
Q

Data integrity management.

A

The DBMS promotes and enforces integrity rules, thus minimising data redundancy and maximising data consistency. The data relationships stored in the data dictionary are used to enforce data integrity. Ensuring data integrity is especially important in transactional database systems.

84
Q

Database access languages and application programming interfaces.

A

The DBMS provides data access through a query language.
A query language is a non-procedural language—one that lets the user specify what must be done without having to specify how it is to be done.

The DBMS also provides application programming interfaces to procedural languages such as COBOL, C, Java, Visual and C#. The DBMS also provides administrative utilities used by the DBA and the database designer to create, implement, monitor and maintain the database.

85
Q

query language

A

is a non-procedural language—one that lets the user specify what must be done without having to specify how it is to be done.

86
Q

Structured Query Language

A

SQL

87
Q

Procedural Language

A

In procedural languages, the program code is written as a sequence of instructions. User has to specify “what to do” and also “how to do” (step by step procedure). These instructions are executed in the sequential order. These instructions are written to solve specific problems.

88
Q

EXAMPLES OF Procedural LanguageS

A

FORTRAN, COBOL, ALGOL, BASIC, C and Pascal.

89
Q

Non-Procedural Language

A

In the non-procedural languages, the user has to specify only “what to do” and not “how to do”. It is also known as an applicative or functional language. It involves the development of the functions from other functions to construct more complex functions.

90
Q

EXAMPLES OF Non-Procedural Language

A

SQL, PROLOG, LISP.

91
Q

Database communication interfaces.

A

Current-generation DBMSs accept end-user requests via multiple, different network environments

For example, the DBMS might provide access to the database via the internet through the use of Web browsers such as Chrome, Firefox or Edge. In this environment, communications can be accomplished in several ways:

  • End users can generate answers to queries by filling in screen forms through their preferred Web browser.
  • The DBMS can automatically publish predefined reports on a website.
  • The DBMS can connect to third-party systems to distribute information via email or other productivity applications.
92
Q

PROBELMS AND COST INVOLVED DBMS

A

Increased costs. Database systems require sophisticated hardware and software and highly skilled
personnel. The cost of maintaining the hardware, software and personnel required to operate and
manage a database system can be substantial.
■ Management complexity. Database systems interface with many different technologies and have a
significant impact on a company’s resources and culture. The changes introduced by the adoption
of a database system must be properly managed to ensure that they help advance the company’s
objectives. Given the fact that database systems hold crucial company data that are accessed
from multiple sources, security issues must be assessed constantly.
■ System maintenance. To maximise the efficiency of the database system, you must keep your
system current. Therefore, you must perform frequent updates and apply the latest patches and
security measures to all components. Since database technology advances rapidly, personnel
training costs tend to be significant.
■ Vendor dependence. Given the heavy investment in technology and personnel training, companies
may be reluctant to change database vendors. As a consequence, vendors are less likely to offer
pricing point advantages to existing customers and those customers may be limited in their choice
of database system components.