Chapter Eight Flashcards

(42 cards)

1
Q

What are the advantages of partitioning in a database?

A
  • Efficiency: records used together are grouped together
  • Local optimization: each partition can be optimized for performance
  • Security: data not relevant to users are segregated
  • Recovery and uptime: smaller files take less back up time
  • Load balancing: partitions stored on different disks, reduces contention

Advantages of partitioning help in managing large datasets effectively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the disadvantages of partitioning in a database?

A
  • Inconsistent access speed: slow retrievals across partitions
  • Complexity: non-transparent partitioning
  • Extra space or update time: duplicate data; access from multiple partitions

Disadvantages highlight the potential challenges in managing partitioned databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define horizontal partitioning in a database.

A

Distributing the rows of a logical relation into several separate tables

Useful for situations where different users need access to different rows.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the three types of horizontal partitioning?

A
  • Key Range Partitioning
  • Hash Partitioning
  • Composite Partitioning

Each type serves different use cases depending on data distribution needs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define vertical partitioning in a database.

A

Distributing the columns of a logical relation into several separate physical tables

Useful for situations where different users need access to different columns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is denormalization in database design?

A

Transforming normalized relations into non-normalized physical record specifications

While it can improve performance, it may introduce data integrity issues.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

List the benefits of denormalization.

A
  • Can improve performance (speed) by reducing number of table lookups
  • Reduce number of necessary join queries

Benefits must be weighed against potential costs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the costs associated with denormalization?

A
  • Wasted storage space
  • Data integrity/consistency threats

These costs highlight the trade-offs involved in denormalization.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a physical file in database design?

A

A named portion of secondary memory allocated for the purpose of storing physical records

Essential for managing data storage in relational databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the components of a tablespace in a database?

A
  • Segment: a table, index, or partition
  • Extent: contiguous section of disk space
  • Data block: smallest unit of storage

Understanding these components is crucial for effective database management.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the types of file organizations?

A
  • Heap: no particular order
  • Sequential
  • Indexed
  • Hashed

Each type of organization has its own use cases and performance implications.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What factors should be considered when selecting file organization?

A
  • Fast data retrieval and throughput
  • Efficient storage space utilization
  • Protection from failure and data loss
  • Minimizing need for reorganization
  • Accommodating growth
  • Security from unauthorized use

These factors are essential for optimizing database performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the purpose of clustering files in a database?

A

To improve performance of join operations by storing related records from different tables together in the same disk area

Clustering can enhance efficiency in data retrieval.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define indexed file organizations.

A

Storage of records sequentially or nonsequentially with an index that allows software to locate individual records

Indexes are crucial for fast data access.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the purpose of a query optimizer?

A

To optimize performance by selecting the most efficient query execution plan

Essential for improving database response times.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the common data types in databases?

A
  • VARCHAR2: variable-length character data
  • CHAR: fixed-length character data
  • NUMBER: positive or negative numbers
  • DATE: date representation
  • TIMESTAMP: date with fractional seconds
  • BLOB: binary large objects

Understanding data types is fundamental for effective database design.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What does the Sarbanes-Oxley Act aim to achieve?

A

To protect investors by improving accuracy and reliability of financial reporting

Compliance with SOX is crucial for organizations handling financial data.

18
Q

What are common authentication schemes in database security?

A
  • Passwords
  • Two-factor authentication
  • Three-factor authentication

Strong authentication measures are vital for protecting sensitive data.

19
Q

What are the recovery facilities in database management?

A
  • Backup Facilities
  • Journalizing Facilities
  • Checkpoint Facility
  • Recovery Manager

These facilities are crucial for data integrity and recovery after loss.

20
Q

What is the role of a recovery manager in a database?

A

To restore the database to a correct condition when a failure occurs and then resume processing user requests

This ensures minimal downtime and data loss.

21
Q

When should indexes be used in a database?

A
  • On larger tables
  • For primary keys
  • For frequently searched fields
  • On fields in ORDER BY and GROUP BY commands

Proper indexing can significantly improve query performance.

22
Q

What are the downsides of cloud-based data management services?

A
  • Existing systems do not yet provide adaptive capacity
  • Lack of full consistency guarantees in distributed environments
  • Challenges in monitoring cloud providers’ SLA commitments

These downsides highlight the limitations of current cloud solutions.

23
Q

What is the purpose of clustering in databases?

A

Improves performance of join operations by storing related records together

Clustering stores primary key records of the main table adjacent to associated foreign key records of the dependent table.

24
Q

What command is used in Oracle to create a cluster?

A

CREATE CLUSTER

This command facilitates the clustering of tables in Oracle databases.

25
List some benefits of Cloud-based Data Management Services.
* No initial investments in hardware, physical facilities, and systems software * Lower need for internal expertise in database management * Better visibility of costs * Increased flexibility in capacity management * Easier exploration of new data management technologies * High availability, reliability, and security from mature cloud providers
26
What does Software-as-a-Service (SaaS) involve?
Software solutions/applications for non-computing activities ## Footnote SaaS delivers software over the internet, addressing specific business needs.
27
Define Database-as-a-Service (DBaaS).
Data management cloud platform service ## Footnote DBaaS allows users to access database services without the need to manage underlying infrastructure.
28
What does Infrastructure-as-a-Service (IaaS) provide?
Hardware and systems software resources ## Footnote IaaS enables users to rent IT infrastructure on a pay-as-you-go basis.
29
What is the role of Platform-as-a-Service (PaaS)?
Provides hardware and systems software resources for application development ## Footnote PaaS allows developers to build, deploy, and manage applications without worrying about the infrastructure.
30
What is a rollback in response to database failures?
Preferred method to revert database to a previous state ## Footnote Rollbacks are used for aborted transactions and incorrect data updates.
31
What is a rollforward in database recovery?
Returns transactions to the state just prior to an abort ## Footnote Rollforward is used when recovering from certain types of failures.
32
What should a disaster recovery plan include?
* Detailed written plan * Regular testing * Multidisciplinary team training * Off-site backup data center * Scheduled backup copies of databases
33
What is a data dictionary?
Documents data elements of a database ## Footnote A data dictionary is crucial for understanding the structure and elements of a database.
34
What is a system catalog?
A system-created database that describes all database objects ## Footnote The system catalog provides metadata about the database structure.
35
What does an information repository store?
Metadata describing data and data processing resources ## Footnote Information repositories help in managing and understanding data assets.
36
Fill in the blank: _______ is a method for organizing data in a database to optimize retrieval.
[File organization]
37
Describe the physical database design process.
Involves translating a database model into efficient structures, including denormalization ## Footnote The process ensures the database is optimized for performance and storage.
38
True or False: Cloud-based data management services require significant initial hardware investments.
False ## Footnote Cloud services eliminate the need for initial investments in hardware.
39
Heap (Unordered)
Records are stored in no particular order Fast for inserts, slow for search Best for small tables or when access patterns are unpredictable Frequent inserts No order, raw append
40
Sequential (Ordered)
Records are stored in sorted order based on a key Efficient for range queries and batch processing Slower for random access or frequent inserts/deletes Range queries Sorted rows by key
41
Indexed
Uses an index structure (like B+ trees) to point to record locations Fast for searching, updating, and range queries Common in most relational database Mixed operations Uses lookup table/index
42
Hashed
Records are placed using a hash function on a key Excellent for direct lookups Poor for range queries or unordered scans Fast lookups Bucketed, scrambled order