Base organizations Flashcards by DeliBerus Liberta

Serial media

Physical storage device where records are stored one after another in sequence.

How well did you know this?

Not at all

Perfectly

Serial organization

Storing records without any specific ordering or location criteria.

How well did you know this?

Not at all

Perfectly

Physical deletion

The record is removed and subsequent records are physically shifted to fill the gap.

How well did you know this?

Not at all

Perfectly

Logical deletion

The record is marked as deleted, leaving a gap in its place.

How well did you know this?

Not at all

Perfectly

Update on constant-sized record

The content of the record is modified in-place without changing its size.

How well did you know this?

Not at all

Perfectly

Update on non-consecutive record

If a sufficiently large gap exists elsewhere, the record is updated in that location.

How well did you know this?

Not at all

Perfectly

Retrieval with identifying selective key

The system reads records sequentially until a matching record is found.

How well did you know this?

Not at all

Perfectly

Distributed free space (DFS)

Reserved percentage of space in each bucket dedicated to updates, reducing the need to move records.

How well did you know this?

Not at all

Perfectly

Gaps list

In-memory structure that tracks locations in the file with available space for insertions.

How well did you know this?

Not at all

Perfectly

Compacting

Maintenance process of physically reorganizing records to eliminate gaps and restore density.

How well did you know this?

Not at all

Perfectly

Overflow

Occurs when a record cannot be inserted in its sorted position due to lack of space.

How well did you know this?

Not at all

Perfectly

Degeneration

Gradual growth of unsorted areas in a sequential organization due to repeated unsorted insertions.

How well did you know this?

Not at all

Perfectly

Extended binary search

A binary search followed by scanning forward and backward to retrieve all matching records.

How well did you know this?

Not at all

Perfectly

Overflow management (consecutive organization)

Uses a dedicated unsorted area for handling overflowed records.

How well did you know this?

Not at all

Perfectly

Overflow management (non-consecutive organization)

Uses techniques like rotations, bucket interleaving, and cell partitions to manage overflow.

How well did you know this?

Not at all

Perfectly

Overflow management - rotations

Moves records from a full bucket into a neighboring bucket with available space.

How well did you know this?

Not at all

Perfectly

Overflow management - bucket interleaving

Reserves empty buckets during file creation for future overflow handling.

How well did you know this?

Not at all

Perfectly

Overflow management - cell partitions

Adds an empty bucket directly after an overflowed bucket and stores excess records there.

How well did you know this?

Not at all

Perfectly

Direct addressing organization

Study These Flashcards

Each record is stored at a bucket whose address directly corresponds to the key value.

Absolute direct addressing

Study These Flashcards

The key itself is used directly as the bucket address.

Relative addressing

Study These Flashcards

A bijective function maps the key to a unique bucket address.

Transformation function

Study These Flashcards

A non-reversible function that converts a key into a bucket address for hashing.

Synonymous keys

Study These Flashcards

Different keys that result in the same bucket address under a specific hash function.

Homonymous keys

Study These Flashcards

Identical key values that always map to the same bucket.

Addressing capability

The number of possible key values should be greater than or equal to the number of buckets to reduce collisions.

Collision

When two different records are assigned to the same bucket address.

Saturation

Overflow records are stored elsewhere within the same address space using an alternate strategy.

Address space

The total number of possible bucket addresses available for storing records.

Overflow area

A separate storage region used specifically for handling overflowed records.

Open addressing

Overflow handling technique that searches for a new available bucket within the same address space.

Progressive chained saturation

Overflow records are stored in another location with the original bucket pointing to them via a pointer.

Chained overflow area

An overflow bucket is created in a separate area and linked to the original bucket.

Independent overflow area

Overflow records are stored in a completely separate archive, with no pointer from the original bucket.

Static hashing

Hashing method where records are placed in buckets using a fixed hash function and a fixed number of buckets.

Rebound

A condition where an overflowed record cannot be inserted into its primary or subsequent buckets due to lack of space.

Global addressing space

The total number of possible addresses that can be generated by the hash function.

Extendable hashing

A dynamic hashing method where more bits of the hash are used as needed to expand the number of buckets.

Virtual hashing

Uses a directory to track which buckets are actually in use, avoiding physical storage of empty buckets.

Dynamic hashing

Hashing method where the directory structure grows like a tree to adapt to changes in data volume or distribution.

Cluster

File organization method that physically stores related records together using a shared clustering key.

Serial cluster

Cluster where records are grouped together but not sorted internally.

Sorted cluster

Cluster where records are sorted by a sorting key that is part of the clustering key.

Hashed cluster

Cluster where the clustering key is hashed to determine the storage location.

Indexed cluster

Cluster that uses indexes to locate groups of related records based on the clustering key.

Why use clusters

To reduce disk I/O by physically grouping related data, and to optimize grouped or joined queries.

Cluster identity

A set of records stored together that share the same value for the clustering key.

Disadvantages of automatic reordering

Requires a full file scan, may not align with business logic, and can poorly balance cost vs. performance.

Base organizations Flashcards

(47 cards)