Base organizations Flashcards

(47 cards)

1
Q

Serial media

A

Physical storage device where records are stored one after another in sequence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Serial organization

A

Storing records without any specific ordering or location criteria.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Physical deletion

A

The record is removed and subsequent records are physically shifted to fill the gap.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Logical deletion

A

The record is marked as deleted, leaving a gap in its place.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Update on constant-sized record

A

The content of the record is modified in-place without changing its size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Update on non-consecutive record

A

If a sufficiently large gap exists elsewhere, the record is updated in that location.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Retrieval with identifying selective key

A

The system reads records sequentially until a matching record is found.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Distributed free space (DFS)

A

Reserved percentage of space in each bucket dedicated to updates, reducing the need to move records.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Gaps list

A

In-memory structure that tracks locations in the file with available space for insertions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Compacting

A

Maintenance process of physically reorganizing records to eliminate gaps and restore density.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Overflow

A

Occurs when a record cannot be inserted in its sorted position due to lack of space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Degeneration

A

Gradual growth of unsorted areas in a sequential organization due to repeated unsorted insertions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Extended binary search

A

A binary search followed by scanning forward and backward to retrieve all matching records.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Overflow management (consecutive organization)

A

Uses a dedicated unsorted area for handling overflowed records.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Overflow management (non-consecutive organization)

A

Uses techniques like rotations, bucket interleaving, and cell partitions to manage overflow.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Overflow management - rotations

A

Moves records from a full bucket into a neighboring bucket with available space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Overflow management - bucket interleaving

A

Reserves empty buckets during file creation for future overflow handling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Overflow management - cell partitions

A

Adds an empty bucket directly after an overflowed bucket and stores excess records there.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Direct addressing organization

A

Each record is stored at a bucket whose address directly corresponds to the key value.

20
Q

Absolute direct addressing

A

The key itself is used directly as the bucket address.

21
Q

Relative addressing

A

A bijective function maps the key to a unique bucket address.

22
Q

Transformation function

A

A non-reversible function that converts a key into a bucket address for hashing.

23
Q

Synonymous keys

A

Different keys that result in the same bucket address under a specific hash function.

24
Q

Homonymous keys

A

Identical key values that always map to the same bucket.

25
Addressing capability
The number of possible key values should be greater than or equal to the number of buckets to reduce collisions.
26
Collision
When two different records are assigned to the same bucket address.
27
Saturation
Overflow records are stored elsewhere within the same address space using an alternate strategy.
28
Address space
The total number of possible bucket addresses available for storing records.
29
Overflow area
A separate storage region used specifically for handling overflowed records.
30
Open addressing
Overflow handling technique that searches for a new available bucket within the same address space.
31
Progressive chained saturation
Overflow records are stored in another location with the original bucket pointing to them via a pointer.
32
Chained overflow area
An overflow bucket is created in a separate area and linked to the original bucket.
33
Independent overflow area
Overflow records are stored in a completely separate archive, with no pointer from the original bucket.
34
Static hashing
Hashing method where records are placed in buckets using a fixed hash function and a fixed number of buckets.
35
Rebound
A condition where an overflowed record cannot be inserted into its primary or subsequent buckets due to lack of space.
36
Global addressing space
The total number of possible addresses that can be generated by the hash function.
37
Extendable hashing
A dynamic hashing method where more bits of the hash are used as needed to expand the number of buckets.
38
Virtual hashing
Uses a directory to track which buckets are actually in use, avoiding physical storage of empty buckets.
39
Dynamic hashing
Hashing method where the directory structure grows like a tree to adapt to changes in data volume or distribution.
40
Cluster
File organization method that physically stores related records together using a shared clustering key.
41
Serial cluster
Cluster where records are grouped together but not sorted internally.
42
Sorted cluster
Cluster where records are sorted by a sorting key that is part of the clustering key.
43
Hashed cluster
Cluster where the clustering key is hashed to determine the storage location.
44
Indexed cluster
Cluster that uses indexes to locate groups of related records based on the clustering key.
45
Why use clusters
To reduce disk I/O by physically grouping related data, and to optimize grouped or joined queries.
46
Cluster identity
A set of records stored together that share the same value for the clustering key.
47
Disadvantages of automatic reordering
Requires a full file scan, may not align with business logic, and can poorly balance cost vs. performance.