Advanced Databases COPY Flashcards

Question

What does disjoint mean in the context of fragmentation?

Answer 1

No tuple of the global relation appears in more than one fragment.

Answer 2

Every tuple in the original global relation must belong to at least one of the fragments.
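
The disjointness and completeness rules above can be checked mechanically. A minimal sketch, assuming horizontal fragments held as Python lists of hashable tuples; `check_fragmentation` and the sample `employees` data are hypothetical names used only for illustration:

```python
from collections import Counter

def check_fragmentation(global_relation, fragments):
    """Return (is_complete, is_disjoint) for a horizontal fragmentation."""
    # Count in how many fragments each tuple appears.
    counts = Counter(t for fragment in fragments for t in set(fragment))
    # Completeness: every global tuple is in at least one fragment.
    is_complete = all(counts[t] >= 1 for t in global_relation)
    # Disjointness: no tuple is in more than one fragment.
    is_disjoint = all(c <= 1 for c in counts.values())
    return is_complete, is_disjoint

# Illustrative data: (employee id, city)
employees = [(1, "London"), (2, "Paris"), (3, "London")]
london = [t for t in employees if t[1] == "London"]
paris = [t for t in employees if t[1] == "Paris"]

result_ok = check_fragmentation(employees, [london, paris])
result_overlap = check_fragmentation(employees, [london, paris + [(1, "London")]])
```

Here `result_ok` is `(True, True)`, while `result_overlap` is `(True, False)` because one tuple sits in two fragments.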

Answer 3

Goals include: * Reduce overall execution time * Balance I/O load evenly across nodes.

Answer 4

Factors include: * Number and capacity of available nodes * Query workload * Network topology and distance.

Answer 5

It deals with how a query formulated against a global, non-distributed database schema operates on fragments of data across different nodes.

Answer 6

Involves substituting localization programs for relations in the original query, potentially leading to expensive operations.

Answer 7

Techniques employed to optimize queries by reducing the amount of data transferred across the network.

Answer 8

* Reduction with selection * Reduction with join

Answer 9

Reduction with projection

Answer 10

Applied when a query includes a selection operation (WHERE), identifying irrelevant fragments to avoid accessing them.
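
A minimal sketch of this pruning, assuming each fragment is defined by a closed range on one attribute and the query's WHERE clause is also a range on that attribute. The fragment names and `relevant_fragments` are hypothetical:

```python
def relevant_fragments(fragment_ranges, query_low, query_high):
    """Keep only fragments whose [low, high] range overlaps the query range."""
    return [
        name
        for name, (low, high) in fragment_ranges.items()
        if low <= query_high and query_low <= high
    ]

# Illustrative fragmentation of an employee relation by id range.
fragment_ranges = {
    "emp_1": (0, 999),
    "emp_2": (1000, 1999),
    "emp_3": (2000, 2999),
}

# WHERE id BETWEEN 1200 AND 1500 only needs emp_2.
needed = relevant_fragments(fragment_ranges, 1200, 1500)
```

Fragments whose defining predicate contradicts the selection are never contacted.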

Answer 11

Optimizes join operations by identifying pairs of fragments whose join will be empty and avoiding unnecessary joins.
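
A minimal sketch, assuming both relations are fragmented by range on the join attribute. A pair of fragments whose ranges do not overlap cannot produce any join result, so it is skipped. All names are hypothetical:

```python
from itertools import product

def non_empty_join_pairs(left_ranges, right_ranges):
    """Return fragment pairs whose join-attribute ranges overlap.

    Pairs that are left out are guaranteed to have an empty join.
    """
    pairs = []
    for (l_name, (l_lo, l_hi)), (r_name, (r_lo, r_hi)) in product(
        left_ranges.items(), right_ranges.items()
    ):
        if l_lo <= r_hi and r_lo <= l_hi:
            pairs.append((l_name, r_name))
    return pairs

emp = {"emp_1": (0, 999), "emp_2": (1000, 1999)}
asg = {"asg_1": (0, 999), "asg_2": (1000, 1999)}

# Only 2 of the 4 possible fragment joins need to run.
pairs = non_empty_join_pairs(emp, asg)
```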

Answer 12

Aims to reduce data processed and transferred by eliminating unnecessary fragments for a query involving a projection operation.

Answer 13

Moves only the part of one relation needed for the join.
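
This reads like a semijoin-style reduction. A minimal sketch under that assumption, with relation R at site A and relation S at site B; the function and the sample rows are hypothetical:

```python
def semijoin_transfer(r_rows, s_rows, r_key, s_key):
    """Join R (site A) with S (site B), shipping only what the join needs."""
    # Step 1: site A ships only the join-attribute values of R to site B.
    join_values = {row[r_key] for row in r_rows}
    # Step 2: site B ships back only the S rows that match those values.
    s_reduced = [row for row in s_rows if row[s_key] in join_values]
    # Step 3: site A completes the join locally.
    joined = [
        {**r, **s} for r in r_rows for s in s_reduced if r[r_key] == s[s_key]
    ]
    return join_values, s_reduced, joined

departments = [{"dept_id": 1, "dept": "Sales"}, {"dept_id": 2, "dept": "HR"}]
staff = [
    {"emp": "Ann", "dept_id": 1},
    {"emp": "Bob", "dept_id": 3},
    {"emp": "Cy", "dept_id": 1},
]

join_values, s_reduced, joined = semijoin_transfer(
    departments, staff, "dept_id", "dept_id"
)
```

Bob's row never crosses the network because no department at site A matches it.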

Answer 14

Self-sufficient.

Answer 15

To reconstruct global relations using expressions derived from fragments.

Answer 16

Serialisability, the highest isolation level ## Footnote 2PL is essential for preserving isolation in distributed DBs.

Answer 17

* Centralised 2PL (C2PL) * Distributed 2PL (D2PL) ## Footnote C2PL uses a single site for lock management, while D2PL has lock managers at each site.

Answer 18

The transaction manager sends a lock request to the central lock manager ## Footnote The central lock manager decides whether to grant the lock.

Answer 19

Single point of failure and potential bottleneck ## Footnote Every lock request and release must go through the central lock manager.

Answer 20

To the local lock manager at each participant site ## Footnote Each participant site has its own lock manager.

Answer 21

It sends an 'end of operation' message back to the transaction manager ## Footnote This informs the transaction manager that the operation is complete.

Answer 22

Occurs when 2 or more transactions are waiting for each other to release a lock on an item ## Footnote Deadlocks can severely impact system performance.

Answer 23

* Concurrency * Hold * Wait ## Footnote These conditions must be met for a deadlock to occur.

Answer 24

A directed graph that represents the dependencies between transactions ## Footnote It helps in detecting deadlocks.

Answer 25

The presence of a cycle ## Footnote A cycle in the WFG signifies that transactions are waiting indefinitely.
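
A minimal sketch of cycle detection on a wait-for graph, assuming the graph is a dict that maps each transaction to the set of transactions it waits for. `has_deadlock` is a hypothetical name:

```python
def has_deadlock(wait_for):
    """Return True if the wait-for graph contains a cycle (depth-first search)."""
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on the current path, finished
    colour = {}

    def visit(node):
        colour[node] = GREY
        for nxt in wait_for.get(node, ()):
            state = colour.get(nxt, WHITE)
            if state == GREY:
                return True  # edge back into the current path: a cycle
            if state == WHITE and visit(nxt):
                return True
        colour[node] = BLACK
        return False

    return any(
        colour.get(t, WHITE) == WHITE and visit(t) for t in list(wait_for)
    )

cyclic = {"T1": {"T2"}, "T2": {"T3"}, "T3": {"T1"}}
acyclic = {"T1": {"T2"}, "T2": {"T3"}}
```

`has_deadlock(cyclic)` is `True`; `has_deadlock(acyclic)` is `False`.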

Answer 26

* Local WFG * Global WFG ## Footnote Local WFGs are per site, while Global WFGs consider all transactions.

Answer 27

Transaction pre-declaration ## Footnote Transactions declare all accessed data items in advance. The transaction manager (TM) acquires the locks only if all items are available.

Answer 28

Requiring transactions to always access resources in a predefined order ## Footnote This can be challenging in dynamic databases.

Answer 29

Using timestamps to decide which transaction to abort when a lock request is denied ## Footnote Rules like WAIT-DIE and WOUND-WAIT are examples of this approach.

Answer 30

An older transaction waits for a younger one, while a younger one is aborted if it requests a lock held by an older one ## Footnote This helps to manage transaction priorities.
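
The rule described here matches WAIT-DIE. A minimal sketch, assuming a smaller timestamp means an older transaction; WOUND-WAIT is included as its standard counterpart, and both function names are hypothetical:

```python
def wait_die(requester_ts, holder_ts):
    """WAIT-DIE: an older requester waits, a younger requester is aborted."""
    return "wait" if requester_ts < holder_ts else "abort requester"

def wound_wait(requester_ts, holder_ts):
    """WOUND-WAIT: an older requester aborts the holder, a younger one waits."""
    return "abort holder" if requester_ts < holder_ts else "wait"

# Timestamps: T_old started at 1, T_young started at 5.
old_asks_young = wait_die(1, 5)
young_asks_old = wait_die(5, 1)
```

In both schemes a transaction only ever waits in one age direction, so no wait cycle can form.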

Answer 31

One site acts as the deadlock detector, checking for cycles in the Global Wait-For Graph ## Footnote Each site sends its Local WFG to the central detector.
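
A minimal sketch of the central detector's job, assuming each site reports its Local WFG as a dict from a waiting transaction to the set of transactions it waits for. The detector merges them and reduces the graph; all names are hypothetical:

```python
def merge_local_wfgs(local_wfgs):
    """Union the edges of all Local WFGs into one Global WFG."""
    global_wfg = {}
    for local in local_wfgs:
        for waiter, holders in local.items():
            global_wfg.setdefault(waiter, set()).update(holders)
    return global_wfg

def deadlocked_transactions(wfg):
    """Repeatedly drop transactions that wait for nobody.

    Whatever remains is on a cycle or blocked behind one.
    """
    graph = {t: set(hs) for t, hs in wfg.items()}
    for hs in wfg.values():
        for h in hs:
            graph.setdefault(h, set())
    changed = True
    while changed:
        changed = False
        for t in [t for t, hs in graph.items() if not hs]:
            del graph[t]
            for hs in graph.values():
                hs.discard(t)
            changed = True
    return set(graph)

# Neither site sees a cycle locally, but the merged graph has one.
site_a = {"T1": {"T2"}}
site_b = {"T2": {"T1"}}
stuck = deadlocked_transactions(merge_local_wfgs([site_a, site_b]))
```

This shows why local detection alone is not enough: the deadlock only becomes visible in the Global WFG.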

Answer 32

Single point of failure ## Footnote If the deadlock detector fails, deadlocks may go undetected.

Answer 33

Organizing deadlock detectors in a hierarchy for monitoring ## Footnote Local detectors report to higher-level detectors.

Answer 34

Responsibility for deadlock detection is shared among sites ## Footnote This allows for a more robust detection mechanism.

Answer 35

Using timeouts to abort transactions waiting too long for a resource ## Footnote This method assumes that long wait times indicate a deadlock.

Answer 36

Replication is an extension of the fragmentation problem that involves creating multiple copies of data across different geographical locations.

Answer 37

* Latency reduction * Availability * Resilience * Performance

Answer 38

By storing copies of the data closer to users in different geographical areas.

Answer 39

It increases availability by having multiple copies; if one replica fails, data can be accessed from other replicas.

Answer 40

If a node containing a replica fails, transactions can be rerouted to other nodes with copies of the required data.

Answer 41

It balances the read workload across multiple replicas, reducing bottlenecks and increasing throughput.

Answer 42

Each replica has its own transaction management system.

Answer 43

All copies of an item have the same value after the execution of an update transaction.

Answer 44

All copies of an item will eventually have the same value after the execution of an update transaction.

Answer 45

Epsilon defines a bound on the allowed inconsistency, specifically the number of missing writes.

Answer 46

It allows reads as long as they are within a defined bound of time units.

Answer 47

It considers the average/combined temporal difference across multiple items accessed in the same transaction.

Answer 48

Mutual consistency means replicas have the same value, but does not ensure that updates occurred in a single, step-by-step order.

Answer 49

A type of data object that can be used in replicated systems to facilitate lazy distribution of updates without leading to conflicts.

Answer 50

* Associative * Commutative * Idempotent

Answer 51

Updates can be lazily distributed without immediate propagation, ensuring the final state is consistent regardless of update order.
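
A minimal sketch of one such data type, a grow-only counter, chosen here only as an illustration. Each replica increments its own slot, and `merge` takes the per-node maximum, which is associative, commutative, and idempotent. The function names are hypothetical:

```python
def increment(counter, node, amount=1):
    """Return a new counter state with this node's own slot increased."""
    updated = dict(counter)
    updated[node] = updated.get(node, 0) + amount
    return updated

def merge(a, b):
    """Combine two replica states by taking the per-node maximum."""
    return {n: max(a.get(n, 0), b.get(n, 0)) for n in a.keys() | b.keys()}

def value(counter):
    """The counter's value is the sum of all per-node slots."""
    return sum(counter.values())

# Three replicas update independently, then exchange state lazily.
a = increment({}, "A", 2)
b = increment({}, "B", 3)
c = increment({}, "C", 1)

merged_one_order = merge(merge(a, b), c)
merged_other_order = merge(c, merge(b, a))
```

Both merge orders give the same state with `value` 6, and re-merging a state that has already been seen changes nothing.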

Answer 52

It is practical for systems with network latency or intermittent connectivity.

Answer 53

The dynamic rate of participation of nodes within the network.

Answer 54

It makes fragmentation and replication of data more challenging.

Answer 55

Extreme autonomy with no control of the network topology.

Answer 56

The lack of a central overseer.

Answer 57

* Very resilient * Supports maximum autonomy

Answer 58

* Unpopular items are not replicated enough * Enormous communication cost

Answer 59

To manage data and queries.

Answer 60

The range of hash keys stored in a specific node.
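
A minimal sketch of how a key range can be assigned, assuming a ring layout where each node owns the keys between its predecessor's position and its own. The 16-bit ring size, the `Ring` class, and the node names are assumptions for illustration, not a specific DHT protocol:

```python
import bisect
import hashlib

def ring_hash(value):
    """Map a string to a position on a small 16-bit ring."""
    digest = hashlib.sha1(value.encode()).digest()
    return int.from_bytes(digest[:2], "big")

class Ring:
    def __init__(self, node_names):
        # Sort nodes by their position on the ring.
        self.points = sorted((ring_hash(n), n) for n in node_names)
        self.ids = [p for p, _ in self.points]

    def responsible_node(self, key):
        """First node at or after the key's position, wrapping around."""
        idx = bisect.bisect_left(self.ids, ring_hash(key)) % len(self.ids)
        return self.points[idx][1]

    def key_range(self, node_name):
        """(predecessor position, own position]: the keys this node stores."""
        idx = [n for _, n in self.points].index(node_name)
        return self.ids[idx - 1], self.ids[idx]

names = ["node-a", "node-b", "node-c"]
ring = Ring(names)
owner = ring.responsible_node("customer:42")
```

When a node joins or leaves, only the range next to it on the ring changes hands.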

Answer 61

It becomes expensive due to the need for synchronisation.

Answer 62

A network where nodes connect to virtual nodes to control topology.

Answer 63

* Tree * Hypercube * Ring

Answer 64

Each node maintains a routing table that stores, for each differing prefix, the address of one representative node.

Answer 65

Creation of multiple identities by an attacker to manipulate voting processes.

Answer 66

By shifting validation from identity count to computational effort.
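
A minimal sketch of such a computational puzzle, assuming the task is to find a nonce whose SHA-256 digest starts with a given number of zero hex digits. This is a toy illustration, not any particular blockchain's format; `mine` and `verify` are hypothetical names:

```python
import hashlib

def puzzle_digest(block_data, nonce):
    """Hash the block data together with a candidate nonce."""
    return hashlib.sha256(f"{block_data}:{nonce}".encode()).hexdigest()

def mine(block_data, difficulty):
    """Try nonces until the digest starts with `difficulty` zero hex digits."""
    prefix = "0" * difficulty
    nonce = 0
    while True:
        digest = puzzle_digest(block_data, nonce)
        if digest.startswith(prefix):
            return nonce, digest
        nonce += 1

def verify(block_data, nonce, difficulty):
    """Checking a solution costs one hash, however long mining took."""
    return puzzle_digest(block_data, nonce).startswith("0" * difficulty)

nonce, digest = mine("alice pays bob 5", 3)
```

Creating many identities gives an attacker no advantage here, because influence depends on hashing work rather than on the number of identities.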

Answer 67

Tuning the puzzle difficulty can affect transaction confirmation speed.

Answer 68

A system where participants stake their own economic value to become validators.

Answer 69

They lose their stake.

Answer 70

* Requires validators to reveal identity * Openly auditable transactions

Answer 71

Cryptography.

(101 cards)