Distributed Databases Flashcards

1
Q

What is a Distributed
Database?

A

This is a Db system that is stored on several computers, interconnected through a network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are some features of a site?

A

Each site can process local transactions involving data only on that site

Each site has a local transaction manager

Each site has a transaction coordinator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Are global interactions possible?

A

Yes, Some sites may get involved in global transactions,
which access data from several sites.

Local transaction managers communicate to achieve this.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Examples

A

University of the West Indies
 Bursary - Financial information
 Personnel - Staff information
 Registry - Student information

Multinational
 HQ in Kingston
 Manufacturing in Trinidad
 Warehouse in Miami
 European HQ in London
 Each site keeps data on local employees, as well as data
relevant to the operation of the site.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why are distributed Databases Beneficial?

A

More natural for representing many real world organizations.

Local autonomy

Improved performance
- Speeds up query processing with the possibility of parallel computing
-Local Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Provide 3 benefits of Distrbuted databases:

A

Improved availability/reliability
-DB can continue to function even if some subsystems are down if we replicate data or hardware.

Security
-Avoid destruction of DB by replicating vital data.

Incremental growth
-Increasing size of DB
-Increasing operations
-Example: Include R&D facility.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What do we want from a distributed database?

A

To allow users to use the system as if it is not distributed.
No reliance on central master site as this causes bottle necks.

Continuous Operations to enforce reliability and availability.

Distributed Query Processing so that all data is available at a no. of different sites.

Distributed Transaction Management
- Same copy of a data item may be a a number of sites.

Transparency

Local Autonomy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is autonomy in distributed systems?

A

All operations at a site are controlled by that site.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the 3 different types of autonomy?

A

Design
Communication
Execution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe each type of autonomy.

A

Design
Individual DBs can use data models and transaction management techniques that they prefer.

Communication
Individual DBs can decide which information they want to make accessible to other sites

Execution
Individual DBs can decide how to execute transactions submitted to them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are some issues with Distributed database design.

A

How should DB and applications be placed across sites.
How should data dictionary be placed across sites.
Replication and partitioning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are issues and problems associated with distributed database systems?

A

Distributed database design

Distributed query processing (How to break down a query into series of data manipulation operations.)

Distributed concurrency control

Distributed deadlock control

Reliability

Heterogeneous databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe the Problem of Reliability with DDBs(Distributed Database Systems).

A

If site becomes inaccessible, how do you ensure that DBs at other sites remain consistent and up-to-date?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a heterogeneous database?

A

Sharing pre-existing DBs with different conceptual schemas.

Also called multidatabase systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is Replication?

A

Maintaining multiple copies of a part of a DB.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Fragmentation?

A

Storing different parts of a database on different sites.

17
Q

Advantages of Replication

A

Increased availability - If one site goes down, the data may be available from elsewhere.

Increased parallelism - More likely that the data needed is at the site that poses the query.

We may send parts of the same query to different sites.

18
Q

Disadvantages of Replication

A

Disadvantages
Increased storage space
Increased overhead on update.
Must ensure that all copies are consistent.

19
Q

Why fragment? Why not simply store different
complete tables at different sites?

A

Applications usually access only a subset of a relation.

Fragmentation gives possibility of increased parallelism.

20
Q

What are the 3 types of Fragmentation? Explain the different types.

A

Horizontal fragmentation - Assign different tuples to each fragment (e.g.,through a selection in the sense of relational algebra).

Vertical fragmentation - Assign different attributes to each fragment.
Must ensure re constructability.

Mixed Fragmentation - Mixture of the two.

21
Q

Problems with Fragmentation?

A

Increased response time

Data integrity might be harder to maintain

Issues with correctly finding where to place the various fragments.