Lecture 7 Flashcards

Question 1

Q

What are the two types of scaling?

Answer

A

Vertical and Horizontal scaling

Question 2

Q

Horizontal scaling

Answer

A

Adding more servers to your application to spread the load.

Can make use of on-demand cloud server architectures.
Facilitates redundancy - having each layer running on multiple servers means that if any single machine fails, your application keeps running.
Requires more complicated software and architecture.
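
Sketch (not from the lecture): one way to picture "spreading the load" and the redundancy benefit is a round-robin pool that skips failed machines. The class `RoundRobinPool` and the server names are hypothetical, chosen only for illustration; real load balancers do considerably more.

```python
from itertools import cycle


class RoundRobinPool:
    """Hypothetical pool that spreads requests over several servers."""

    def __init__(self, servers):
        self.servers = list(servers)
        self.healthy = set(self.servers)
        self._ring = cycle(self.servers)

    def mark_down(self, server):
        # Record that a machine has failed so it is no longer chosen.
        self.healthy.discard(server)

    def pick(self):
        # Try each server at most once per call, skipping failed ones.
        for _ in range(len(self.servers)):
            server = next(self._ring)
            if server in self.healthy:
                return server
        raise RuntimeError("no healthy servers left")


pool = RoundRobinPool(["app-1", "app-2", "app-3"])  # assumed server names
first_round = [pool.pick() for _ in range(3)]

pool.mark_down("app-2")  # a single machine fails
after_failure = [pool.pick() for _ in range(4)]  # requests are still served
```

The extra bookkeeping (health tracking, routing) is a small taste of the "more complicated software and architecture" that horizontal scaling requires.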

Question 3

Q

Vertical scaling

Answer

A

Add more RAM, processors, bandwidth, or storage to a machine.

Quick and easy way to get your application’s level of service back up to standard, but it will only get you so far.
Upgrading a single server beyond a certain level can become very expensive, often involves downtime, and comes with an upper limit.

Question 4

Q

CAP theorem meanings:

Answer

A

Consistency
Availability
Partition tolerance

Question 5

Q

Consistency

Answer

A

A consistent view of data on all nodes of the distributed system.

Question 6

Q

Availability

Answer

A

Demands that the system eventually answer every request, even in case of failures.

Question 7

Q

Partition tolerance

Answer

A

The system is resilient to message losses between nodes.

A partition is an arbitrary split between nodes of a system, resulting in complete message loss in between.

Question 8

Q

Partitioning (Definition)

Answer

A

Separating one table’s rows into multiple different tables.

Question 9

Q

Types of partitioning

Answer

A

Range based
Key based
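
Sketch (not from the lecture): the two types can be illustrated as functions that map a row to a partition number. The boundaries in `RANGE_BOUNDS`, the partition count, and both function names are assumptions made for this example; key-based partitioning is shown here as a simple modulo on a numeric key.

```python
from bisect import bisect_right

# Hypothetical boundaries on an order year.
# Partition 0: < 2020, 1: 2020-2021, 2: 2022-2023, 3: >= 2024
RANGE_BOUNDS = [2020, 2022, 2024]


def range_partition(year):
    """Range based: rows with neighbouring values share a partition."""
    return bisect_right(RANGE_BOUNDS, year)


def key_partition(customer_id, partitions=4):
    """Key based: the partition is computed from the key itself."""
    return customer_id % partitions


range_examples = {year: range_partition(year) for year in (2019, 2021, 2022, 2030)}
key_examples = {cid: key_partition(cid) for cid in (7, 8, 10)}
```

Range-based partitions keep neighbouring values together, which suits range queries; key-based partitions scatter neighbouring keys across partitions.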

Question 10

Q

Partitioning explanation

Answer

A

Partitions may be stored in different tablespaces, which can be on different storage tiers (RAM/SSD/HD).
Partitions can be compressed using different compression schemes.

Local indexes can be dropped for some partitions.
Table statistics can be frozen on some partitions, while being periodically refreshed on others.

Question 12

Q

Distributed Partitions are also known as...

Answer

A

Sharding

Question 13

Q

Distributed Partitions / Sharding

Answer

A

Storage and bandwidth constraints
Scale read and write capacity
Geolocation: Proximity, Privacy and data protection laws.
Higher Availability (losing a single shard vs losing all connection)

Question 14

Q

Distributed Partitions / Sharding

Answer

A

A sharding key is needed: the value used to determine which database to connect to.

Smaller reference tables may need to be replicated to all shards. A strategy is needed for how these tables can be modified and how changes are propagated to all shards.
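
Sketch (not from the lecture): a minimal in-memory picture of both points, using plain dictionaries as stand-ins for separate databases. The shard names, the choice of a user's region as the sharding key, and all function names are hypothetical.

```python
# Hypothetical shard map: each entry stands in for a separate database
# holding a sharded "users" table and a replicated "countries" table.
SHARDS = {
    "eu": {"users": {}, "countries": {}},
    "us": {"users": {}, "countries": {}},
}


def shard_for(region):
    """Use the sharding key (assumed here: the user's region) to pick a database."""
    return SHARDS[region]


def save_user(user_id, region, name):
    # The row is written to exactly one shard.
    shard_for(region)["users"][user_id] = name


def update_reference(code, country_name):
    """Propagate a reference-table change to every shard."""
    for shard in SHARDS.values():
        shard["countries"][code] = country_name


save_user(1, "eu", "Ada")
save_user(2, "us", "Grace")
update_reference("DE", "Germany")
```

In a real system the loop in `update_reference` is the hard part: it has to cope with a shard being unreachable halfway through, which is why a propagation strategy is needed.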

Question 15

Q

Distributed DBMS - Parallel Database

Answer

A

Nodes are physically close to each other.
Nodes are connected via high-speed LAN.
The communication cost between nodes is assumed to be small. As such, one does not need to worry about nodes crashing or packets getting dropped when designing internal protocols.

Question 16

Q

Distributed DBMS - Distributed Database

Answer

A

Nodes can be far from each other.
Nodes are potentially connected via a public network, which can be slow and unreliable.
The communication cost and connection problems cannot be ignored.

Question 17

Q

NoSQL Hashed Sharding

Answer

A

Since records have no relations and cannot be joined, there are no transactions.
We can use a hash function to distribute the data, which spreads it evenly across the cluster.
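
Sketch (not from the lecture): hashing a record key to choose a shard. The shard count, the key format `user:<n>`, and the function name `shard_of` are assumptions. SHA-256 from `hashlib` is used because Python's built-in `hash()` for strings is randomized per process, so it would not give a stable placement.

```python
import hashlib
from collections import Counter


def shard_of(key, num_shards=4):
    """Map a record key to a shard number using a stable hash."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    # Interpret the first 8 bytes as an integer, then reduce to a shard index.
    return int.from_bytes(digest[:8], "big") % num_shards


# Count how many of 10,000 hypothetical keys land on each shard.
counts = Counter(shard_of(f"user:{i}") for i in range(10_000))
```

With a good hash function, each shard should receive roughly a quarter of the keys. Note that a plain modulo like this moves most keys when `num_shards` changes.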