Data Warehouse - Theory Flashcards

1
Q

What were the user benefits in the shift to cloud computing?

A

Ease of management (no configuration or backups needed.)

Ubiquity of access, but I need to connectivity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What were the advantages to vendors in the shift to cloud computing?

A

Software as a service allows faster application development (easier to make changes and improvements)

Improvements and fixes in the software are easier inside their data centers (instead of updating many millions of clients with peculiar hardware and software configurations)

The hardware deployment is restricted to a few well tested configurations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What caused the appearance of warehouse scale computers (WSCs)?

A

The trends toward server – side computing and widespread Internet services

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define data centers?

A

They are building where Multiple servers and communication units are co-located because of their common environmental requirements and physical security needs, and for ease of maintenance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the characteristics of a traditional data center?

A

The typically host a large number of relatively small or medium size applications

Each application is running on a dedicated hardware infrastructure that is decoupled and protected from other systems in the same facility

Applications tend not to communicate with each other

Those data centers host hardware and software for multiple organization units or even different companies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define the warehouse scale computing

A

They belong to a single organization, use a relatively homogeneous hardware and systems software platform, and share a common system management layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the characteristics of a warehouse computing?

A

The only smaller number of very large applications

The common resource management infrastructure allow significant deployment flexibility

The homogeneity off the hardware allow for a more economic purchase, since they are bought in batches, uniform, and patterned way of management, and a more relevant match with the characteristics of the application they run, what leads to a more cost efficient computing power consumption

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Does the WSC run small applications, locate traditional data center?

A

Yes, it does. These kind of applications rely on virtual machines (or containers), and they access large, common services for block or data bases storage, load balancing, and so on, fitting very well with the WSC model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why are there multiple data centers located far apart?

A

In order to reduce user latency and improve serving throughout

Also for disaster recovery in order to keep your service on in case something happens to one data center (redundancy)

There is also the geo political reason which is related to different laws regarding the data management in different countries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is and how it’s done the hierarchical approach to the data centers locations ?

A

The world is divided into geographic areas (GAs), the division is done by Geo political boundaries determined mainly by data residency.

The geographical areas are divided into regions (CRs), customers see regions as the finer grain discretization of the infrastructure. The main reasons of the division of areas into regions are for better service provision and for disaster prevention (redundancy).

Finally, the regions are divided into other liability zones (AZs).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the servers in a data center?

A

They are the main processing equipment they differ regarding the CPUs, RAM, local storage, accelerators, and form factor

They are like ordinary PCs, but with a form factor that allow to fit them into the shelves

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How are servers interconnected?

A

They are interconnected by hierarchy of networks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the three types of servers regarding to their physical format

A

The tower server: which are good for scalability and ease of upgrade, cost effective, cools easily. But in the other hand, consumes a lot of space, provide the basic level of performance and complicated cable management

The rack servers: The racks are used to store these rack servers. The server racks are measured in units (U) the advantage of using these racks is that it allows designers to stack up other electronic devices along with the servers

The blade servers: we are the smallest types of servers available at this time and are great for conserving space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the pros and cons of rack servers?

A

Pros: failure containment (Easy to identify, remove, and replace a malfunctioning server), simplified cable management, cost effective

Cons: power usage (needs of additional cooling system due to their high overall component density), maintenance (gets considerably tough with the increasing number of racks)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the pros and cons of blade servers?

A

Pros: size and form factor (hi space efficiency), cabling (Do not involve the tasks of setting up cabling), centralized management (comes with centralized management that allow administrators to easily monitor, configure and update all blades from a single interface), balancing, failover, scalability

Cons: expensive configuration and higher initial cost, vendor lock-in, cooling (special accommodations have to be arranged for the servers to ensure they don’t get overheated)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What did the WSC do in order to satisfy the growing compute needs for deep learning?

A

They deployed specialized accelerator hardware: GPUs, TPUs and FPGAs

17
Q

Least some of the graphical processing units characteristics

A

Data parallel computations, the same program is executed or many data elements in parallel

High level languages are required

18
Q

What are tensor processing unities?

A

TBUR custom built integrated circuit developed specifically for machine learning. They are used for training and inference. The first version of it is more inference focused, while the versions two through five focuses both on training and interference

19
Q

What are field – programmable gate arrays (FPGAs)?

A

They are programmable hardware devices, where you can customize its logic. Array of Logic gates that can be programmed in the field by the user of the device as opposed to the people who designed it.

20
Q

What are some FPGA usage examples?

A

Network acceleration: they can offload specific network processing tasks from CPUs

Security acceleration: encryption, description, and other security related tasks

Data analytics: can be used to accelerate specific algorithms used in data analytics workloads

Machine Learning: can be configured to implement specific machine learning algorithms efficiently

21
Q

What are the advantages and disadvantages of CPU?

A

Advantages: easy to be programmed and support any programming framework, fast design space exploration and run your applications

Disadvantages: suitable only for simple AI models that do not take long to train and for small models with small training set

22
Q

What are the advantages and disadvantages of GPU?

A

Advantages: ideal for applications in which data needs to be processed in parallel, like the pixels of images or videos

Disadvantages: programmed in languages like CUDA and openCL and therefore provide limited flexibility compared to CPU

23
Q

What are the advantages and disadvantages of TPU?

A

Advantages: very fast at performing dense vector and matrix computations in our specialized on running very fast program based on tesorflow

Disadvantages: for applications and models based on Tenser flow, lower flexibility compared to CPU and TPU

24
Q

What are the advantages and disadvantages of FPGA?

A

Advantages: higher performance, lower cost and lower power consumption compared to other options like CPU and GPU

Disadvantages: programs using openCL and high-level synthesis, limited flexibility compared to other platforms

25
Q

What are edge data centers?

A

They are smaller data centers designed to reduce the network round trip of request and response (closer to the user)