MapReduce & Yarn Flashcards

1
Q

JobTracker Description and responsibilities

A

Single master process coordinating all jobs on the cluster
- Assigns map and reduce tasks to TaskTrackers.
- Monitors job progress.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

TaskTrackers Description and responsibilities

A

Subordinate processes executing assigned tasks
runs tasks on a fixed number of map and reduce slots within a data node.
- Execute map and reduce tasks.
- Report progress to JobTracker.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

slot

A

A slot represents an ability to run one of these “Tasks”
(map/reduce tasks) individually at a point of time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

dynamic appraoch:

A

a job can request for
what it needs rather for an individual slot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Slots flexibility:

A

jobs never leave their original slots and cannot be
move to free slots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

limitations of classical MapReduce

A

Scalability
Resource utilization
Support of workloads different from MapReduce.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

JobTracker responsibilities

A

*Management of computational resources in the cluster
*Coordination of all tasks running on a cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

YARN Acronym FOR WHAT ?

A

Yet Another Resource Negotiator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Analytics Architecture

A

Edge node
Network switches
Data nodes
porprities :Not only SQL based
High scalability, availability, and flexibility
Compute and storage in the same box for reducing network latency
Right design for semi-structured and unstructured data
Data and Application are in the same machine (Data nodes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Computer cluster

A

a group of linked computers, working together
closely so that in many respects they form a single
computer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Computer cluster advantages

A

High availability : disponibilité élevée

Load balancing : redistribuer vers un autre ordinateur du
cluster

Remontée en charge

Flexibility

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Clustered file systems (CFS)

A

comprises nodes connected via a network
store data with redundancy

Store new data with replication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly