AWS Data Pipeline | Getting Started Flashcards

1
Q

How do I install a Task Runner on my on-premise hosts?

Getting Started

AWS Data Pipeline | Analytics

A

You can install the Task Runner package on your on-premise hosts using the following steps:

Download the AWS Task Runner package.

Create a configuration file that includes your AWS credentials.

Start the Task Runner agent via the following command:

java -jar TaskRunner-1.0.jar –config ~/credentials.json –workerGroup=[myWorkerGroup]

When defining activities, set the activity to run on [myWorkerGroup] in order to dispatch them to the previously installed hosts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How can I get started with AWS Data Pipeline?

Getting Started

AWS Data Pipeline | Analytics

A

To get started with AWS Data Pipeline, simply visit the AWS Management Console and go to the AWS Data Pipeline tab. From there, you can create a pipeline using a simple graphical editor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What can I do with AWS Data Pipeline?

Getting Started

AWS Data Pipeline | Analytics

A

With AWS Data Pipeline, you can schedule and manage periodic data-processing jobs. You can use this to replace simple systems which are current managed by brittle, cron-based solutions, or you can use it to build complex, multi-stage data processing jobs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly