Chapter 6 - Data Analytical Tools Flashcards

1
Q

List the 3 reasons R language is so popular?

A

1) it’s open source
2) the machine learning packages are free on CRAN, the R code repository
3) the tidyverse data-analyis packages simplify the R language make it easier for data analysts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Python has a specialist tool specifically for data analysts to use. What is it called and what kind of data structures does it work with?

A

PANDAS = Python Data Analysis Library.
It’s specifically for structured/tabular data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which of the following commands are Data Definition Language and which are Data Manipulation Language

CREATE
UPDATE
ALTER
DELETE
DROP
SELECT
INSERT

A

CREATE - DDL
UPDATE - DML
ALTER - DDL
DELETE - DML
DROP - DDL
SELECT - DML
INSERT - DML

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which 3 DDL commands does the book mention?

A

CREATE
ALTER
DROP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which 2 ways can tools interact with databases using SQL?

A

1) provide a GUI to either reconfigure the database or retrieve data from the database
2) directly via computer software.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which statistics packages created in the 60’s and 70’s are still widely in use today?

A

IBM SPSS
SAS

SPSS Statistics is a different product to SPSS Modeller!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Which are two statistics packages (or statistical analysis software) aren’t used as much today?

A

Minitab and Stata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

if you saw a machine learning tool interface that showed decision trees in the form of flowcharts, which tool would you be looking at?

A

IBM SPSS Modeler

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which machine learning tool offer pre-built analytics templates for common business scenarios and offers access to hundreds of different algorithms?

A

Rapid Miner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Farrow wants to perform analysis of data but it requires a bespoke solution, what out of the below methods should they go with?

Spreadsheets
IBM SPSS
Use Python or R?

A

Using Python or R will allow Farrow to build an analysis package that suites their specific needs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

List the analytics suites discussed in the book

A

AWS quicksight
business Objects
Domo
Datorama
IBM Cognos
Microstrategy
Power BI
Qlik
Tableau

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which analytics suite is specifically designed for Sales and Marketing functions of the business?

A

Datorama

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Which analytics suite from SAP allows companies to integrate it with other applications?

A

BusinessObjects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Raheela needs to create a bespoke analytics but she’s not very familiar with programming, which solution should she choose?

A

R

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

List the SQL Data Manipulation Commands

A

UPDATE
SELECT
INSERT
DELETE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What’s the key benefit machine learning software or packages bring to analysts of today?

A

They don’t require analysts to know programming languages or write their own scripts/algorithms

17
Q

What’s considered the most popular data visualization tool?

A

Tableau

18
Q

Which data visualization tool enables easy ingestion of data from a wide variety of sources?

A

Tableau

19
Q

Qlik X is the company’s original analytics platform and Qlik X is a more advanced platform

A

Qlik VIEW is the company’s original analytics platform and Qlik SENSE is a more advanced platform

20
Q

Regarding IBM Cognos, what’s the difference between the two modules below?
Query Studio
Report Studio

A

Query Studio provides access to data querying and basic reporting
Report Studio is for complex reporting needs

21
Q

Which ‘Studio’ module in IBM Cognos enables advanced modelling and analytics for large data sets?

A

Analysis Studio

22
Q

If you’re a business executive and want to create a scorecard to analyse key metrics across the business, which module would you use in IBM Cognos?

A

Metric Studio

23
Q

Which Power BI module allows developers to create paginated reports that are designed for printing and email?

A

Power BI Report Builder

think building reports to publish

24
Q

An organization wants to host it’s own Power BI capability, what offering would they purchase?

A

Power BI Report Server

25
Q

List the analytics suites begging with A and B

A

AWS Quicksite
Business Objects (SAP)

26
Q

List the analytics suites beginning with D

A

Domo
Datarama

27
Q

List the analytics suites beginning with C and P

A

Cognos
Power BI

28
Q

List the analytics suites beginning with Q and T

A

Qlik
Tableau

29
Q

Power BI Report Builder has a key function to use specifically when printing or emailing reports, what is it?

A

Pagination

30
Q

Which 2 products in the book are built especially for machine learning capabilities?

A

IBM SPSS Modeller
Rapid Miner

31
Q

Which name is given to a group of packages within R that is specifically design for data manipulation, analysis and visualization?

A

Tidy-verse

32
Q

Machine learning and the tidy-verse analytics packages are stored where for R?

A

in CRAN (Comprehensive R Archive Network)

33
Q

these provide an advanced statistical environment via GUI or bult-in scripting language and are generally the domain of professional statisticians

A

Statistics packages