Week 2: Business Analysis Tools Flashcards

- Learn different business analytics techniques. - Understand the importance of spreadsheet tools for business analytics. - Understand the importance of programming tools in business analytics. (26 cards)

1
Q

Big Data Analytics (examples)

A

Data mining, clustering, regression, classification, association analysis, decision tree, neural networks,
statistical analysis, optimization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Text analytics (examples)

A

Information retrieval, text summarization, sentiment analysis, topic modeling, thematic analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Web/mobile analytics (examples)

A

Web information retrieval, search systems, web crawling, website ranking, search log analysis, smartphone platforms, mobile advertising and marketing, gamification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Social media analytics (examples)

A

Content-based analytics, Structure-based analytics (or social network analytics), community detection, social influence analysis, Link prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Multimedia analytics (examples)

A

Audio analytics, speech analytics, video analytics, automatic video indexing and retrieval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

3 main approaches to business analytics tools

A
  1. Spreadsheet tools
  2. Programming tools
  3. Proprietary business analytics solutions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Spreadsheet tool (definition)

A

Interactive software application for structuring, transforming, analyzing, and storing data in rows and columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Tabular data (definition)

A

Data that has rows and columns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Comma-separated value [CSV] file

A

Tabular spreadsheet data that uses commas to separate lines and new lines to separate records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Comma-separated value [CSV] file

A

Tabular spreadsheet data that uses tabs to separate lines and new lines to separate records

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Examples of data cleaning in spreadsheets

A
  • Search and replace
  • Sorting and filtering
  • Built-in functions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Programming (definition)

A

process of solving a problem using computer algorithms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Programming language (definition)

A

formal set of instructions that can be used to produce various kinds of output

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Open-source programming tools (definition)

A

programming tools that are made
freely available, often developed by and for the community

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Programming code (definition)

A

collection of statements written in a particular programming language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Pros of Excel

A
  • Highly accessible functions easily implemented at the point of contact with data
  • Functions built into spreadsheet environment (easier to implement)
  • Experience with Excel is more comment
17
Q

Cons of Excel

A
  • Functionality limited for advanced statistics
  • Limited number of rows and columns on a worksheet
  • Limited number of characters in one cell
18
Q

Pros of R [programming language]

A
  • Good for data-oriented projects
  • Handles very large datasets (big data)
  • Large number of ready-made packages
  • Data visualization tools built-in
  • Developed by data scientists
  • Large community support
  • Supported by RStudio (integrated development environment) that lacks good competitors and and has no Python equivalent
19
Q

Cons of R [programming language]

A
  • Steep learning curve
  • Less efficient for general computations
  • Some inefficiently written packages
20
Q

Pros of Python

A
  • Growing community of compsci software engineers and programmers
  • More opportunities to take advantage of AI
  • Flexible
  • data analysis can be integrated with website and mobile apps or production database
  • Can do other programming tasks besides data analysis
21
Q

Cons of Python

A
  • Less efficient for statistical computations
  • Less visually-appealing data visualization tool
  • Fewer packages
22
Q

Types of variables

A

Logical, integer, numeric, and character

23
Q

Logical variable (definition)

A

contains only two possible values: TRUE or FALSE
(indicator variable or dummy variable)

24
Q

Integer variable (definition)

A

contains numbers without decimal points

25
Numeric variables (definition)
contains numbers with decimal points
26
Character variables (definition)
contains words that do not have order or numerical meaning (string variable or text variable)