9 Service Analytics Flashcards

1
Q

Which phenomenon contributes to the formation of huge amounts of data, also called big data?

A

Internet of Things -> every electronic object generates data

Industry 4.0 -> A lot more data gets “harvested” in all industries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what are the four “Vs” of big data?

A

variety
volume
velocity
veracity (Wahrhaftigkeit)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why is Big Data such a big thing?

A

People expect new insights by analysing this data
- find structures

-> these new insights shall enable new business models or bring to light possibilities to makes processes more efficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name the two levels big data will be exploited and name examples

A

Centralized level: Central data model and intelligence
- transportation systems, warehouses

Autonomous/ decentralized intelligence

  • Self-managing traffic lights
  • packages will find their way automatically
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q
Name 8  (Business) Analytics methods 
\+ group them in "basic" and "advanced"
\+ sort them according to their degree of intelligence/ competitive advantage
A
Basic:
Standard reporting
Ad hoc reporting
Drill down
Alerts
Advanced: 
Forecasting
Simulation
Predictive modeling
Optimization
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Name three kinds of analytics

A

Descriptive
Predictive
Prescriptive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In which departments of a company are more likely to rely on data and analytics?

A

with increasing value:

Customer
HR
Strategic
Operational
Financial
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Distinguish supervised from unsupervised machine learning

A

supervised:

  • we need to train the model
  • > set of know problems/ answers needed to train
  • typical tasks: regression or classification
  • Example: Tell a child to sort cars into sports cars and SUV after telling him what their characteristics are

unsupervised:

  • identify previously unknown patterns
  • > outcome might be a structure we haven’t been thinking of yet
  • typical: clustering/ association rules
  • Example: Tell a child to sort cars but don’t give any criteria
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Definition: Machine learning

A

algorithms that can learn from and make predictions on

data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Definition: Data Mining/ Knowledge discovery in databases

A

The nontrivial process of identifying valid, novel, potentially useful, and understandable patterns in
data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Draw the Crisp- DM circle

A

Business understanding Data understanding

Data Understanding -> Data preparation

Data preparation Modeling

Modeling -> Evaluation -> Deployment

Evaluation -> Business understanding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe the term Data Science using a Venn Diagram

A

Computer Science + Statistics = Art zone, solving problems that never appear

Computer Science + Business Application = Danger zone, automation of gut feeling

Statistics + Business Application = Theory zone, solution without implementation

All three disciplines are needed!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explain shortly multivariate regression

is it used for supervised or unsupervised learning?

A

predict the value of a response variable with help of its correlation with other variables

  • find the right few (~ 2-4) variables that explain the response variable realistically
  • not too many -> overfitting

-> used for supervised learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

describe the method “classification” shortly and name techniques for it

A
One variable indicates class membership
The other variables are used to predict it
Techniques:
Naive Bayes
k-Nearest Neighbor
Decision Trees (Quinlan’s ID3, C4.5)
Logistic Regression
Neural Networks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Name two applications of unsupervised learning

A

Clustering/ Segmentation

Association rules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Explain Clustering shortly

A

Arrange data objects so that nearby objects are similar

  • > Algorithm derives (herleiten) the pattern
  • K- Means algorithm to cluster n objects into k groups
17
Q

Explain shortly association rules

A

Algorithm should find rules/ dependencies in a set of objects

Example:

  • When someone buys beer, how likely is it that he also buys chips (beers -> chips)
  • When someone buys chips, how likely is it that he also buys beer (chips -> beer)
  • How often does someone put a specific object in his cart?
18
Q

What kind of algorithms exist for text mining? Name examples for applications

A

Classification
Clustering
Language analysis (what’s the meaning, but also what kind of person is that)

Applications:

  • Identifying personal traits from texts -> create a personality portrait
  • capture information about what customers want from social networks