Week 10 Flashcards

1
Q

What is Association Rule Mining (ARM)?

A

A “rule”-based machine learning method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the objective of ARM?

A

To discover interesting relations between variables in large DBs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Association Rule Mining aims to identify _____ rules discovered in DBs, using some measures of interestingness

A

strong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

T/F - Association Rule Mining assumes all data is numeric

A

False. ARM assumes all data is categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

T/F - ARM features good algorithms for numeric data

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What was Associated Rule Mining initially used for?

A

Market analysis, to determine how items purchased by customers are related

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A customer buys bread, there is an 80% chance they buy butter.
bread > butter

or

A customer buys {onions, potatoes} > buys{tomatoes}

What is this an example of?

A

Associated Rule Mining

It was useful for marketing activities like product promotion or product pricing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Bread > Butter[20%, 45%]

What is the Antecedent, Consequent?
What is the Support, Confidence?

A

Antecedent = Bread
Consequent = Butter
Support = 20%
Confidence = 45%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

In a store where there are 100 total transactions, what does a support of 20% for bread mean?

A

It means 20 transactions contained bread

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

With 20 transactions that contain bread there is a 45% confidence that the transaction also contained butter. How many transactions had butter?

A

9

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

T/F - Recommendation systems like Amazon’s “also viewed” use ARM

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Here are 3 algorithms for association rule:

Apriori Algorithm
Elcat Algorithm
F.P. Growth Algorithm

Which one is the best?

A

Apriori

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What do we find with this equation?

of transactions where X appears / Total # of transactions

A

Support (X)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

To find this we divide Support(X,Y) with Support (X)

A

Confidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Algorithm for k products:

  • User sets a minimum support criterion
  • Generate a list of one-item sets that meet the support criterion
  • Use the list of one-item sets to generate list of two-item sets
  • Use the list of two-item sets to generate list of three-item sets
  • Continue up through k-item set
  • All item sets must meet the support criterion

What do we get by following this algo.?

A

The Apriori algo. for Generating Frequent Item sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the maximum Confidence value for a X > Y rule if the consequent and antecedent always occur together? What is the minimum?

A

1, 0

17
Q

What does Lift measure?

A

How often X and Y happen together vs how often they happen independently

18
Q

What does it mean if the lift is greater than 1?

A

There is a positive correlation

19
Q

What does it mean if the lift is less than 1?

A

There is a negative correlation

20
Q

What does it mean if the lift is 1?

A

There is no relationship

21
Q

Association rules produce rules based on associations between ______ from a ___________ of transactions

A

items from a database of transactions