Identifying Good Problems for ML Flashcards

1
Q

Before starting with ML ask yourself the following questions:

A

(in order)

  1. What problem is my product facing?
  2. Would it be a good problem for ML?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Know the problem before focusing on the data. If you understand the problem clearly, you should be able to ___

A

list some potential solutions to test in order to generate the best model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What kind of data is going to be the most useful?

A

Data collected specifically for your task is going to be the most useful. In practice, you may not be able to do this, and you’ll rely on whatever data you can get that’s close enough. That’s fine as long as you’re aware of the cost, and as you can eventually get product logs, you can use those to build something more targeted to your task.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why should you not make ML discover all the useful feature for you and simply throw everything at the model and see what looks useful?

A

Your model will likely wind up overly complicated, expensive, and filled with unimportant features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In __ __, you have a higher chance that a feature will be correlated with your label by chance within your sample of data.

A

smaller datasets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What do we mean by decisions in ML?

A

By decisions, we mean that your product should take action on the output of the model. ML is better at making decisions than giving you insights.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Make sure your _____ allow you to take a useful action.

A

predictions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly