Chapter 1 Flashcards

1
Q

What is Machine Learning?

A

Machine Learning is the field of study that gives computers the capability to learn without being explicitly programmed. It means rogramming computers so they can learn from data. Learning means getting better at some task, given some performance measure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data Mining

A

Applying ML techniques to dig into large amounts of data can help discover patterns that were not immediately apparent. This is called data mining.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Four types of problems where ML shines?

A

ML is great for problems with no algorithmic solutions, to replace long list of hard wired rules, to build systems that adapt to fluctuating environments, and getting insights in large amounts of data(data mining).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

NLP

A

Natural Language Processing. Text classification.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Supervised learning

A

In supervised learning, the training set you feed to the algorithm includes the desired solutions, called labels.
It has two types : Classification(spam filters), Regression( to predict a target numeric value, such as the price of a car, given a set of features (mileage, age, brand, etc.) called predictors).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Name some supervised learning algorithms.

A

k-Nearest Neighbors
Linear Regression
Logistic Regression
Support Vector Machines (SVMs)
Decision Trees and Random Forests
Neural networks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Unsupervised learning

A

In unsupervised learning, the training data is unlabeled.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Can you name four common unsupervised tasks?

A

Clustering, Visualization and dimensionality reduction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Dimensionality reduction

A

Simplify the data without losing too much information. One way to do this is to merge several correlated features into one.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Name one Dimensionality reduction technique

A

Feature extraction.
For example, a car’s mileage may be strongly correlated with its age, so the dimensionality reduction algorithm will merge them into one feature that represents the car’s wear and tear. This is called feature extraction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Reinforcement Learning

A

In Reinforcement Learning, the learning system, called an agent, can observe the environment, select and perform actions, and get rewards in return (or penalties in the form of negative rewards). It must then learn by itself what is the best strategy, called a policy, to get the most reward over time. A policy defines what action the agent should choose when it is in a given situation. example teaching robots to walk.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Batch learning

A

In batch learning, the system is incapable of learning incrementally: it must be trained using all the available data. This will generally take a lot of time and computing resources, so it is typically done offline. First the system is trained, and then it is launched into production and runs without learning anymore; it just applies what it has learned. This is called offline learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Online learning

A

In online learning, you train the system incrementally by feeding it data instances sequentially, either individually or in small groups called mini-batches, as opposed to a batch learning system. This makes it capable of adapting rapidly to both changing data and autonomous systems, and of training on very large quantities of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is out-of-core learning?

A

Online learning algorithms can be used to train systems on huge datasets that cannot fit in one machine’s main memory. The algorithm loads part of the data, runs a training step on that data, and repeats the process until it has run on all of the data. This is called out-of-core learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Whai is learning rate?

A

It is a parameter of online learning systems of how fast they should adapt to changing data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Instance-based learning

A

An instance-based learning system learns the training data by heart; then, when given a new instance, it uses a similarity measure to find the most similar learned instances and uses them to make predictions.

17
Q

What do model-based learning algorithms search for? What is the most common strategy they use to succeed? How do they make predictions?

A

Model-based learning algorithms search for an optimal value for the model parameters such that the model will generalize well to new instances. We usually train such systems by minimizing a cost function that measures how bad the system is at making predictions on the training data, plus a penalty for model complexity if the model is regularized. To make predictions, we feed the new instance’s features into the model’s prediction function, using the parameter values found by the learning algorithm.