ML Flashcards

1
Q

What is Machine Learning?!

A

Machine learns from data …

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Machine Learning: Definition

A
  • ML is about enabling a machine to act, without explicitly programming it!
  • It is predicting unknown from uncertain information
  • Self-configuring data structures that allow a computer to do things that would be called “intelligent” if a human did it ….
  • “making computers behave like they do in the movies”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data is recorded from some real-world phenomenon. What might we want to do with that data?

A

Prediction - what can we predict about this phenomenon?

Description - how can we describe/understand this phenomenon in a new way?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Learning from Data: What is it Good for?

A

Right: Write code to make the computer learn how to do the tasks
Wrong: Write code to explicitly do the above tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Many applications are immensely hard to program directly. These almost always turn out to be “pattern recognition” tasks.

A

Right: 1. Program the computer to be able to learn from examples. 2. Provide “training” data.
Wrong: 1. Program the computer to do the pattern recognition task directly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Steps Involved in Software Engineering Methodology

A

• Analyze
– Interview experts, users, etc. to determine the actions the system must perform
• Design
– Apply Computer Science knowledge to design a solution
• Implement
• Test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Challenges for Software Engineering Methodology

A

• Standard SE methods fail when…
– System requirements are hard to collect
– The system must resolve difficult tradeoffs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why System Requirements Can be Hard to Collect …

A
• There are no human experts
– Cellular telephone fraud
• Human experts are inarticulate
– Handwriting recognition
• The requirements are changing rapidly
– Computer intrusion detection
• Each user has different requirements
– E-mail filtering
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Machine Learning:

A

Replacing Guesswork with Data
• In all of these cases, the standard SE methodology requires engineers to make guesses
• Machine Learning provides a way of making these decisions based on data, instead of guesswork

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Machine Learning vs. Artificial Intelligence

A

Humans can think, learn, see, understand language, reason, etc.
• Artificial Intelligence aims to reproduce these capabilities.
• Machine Learning is one part of Artificial Intelligence.
• Machine learning aims to reproduce learning capabilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Using machine learning to detect spam emails.

A

Naïve Bayes

Rule mining

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Using machine learning to recommend books.

A

Collaborative Filtering
Nearest Neighbour
Clustering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Using machine learning to identify faces and expressions.

A

Decision Trees

Adaboost

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Using machine learning to identify vocal patterns

A

Feature Extraction
Probabilistic Classifiers
Support Vector Machines

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

ML for working with social network data:

A

detecting fraud, predicting click-thru patterns, targeted advertising, etc etc etc .

ALGORITHMS
Support Vector Machines
Collaborative filtering
Rule mining algorithms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Usually First Step in Machine Learning

A

Feature Extraction

17
Q

Feature Extraction

A

presenting stimulies using their distinctive attributes

18
Q

Feature Extraction: What for?

A

Decreasing the size of data presented to machine
– Saving computational and memory load
Presenting stimuli’s using their discriminative attributes
– Not including attributes which does not contribute in the specific learning task.
– Not including attributes which does not help in distinguishing between different stimulies.
– Example: Transcribing speech to text:
• Loudness is irrelevant to message content, no matter how load you speak, you still say the same thing!
• Loudness information should be removed in the feature extraction process
– Removing attributes helps in getting rid of extra sources of noise
• Example: when loudness is removed, no need to deal with noise coupled with loudness …

19
Q

Features should be…

A

discriminative between different stimuli classes: e.g. discriminating between apple and orange!

20
Q

Two steps of ML:

A
– Training: Letting the machine learn association between a stimuli class and its features
– Testing: Presenting a new stimuli to machine, asking the machine to recognize the stimuli class!
21
Q

Forms of Learning:

A

Supervised and Unsupervised Learning

22
Q

Supervised Learning

A

– Agent learns a function mapping input to output from examples, which are labeled into known classes

23
Q

Unsupervised Learning

A

– Agent identifies patterns in examples, but no known label is provided

24
Q

Perceptrons (Neural Networks)

A
  • Perceptron model simulates brain cognitive mechanism
  • Perceptron is a dense interconnection of computing cells, similar to brain which is a dense interconnection of biological cells
25
Q

Perceptron unit

A

A Perceptron is a dense interconnection of below units.
Input links → Input function → activation function→ output → output links

Inputs to the cells are weighted individually (multiplied by weights), summed up, and then pass through a nonlinearity in order to form the output

26
Q

Perceptron is a Feed Forward Network

A

Outputs of each layer is calculated based on outputs of previous layers

27
Q

Propagation

A

Computations in each layer is propagated to the next layer:
• In the first layer, features are multiplied by weights, summed and passed by a non-linear function in order to form outputs of f3 and f4 …

28
Q

Perceptron Learning

A

• Usually in machine learning, an error (cost) function (E) is defined to be minimized.
– The error function is distance between the output of the machine and the known labels.
– Example: A robot has to reach a certain position, the error function is the distance between the target position and its final position.
• Learning involves adjusting weights in the network (parameters in the model), so that the error function is minimized!

29
Q

Gradient descent is a …

A

first-order iterative optimization algorithm for finding the minimum of a function. To find a local minimum of a function using gradient descent, one takes steps proportional to the negative of the gradient (or of the approximate gradient) of the function at the current point.
(Gradientenverfahren, auch Verfahren des steilsten Abstiegs)