II: Machine Learning Flashcards

1
Q

Define Machine Learning.

A

A computer is said to learn from experience E with respect to a class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What’s the difference between top down and bottom up?

A

Bottom up: give the system lots of data to discover for itself
Top down: model all different functions and wire these together, and write manual instructions for solving a task

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Is machine learning top down or bottom up?

A

Bottom - up

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What’s the process of using machine learning to reach AI?

A
  1. analyse the task manually
  2. select appropriate model/s
  3. Training and testing
  4. Use the same model to solve unseen tasks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the 3 pillars of machine learning?

A

1.Models and algorithms
2.Computation (powerful and cheaper)
3. Massive data warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Explain the model and algorithm pillar of machine learning.

A

Doesn’t require knowledge engineers to find all functions and ‘wire’ them in

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Explain the data pillar of machine learning.

A

We have massive amounts of data and the more we can feed into machine learning the more successful it will be.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is data mining?

A

Exploration and analysis of large quantities of data to discover valid, novel and useful patterns that are understandable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a big difference between data mining and machine learning?

A

Data mining gives understandable patterns so it can be explained why a conclusion has been come to as opposed to through machine learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the two types of learning in machine learning?

A

Supervised and Unsupervised Learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Top-down and bottom-up are also known as?

A

Deduction (top-down)
Induction (bottom-up)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe supervised learning.

A

Learns from labelled data which consists of an input and a desired output

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe Unsupervised learning.

A

Learns from unlabelled data to describe a hidden pattern/structure
e.g. to reach a model or regression or grouping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Name three types of supervised learning.

A

Linear regression, decision tree, ANN

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is reinforcement learning? What is the goal?

A

Learning over time based on trial and error / awards from actions. Goal is to learn a process to maximise reward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are some examples of reinforcement learning?

A

Deep neural networks, Q-learning, deep Q networks

17
Q

What are the two main differences between supervised and unsupervised learning?

A
  • supervised has a relationship between Y and f(x), unsupervised usually doesn’t.
  • supervised uses a labelled data set, unsupervised uses an unlabelled set
18
Q

In data cleaning: what is the meaning of imputation?

A

Replacing missing values with realistic estimated values

19
Q

What is clustering?

A

The process of grouping unlabelled data based on a similarity or distance metric

20
Q

What’s the point of a test data set?

A

To check how well a trained data set performs

21
Q

What is regression training?

A

Finding a function based on given data that minimises its mean squared error to ‘fit’ the samples

22
Q

What is overfitting?

A

Fitting a model tightly to a small data set which increases the risk of fitting to an anomaly or noise data

23
Q

What are 3 pros of linear regressions models?

A
  • short training time
  • easy to interpret
  • easy to implement
24
Q

What are two cons of linear regression models?

A
  • sensitive to noise and outliers
  • cannot handle complicated relationships
25
What is classification learning?
Learning to predict which set an instance belongs to based on pre-labelled instances
26
In the decision tree model, what is an internal node?
A decision point where a test is made on an attribute of the data and a decision is made
27
In the decision tree model, what is a branch?
The course of a decision or action
28
In the decision tree model, what is a leaf node?
A predicted class label / output reached through a path of decision points (internal nodes) and branches
29
What are 4 pros of decision tree?
- Reasonable training time - Can handle large number of features - Easy to implement - Easy to interpret
30
What are 4 cons of decision tree?
- Only simple decision boundaries - Problems with lots of missing data - Cannot handle complicated relationships - Over- complex tree: potential for overfitting
31
When we only have one dataset, why do we need to split it?
To have data for training and data for testing
32
What stage happens before the training set is fed into the machine learning method?
Data pre-processing
33
What is a confusion matrix?
It represents how much data is predicted to each outcome and how many of those are predicted correctly and incorrectly
34
What is the concept of perceptron in Neural networks?
There is a set of weighted inputs that if they surpass a certain amount, it causes a set of outputs.
35
What is the differing meaning of positive and negative weights for neural network inputs?
- a positive weight encourages the neuron to fire as a result of that input - a negative weight discourages the neuron from firing as a result of that input
36
What are the 4 logic functions?
- Or - And Not
37
What does the logical OR function do in ANNs?
one or other of the inputs needs to be one
38
What is the logical function 'And Not' in ANN?
A AND NOT B means A must have a value of 1 and B a value of 0
39