Lecture 1 - Introduction Flashcards

1
Q

What is machine learning about

A

Machine learning is about using the right features to build the right models that achieve the right tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are Tasks

A

Problems that can be solved with machine learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are Predictive Tasks

A

Predicting a target variable from a number of features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are descriptive tasks

A

exploiting the underlying structure of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are 3 Predictive Tasks

A
  • Classification
  • Regression
  • Predictive clustering
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are 3 Descriptive Tasks

A
  • Descriptive clustering
  • Association rule mining
  • Subgroup Discovery

Descriptive tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the difference between predictive and descriptive tasks

A

Model output of predictive models involves a target variable, while the model output of the descriptive models does not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Predictive Classification

What is Classification

Also give 2 examples

A

Classification tasks predict categorical target variable from a set of features.
* image classification
* weather type prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Predictive regression

What is regression

Give 2 examples

A

Regression tasks predict a numerical target variable from a set of features
* stock price forecasting
* weather temperature forecast

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Predictive clustering

What is predictive clustering

give 1 example

A

Predictive clustering predicts with the intention to assign class labels (predicting a target)
* fraud detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Descriptive Clustering

What is descriptive clustering

give 2 examples

A

The clusters are representing different groups formed in data without the intention of predicting a target.
* grouping plant data
* pattern mining

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Association rule mining

What is association rule mining

Give 2 examples

A

A rule-based task for discovering interesting relations between variables
* market basket anaylsis
* online shopping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Sub-group discovery

What is subgroup discovery

give 2 examples

A

Technique that discovers interesting associations among different variables, with respect to a property of interest
* detection of risk groups with disease
* finding patterns in traffic accidents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Supervised vs Unsupervised

What is supervised learning

A

In supervised learning tasks, we provide a traning set of examples: instances, labelled with the true target value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Supervised vs Unsupervised

What is Unsupervised learning

A

In unsupervised the data is unlabelled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Name two Supervised and Predictive models

A
  1. Classification
  2. Regression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Name one Supervised learning descriptive model

A
  1. Subgroup discovery
18
Q

Name one unsupervised learning predictive model

A
  1. Predictive clustering
19
Q

Name two unsupervised learning descriptive models

A
  1. Descriptive clustering
  2. association rule discovery
20
Q

What are Models?

A

Models are what is being learned from the data, in order to solve a given task

21
Q

How does a model of regression look like?

Equasion with Yi, Xi, ei

A

Yi = f(Xi+B)+ei
where Yi, Xi, ei are the target, features, and noise of specific instance i, and B and f are model paramters and model function

22
Q

What are the two ways machine learning models can be distinguished

A
  1. Main intuition
  2. Modus operandi (mode of operations)
23
Q

Main intuition

What are geometric models

A

using geometrical concepts. shit like linear tranformations, distance metrics, seperating hyperplanes

24
Q

Main intuition

what are Probabilistic models

A

aim for reducing uncertanty using probability distributions

25
# Main intuition what are Logical Models
defined in terms of easily interpretable logical expressions
26
# Second Categorization (modus operandi) What are gouping models
dividing the instance space into segments in each segment a very simple model is learnt
27
# Modus operandi What are Grading models
learning a single, global model over the instance space
28
# Geometric models Instance Space?
the set of all possible instances, whether they are present in the data set or not
29
# Geometric models Distances?
distance between two points * Euclidean distance for two points. (could work in multiple dimentions)
30
# Geometric models What are Hyper-Planes
a decision boundary that divides the input space into two or more regions, each corresponding to a different class or output label
31
# Probabilitic models What is the bayes rule
P(Y|X)=P(X|Y)P(Y) P(X)
32
# probabilitic models What is *posterior*
posterior = (likelihood x prior)/evidence
33
# Proabilitic models P(Y|X) P(X|Y) P(Y) P(X)
P(Y|X) = posterior dist. P(X|Y) = likelihood prob. P(Y) = prior P(X) = evidence
34
# Logical models what is meant by **Declerative**?
Declerative: models of this type can be earlisy translated into ruls that are understandable by humans * Rules can be organised into a feature tree
35
# Grading vs Grouping Difference between grouping models and grading models. | Give 3 points for each
Grouping models * break up the instance space into **groups of segments** * grouping models have a **fixed and finite resolution** * cannot distingiush between indivual instances beyond this resolution Grading models * Do not work based on the notion of segments * they form one **global model **over the instance space * **infinite resolution**
36
Give 1 example of a grouping model
1. Decision trees
37
Give two examples of a grading model
1. Linear regression 2. Linear classifiers
38
# Training vs inference What is the Training phase
Training is the process of creating a machine learning model that has learned ot perform a task using a training set of numerous data points.
39
What is inference phase
Inference is the process of using a machine learning model to perform the task on a new data point.
40
what are Features
Kind of measurement taht can be easily performed on any instance
41
what if the data is not in the form you want?
1. Feature construction 2. Discretisation (numerical into categorical) 3. Feature transformation (project data into a new space) 4. Feature selection (removing redudntant features)