2-mlconcepts Flashcards

1
Q

What is an instance?

A

An instance is an individual, independent example of a concept. (Rows)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is an instance also known as?

A

Instances are also known as exemplars

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an attribute?

A

Attributes are the measuring aspects of an instance. (Columns)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are attributes also known as?

A

Attributes are also known as feature

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are concepts?

A

Concepts are the things we aim to learn, generally in the form of labels or classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a supervised learning method?

A

Supervised learning methods have prior knowledge of a closed set of classes
and set out to discover and categorise new instances according to those
classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is an unsupervised learning method?

A

Unsupervised learning methods do not have access to an inventory of classes, and instead discover groups of ‘similar’ examples in a given dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is “strong” unsupervised learning?

A

Strong unsupervised learning dynamically discovers the “classes” (implicitly derived from grouping of instances) in the process of categorising the instances

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is “weak” unsupervised learning?

A

Weak unsupervised learning categorises instances as certain labels without the aid of pre-classified data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are nominal quantities?

A

Nominal quantities are distinct symbols, serving only as labels or names. No relationship is implied on nominal values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are ordinal quantities?

A

A set of labels with an explicit total order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are numeric quantities?

A

Numeric quantities are real valued attributes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How can nominal values be converted to numeric values?

A
  1. Map categories to numbers (creates artificial ordering)
  2. One hot encoding (increases dimensionality)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is feature standardisation?

A

Feature standardisation rescales features to be distributed around a 0
mean with a unit standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is feature scaling?

A

Feature scaling rescales features to a given range. For example, min-max scaling rescales values between 0 and 1 using the minimum and maximum feature value observed in the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is discretization?

A

Discretization groups numeric values into a pre-defined set of distinct categories. This can be done through binning the dataset.

17
Q

What is equal width discretization?

A

Equal width discretization, partitions the data set into n bins of width (max - min)/n

18
Q

What is equal frequency discretization?

A

Equal frequency discretization, partitions the data set into n bins, each with an equal number of values. Requires dataset to be sorted before binning.

19
Q

What ways can numeric values be discretised?

A
  1. Equal width discretization
  2. Equal frequency discretization
  3. Clustering (K-means)
20
Q

What is linear regression?

A

Linear regression is a model, that estimates a real value (numeric class) through a linear combination of real valued attributes and attribute specific weights

21
Q

What is the ordinary least squares learning method?

A

The ordinary least squares learning method, minimises the sum of squared errors in linear regression

22
Q

What is the sum of squared errors (SSE)?

A

The sum of squared errors is the sum of the square of the difference between the actual and predicted values