Revision for recognition Flashcards

1
Q

How can we measure the class impurity in the decision tree

A

We can use entropy, Gini Index

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Explaining briefly the pruning process

A

it aims to delete the parts to reduce the complexity and variance
the complexity that are inaccuracy for prediction and relies on the misclassification rate

another approach consists of a rule simplification technique
where we delete part of a test or a whole rule s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what are the advantages and disadventages of the decision tree

A

Ability to merge qualitative and quantitave features
the scaling doesn’t

No stability, slight modification of the data lead to variation in the resulting tree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Explain random forest

A

this is to use several weak decision trees that are generated by bootstrapping (random selection of features)then a majority vote is used for prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are the advantages in tree forests

A

they are simple and accuracy, pruniunig can be omitted by setting the maximum depth for the trees

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does C affect the SVM

A

The higher is C the smaller is the marge , the bigger is C the higher is the margin

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

SVM

A

is supervised machine learning technique that can be used for both regression and classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How to deal with non linearity in SVM

A

We can use what we call the kernel trick where we introduce a non-linear transformation to the datapoint such that problem becomes linear in essence projecting the data into high dimensional space

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is the role of hidden units in MLP

A

allows applying non linear transofrm to the input

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what are some dificulties in terms of NN

A

Thereis no systematique rule to determine the network architecture and the existence of local optima giving sub optimal solutions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what isNN

A

Correspond to to network of interconnected unite composed of an input layer, outputlayer and optionally hidden layer , each node is connected to the next one and eachconnection is characterized by a direction and a weight

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is the difference betwee classfication and regression

A

in classificaiton we seek to predict something descrete while regression we to seek to predict something contineous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is the difference between logistic regression and NN

A

logistic regression is a technique that can be used for binary classification which a neural network is more complex than that and logistic regression is a subset of

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the differnce between a logistic regression and perception

A

percpetron predicts class like yes or no while logistic regression is outputs probabilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is Gini index

A

describes the likelihood of a new data to be misclassified

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is the entropy

A

reflects how much information we can get from particular attribute

17
Q

what is the margin in SVM

A

defines the sparation between inter classeq

18
Q

how gamma iinfluence the kernel in RBF

A

gamma influence the impace of distance between each point