Liner regression Flashcards

Question

what does machine learning do

Answer 1

Finds a mathematical formula when applied to a collection of inputs (« training data ») produces the desire outputs.

Answer 2

imput+ desired result computation program

Answer 3

input+ Programm computation = results

Answer 4

dimension reduction and clustering.

Answer 5

a technique used to reduce the number of features in a dataset while retaining as much of the important information as possible.

Answer 6

the training set gives the computer example answers. eg pictures of cats and dogs are already provided

Answer 7

Input, also known as features or exogenous variables

Answer 8

Output, also known as label, response or endogenous variable: y

Answer 9

to collect historical data from a previous algorithm.

Answer 10

classification and regression

Answer 11

Supervised learning we are trying to predict results which have discrete output (i.e. category or class) eg identifying objects or language

Answer 12

Logistic Regression Linear (and quadratic) Discriminant Analysis K-Nearest Neighbors RLab: Logistic, LDA, QDA, KNN

Answer 13

we are trying to predict results which have continuous output. like finding the line of best fit stock prices forecast, correlation analysis, medical diagnosis, demand and sales volume analysis,...

Answer 14

It uses a small amount of labeled data and a large amount of unlabeled data, which provides the benefits of both UL and SL while avoiding the challenges of finding a large amount of labeled data.

Answer 15

The goal is to learn a policy which is a function (similar to the model in SL) that takes the feature vector of a state as input and outputs an optimal action to execute in that state. The action is optimal if it maximizes the expected average reward. the policy is constantly being updated

Answer 16

Model-based means memorizing lots of information Model-free means generalize situation eg he self-driving car doesn't memorize every movement but tries to generalize situations and act rationally while obtaining a maximum reward.

Answer 17

a numerical value like 2

Answer 18

is an ordered list of scalar values, called attributes, like 𝑎 = −2, 5 .

Answer 19

matrix is a rectangular array of numbers arranges in rows and columns. 2 6 −1 30 −6 −3

Answer 20

A function is a relation that associates each element 𝑥 of a set 𝒳 (the domain of the function) to a single element 𝑦 of another set 𝒴 (the codomain of the function).

Answer 21

We say that 𝑓(𝑥) has a local minimum at 𝑥=𝑐 if 𝑓(𝑥)≥𝑓(𝑐) for every 𝑥 in some open interval 𝑥 = 𝑐.

Answer 22

a function 𝑓 is a function or a value that describes how fast 𝑓 grows (or decreases).

Answer 23

Differentiation is the process of finding a derivative.

Answer 24

a random variable from a distinct data set like a dice can only be random between 1-6

Answer 25

a random variable from an infinite data set

Answer 26

Conditional probability = the probability of the random variable 𝑌 = 𝑗 given the observed predictor vector 𝑥0 of the random variable 𝑋:Pr (𝑌=𝑗 |𝑋=𝑥0) = Pr 𝑌=𝑗𝑋=𝑥0 Pr(𝑋=𝑥0) ----------------------------------- Pr(𝑌 = 𝑗)

Answer 27

variables that define the model learned by the learning algorithm (are directly modified by the algorithm based on the training data).

Answer 28

the model predicts the training data well

Answer 29

model makes many mistakes on the training data. The line of best fit may underfit the data and may consider the general direction of data.

Answer 30

Main reasons: - model is too simple for the data (linear regression) - engineered features are not informative enough Main solutions: - try a more complex model - engineer features with higher predictive power

Answer 31

Low variance = low sensitivity = performs well on both train and test sets.

Answer 32

High variance = high sensitivity = performs well on train but poor on test overfitting

Answer 33

BEFORE Analyst feeds the algorithm input data, which corresponds to an expected output. The model evaluates the data repeatedly to learn more about the data’s behavior and then adjusts itself to serve its intended purpose.

Answer 34

AFTER the model is built, testing data once again validates that it can make accurate predictions. Test data provides a final, real-world check of an unseen dataset to confirm that the ML algorithm was trained effectively.

Answer 35

Problems model is too complex for the data (deep NN) -too many features but a small number of training examples Solutions -try simpler model - add more training data if possible - regularize the model (more widely used)

Answer 36

multiple models of the same algorithm with different random training samples it avoids overfitting data

Answer 37

selecting data points which give wrong predictions. Each time the data gives a wrong prediction it trains the new model Often causes overfitting

Answer 38

logistic Linear discriminant analysis (maximises distance) QDA K's nearest neighbours

Answer 39

Linear discriminant analysis and logistic regression

Answer 40

if it has more than one x

Answer 41

the X is ^to the power linear equation will always be in the form of $y = mx + b$

Answer 42

they are different leaves on a decision tree

Answer 43

your optimizer will be jumping big leaps and never find the minimum

Answer 44

it will take forever to find the minimum

Answer 45

A perceptron takes several binary inputs 𝑥1 , 𝑥2 , 𝑥3 ,... and produces a single binary output as follows:

Liner regression Flashcards

(70 cards)