Lecture 3 Flashcards

1
Q

Residual standdard Error

A

Average amount that the response deviate from the regression line

y~ is the estimate
n is the sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

RSE small implies model fits data well

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

RSE high implies model does not fit Data well

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Any prediction of lpsa based on lweight will still be off by 1.046 units on average.

If it is accepted or not it depends on the problem

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

RSE is measured in units of the output

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

R-squared is a measure of the fit however without the units

A

YES

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

RSS: Amount if variablity that is left unexplained after performing the regression

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

TSS: total variance in response to Y

A

Amount of variability in response before regression is performed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

R squared measure the proportion of variability in response y that can be performed using x

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

R squared close to 1 : large proportion of variability is explained by x which is good

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

R squared close to 0 => Regression did not explain much of the variability

A

True

Linear regression thus can be wrong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When the application we are considering to approximate is far from being approximated using he model then R2 will be near zero

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

R2 is highly affected by the number iof predictors we have

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Since R2 is highly affected by the number of predicotrs we have what is called adjusted R2,(how we pick predicotrs)

A

Regards

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

We want F-statistics to be as far from 1 as it can be

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The larger the F-statics , the more it indicates that we have a relation between what we are feeding into the model and the response

17
Q

Correlation : measure of linear relationship between X and Y

18
Q

Correlation does not imply casuality, it meeans how much value vary in the same way

19
Q

Multiple linear regression model , we want to add more predictors to our response variable

20
Q

Interaction effect the or as known as the synergy effect in marketing

A

Accounting for possible interactions between the predictors

21
Q

I introduce a new coefficient and a new variable given by X1 * X2 which allws me to account for interaction

22
Q

Linear regression: I am assuming the relationship between response and the predictor is linear

23
Q

Since the relationship between the predictor and the response is not always linear thus we can generate a polynomial regression model

24
Q

We should always ask ourselves, is it worth it to create a higher order model?

25
I data science we are taking a sample from the population in order to get something that we can say about the population
TRUE
26
W want to keeep part of our data aside in order to test our model
TRUE
27
WE want to see how our model is performing on different subsets of data
TRUE
28
We want to estimate the test prediction error of our model
TRUE
29
Resampling: Given one sample we repeatadily draw samples from it in order to refit our model
TRUE
30
Cross validation is when we want to evaluate the performance of our model by estimating it is test error
TRUE
31
When you have a flexible model the training error might underestimate the test error
True
32
We divide our data into part for creating model and part for testing
TRUE
33
WE need to randomly split our data
TRUE