lecture 2 Flashcards

(31 cards)

1
Q

derive this

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When do you apply the chain rule in differentiation?

A

You apply the chain rule whenever you are differentiating a composite function, meaning a function nested inside another function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the chain rule

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What do we differentiate?

A

Functions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does this mean?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

apply the chain rule to differentiate the composite function below (hint, identify the inner vs outer function)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

I want to differentiate with respect to X, how do i write this using proper notation?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

I want to differentiate with respect to t, how do i write this using proper notation? What about y?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the correct notation for differentiating a function with multiple variables, how do i differentiate with respect to 1, x

e.g., f(x, y), and I want to differentiate with respect to 1

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Least squares estimation - what is this

A

They are mathematically derived formulas that minimize the sum of squared errors.

formulas are the least squares estimates for the coefficients 𝛽0 (intercept) and 𝛽1 (slope) in simple linear regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the formula for the sum of squared errors

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Whats the difference between B0 and B1 and the below

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Whta is b0hat and b1hat least squares estimates of?

A

b0 and b1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What do we mean by centroid of data?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Sxx what is the formula for this and whats it reflecting

A

This sums up the squared differences between each participant’s x-score (x𝑖 ) and the mean of all x-scores (xˉ).

Interpretation:
*Large Sxx → More spread-out x-values (higher variance in x).
*Small Sxx → x-values are closer to the mean (lower variance in x).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the value of B0(hat) that minimises thee SSE ?

17
Q

What is the value of B1(hat) that minimises thee SSE ?

18
Q

write mean of x (xbar) another way

19
Q

Syy what is the formula for this and whats it reflecting

A

This sums up the squared differences between each participant’s y-score (𝑦𝑖 ) and the mean of all
y-scores (𝑦ˉ).

Interpretation:
*Large Syy → More spread-out y-values (higher variance).
*Small Syy → y-values are closer to the mean (lower variance).

20
Q

Sxy - what is the formula for this and whats it reflecting

A

This represents the total covariation between x and y. It measures how much x and y change together.

Interpretation:
*Positive Sxy → x and y tend to increase together (positive correlation).
*Negative Sxy → When x increases, y tends to decrease (negative correlation).
*Near-zero Sxy → Little to no linear relationship between x and y.

21
Q

what does b1hat coefficient represent

A

the expected change in Y for a unit increase of X

22
Q

Can I apply the regression equation to predict any value of an iv for someone outside of the study

A

Not if values lie outside of the range of x-values used to fit the model: the linear relationship might not hold outside of this range. It can be dangerous to extrapolate

23
Q

what does goodness of fit measure

A

measures how widely scattered the points are across the line of bast fit

24
Q

How can we quantify the variation in data across the line of best fit ?

25
when adding an error term to the statistical model, what is the errors term expected value, variance, and correlation matrix?
In the model Ei is a random variable whos expected value is 0, with a variance of sigma squared. And the Ei's are all independent which is what meant by the third bullet point We assume these are independent and normally distributed
26
Why do we need the error terms to be independent?
so we can say Yi is normally distributed
27
For a standard regression equation with 1pv and 1 error term, what is random vs fixed variables?
28
Whats the difference between OLS and MLE?
OLS (least squares estimates) estimates β0 and β1, but it does not explicitly estimate σ2. OLS is a specific case of Least Squares Estimation, and in simple linear regression, they are the same thing. (Minimizes squared errors) MLE estimates all three parameters: β0,β1 and σ2 (Maximizes likelihood). Maximizing the likelihood means choosing the values of β0, β1 and σ2 that make the observed data most probable.
29
What is the mean and variance of:
30
What sis the formula for the SSE (sum of squared errors)?
31
Why Do We Differentiate the SSE with Respect to β0 and β1 in Least Squares Estimation?
because we are trying to find the values of β0 and β1 that minimize the SSE.