Week 7 - Linear pattern Flashcards

(33 cards)

1
Q

What relationship do we observe between Price and Mileage?

A

There is a negative relationship. As mileage increases, price decreases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does a negative relationship mean in this context (price and mileage)?

A

Cars with higher mileage tend to be cheaper, and cars with lower mileage tend to be more expensive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why is it important to understand the Price–Mileage relationship?

A

Because it:

Explains how car price changes with mileage

Helps us predict the average price given a mileage

Allows us to quantify how strong the relationship is

Helps measure the effect of mileage on price

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the dependent variable in this price-milage analysis?

A

Price — it is the value we want to describe and predict.
(Plotted on the y-axis.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the explanatory variable in this price-milage analysis?

A

Mileage — it may help explain Price.
(Plotted on the x-axis.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the correlation coefficient used to measure?

A

The strength and direction of a linear relationship between two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the formal name of the correlation coefficient?

A

The Pearson correlation coefficient.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the formula for the Pearson correlation coefficient?

A

r_xy = cov(x,y)/ (σ_x σ_y)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does covariance measure?

A

The joint variability of two variables — how they move together.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the formula for covariance?

A

cov(x,y) = 1/n n∑j=1 (x_j - x̄)(y_j - ȳ)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why do we divide covariance by the product of standard deviations in the correlation formula?

A

To scale the covariance so that correlation:

Is unit-free

Always lies between –1 and +1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does a positive covariance mean?

A

When x is above its mean, y tends to be above its mean too (move together).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does a negative covariance mean?

A

When x is above its mean, y tends to be below its mean (move in opposite directions).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Example: If x_j - x̄ is positive, and y_j - ȳ
is negative, what can we say?

A

Their product is negative, indicating a negative relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the formulas for the variances of x and y?

A

σˆ2 _x = 1/n n∑j=1 (x_j - x̄)ˆ2
σˆ2 _y = 1/n n∑j=1 (y_j - ȳ)ˆ2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does r= -1,0, and +1 mean?

A

–1 → perfect negative linear relationship

0 → no linear relationship

+1 → perfect positive linear relationship

17
Q

What is the range of the correlation coefficient r?

A

r ranges from –1 to +1.

18
Q

What does a larger value
∣r∣ indicate?

A

A stronger linear relationship between the variables.

19
Q

What does r=0 tell us?

A

There is no linear relationship (but other, non-linear patterns may still exist).

20
Q

What does r=+1 or r=−1 indicate?

A

A perfect linear relationship (very rare in real data).

21
Q

What does the sign of r tell us?

A

Positive sign → positive relationship

Negative sign → negative relationship

22
Q

What does the value (size) of r NOT tell us?

A

It tells us nothing about the steepness (slope) of the relationship.

23
Q

State the hypotheses for testing correlation.

A

Null hypothesis: H_0 : p_x,y
= 0 (no population correlation)

Alternative hypothesis: H_1 : p_x,y ≠ 0

24
Q

What is the difference between p_x,y and r_x,y?

A

p_x,y : population correlation coefficient
r_x,y : sample correlation coefficient

25
What does "Sig. (2-tailed)" mean in SPSS/outputs?
It is the p-value. If p < α, the correlation is statistically significant.
26
Can we calculate a correlation coefficient for a non-linear pattern?
Yes, but it will lead to incorrect or misleading analysis.
27
What does Anscombe’s Quartet demonstrate?
All four datasets have the same correlation r=0.816 But they show very different patterns Conclusion: Always plot your data before interpreting correlation.
28
What important warning must we remember about correlation?
CORRELATION IS NOT CAUSATION.
29
What do scatterplots help us observe?
The shape and direction of the relationship (linear or non-linear).
30
What does the correlation coefficient measure?
Direction (positive/negative) Strength (weak/moderate/strong) of a linear relationship.
31
Does a low correlation mean no pattern exists?
No — there may still be a non-linear pattern.
32
After identifying a linear pattern, what is the next question we ask?
How much does Mileage affect Price?
33
What statistical tool lets us measure the effect of Mileage on Price?
Simple Linear Regression.