Topic 5 Multiple Variables Flashcards
What is the purpose of a joint distribution in a bivariate analysis?
To examine the relationship and combined behavior of two variables.
A joint distribution helps answer:
a) Whether variables exist
b) If E[X] = E[Y]
c) To what extent two variables are related
d) Which variable is more important
c) To what extent two variables are related
What is the formula for covariance?
Cov(X, Y) = E[(X - E[X])(Y - E[Y])] = E[XY] - E[X]E[Y]
True/False
If X = Y, then Cov(X, Y) = Var(X)
True
What does a positive covariance indicate?
That X and Y tend to vary in the same direction.
If Cov(X, Y) < 0, then:
a) X and Y increase together
b) X and Y are independent
c) X is likely less than E[X] when Y is greater than E[Y]
d) X and Y are always equal
c) X is likely less than E[X] when Y is greater than E[Y]
What is Cov(cX, Y)?
c · Cov(X, Y)
True/False
Cov(X, c) = 0 for any constant c
True
Cov(X + Y, Z) = Cov(X, Z) + _________.
Cov(Y, Z)
What does a covariance matrix represent?
It represents the variance and pairwise covariances among multiple random variables.
What is the formula for the correlation coefficient?
ρ(X, Y) = Cov(X, Y) / (σ_X * σ_Y)
True/False
If ρ(X, Y) = −1, then Y is a decreasing linear function of X.
True
If X and Y are uncorrelated, what is true about their variances?
a) They are equal
b) Var(X + Y) = Var(X) + Var(Y)
c) Cov(X, Y) = 1
d) E[X + Y] = 0
b) Var(X + Y) = Var(X) + Var(Y)
If X₁, X₂, …, Xₙ are pairwise uncorrelated, then Var(X₁ + … + Xₙ) = ____________.
Var(X₁) + Var(X₂) + … + Var(Xₙ)
True/False
Corr(X, Y) = 0 always implies X and Y are independent.
False
What is Pearson’s correlation coefficient?
A measure of linear dependence between two continuous variables.
A correlation coefficient of 0.975 indicates that the two variables are ________.
highly correlated
What are spurious correlations?
Correlations between two variables that are statistically related but not causally linked.
True/False
High correlation always means one variable causes the other.
False — correlation ≠ causation.
The joint PMF for discrete variables X and Y is defined as P_XY(j, k) = _________.
P(X = j and Y = k)
What is R_XY in joint distributions?
a) Range of X only
b) All possible pairs (j, k) where PXY(j, k) > 0
c) Marginal of Y
d) Independent distribution
b) All possible pairs (j, k) where PXY(j, k) > 0
What is the formula for joint CDF of discrete variables X and Y?
F_XY(j, k) = P(X ≤ j, Y ≤ k)
True/False
Joint CDF can be expressed as an intersection of events: P((X ≤ j) ∩ (Y ≤ k))
True
What is the formula for the joint PDF for continuous variables X and Y?
f_XY(x, y) = P(x ≤ X ≤ x + dx, y ≤ Y ≤ y + dy) / (dx * dy)