🟫 Bivariate Data Flashcards
(8 cards)
What is bivariate data?
Data that involves two variables, often used to find relationships between them.
What is a scatter plot?
A graph that displays bivariate data using points. Used to identify the correlation between variables.
What is correlation and how is it interpreted?
Correlation shows how strongly two variables are related:
Positive = both increase
Negative = one increases, one decreases
None = no pattern
What is the purpose of a line of best fit in a scatter plot?
A line of best fit represents the relationship between two variables and helps to predict values within the data range.
How do you interpret correlation coefficients?
The correlation coefficient ranges from -1 to +1, where:
+1 indicates a perfect positive correlation
-1 indicates a perfect negative correlation
0 indicates no correlation.
A scatter plot shows the relationship between the number of hours studied and test scores for 5 students. The hours studied are: 2, 4, 6, 8, 10, and the corresponding test scores are: 50, 60, 70, 80, 90. Does the data show a positive, negative, or no correlation?
Hours Studied: 2, 4, 6, 8, 10
Test Scores: 50, 60, 70, 80, 90
As the number of hours studied increases, the test scores also increase.
✅ This shows a positive correlation.
Answer: Positive correlation
You are given the following bivariate data for two variables, age and height:
Age: 4, 6, 8, 10, 12
Height: 100, 110, 120, 130, 140
Plot a scatter graph and describe the relationship between age and height.
Data:
Age 4 6 8 10 12
Height 100 110 120 130 140
Plot (if drawn): Each point moves up consistently as age increases.
📈 The points form a straight upward-sloping line.
✅ This is a strong positive linear relationship.
Answer: The relationship between age and height is strong, positive, and linear.
What is the formula for finding the line of best fit in a bivariate dataset?
The general form for the line of best fit (or linear regression line) is:
y=mx+c
Where:
y = dependent variable (e.g. height, test score)
x = independent variable (e.g. age, hours studied)
m = gradient (rate of change)
c = y-intercept (value when
x = 0)
Answer: y=mx+c
​