Chapter #10 - 10/30/24 Flashcards
relationships between measurements variables (42 cards)
explain the difference between deterministic and statistical relationship
Describe a linear pattern in a scatterplot.
Describe how a correlation relates to the strength and direction of a linear relationship.
Describe a linear pattern using a regression line
Do you think each of the following pairs of variables would have a positive
correlation, a negative correlation, or no correlation?
b. Calories eaten per day and IQ
no correlation (not a clear relationship)
Do you think each of the following pairs of variables would have a positive
correlation, a negative correlation, or no correlation?
a. Calories eaten per day and weight
positive correlation
define “correlation” :
measures the strength of a certain type of relationship (e.g., linear relationship) between two measurement variables.
define “deterministic” :
if we know the value of one variable, we can determine the value of the other exactly. e.g. relationship between volume and weight of water.
define “regression” :
gives a numerical method for trying to predict one measurement variable from another.
define “statistical” :
natural variability exists in both measurements. Useful for describing what happens to a population or aggregate.
when is a relationship statistically significant ?
if the chances of observing the relationship in the sample when nothing is going on in the population are less than 5% (small).
what are two warnings about statistical significance ?
- even a minor relationship will achieve “statistical significance” if the sample is very large.
- A very strong relationship won’t necessarily achieve “statistical significance” if the sample is very small.
what is a scatterplot ?
when creating a scatterplot, if there is an explanatory variable, where do we plot it ?
on the horizontal axis (x axis)
How sensitive to changes in water temperature are coral reefs. Scientists examined data on mean sea surface temperatures (in degrees Celcius) and mean coral growth (in centimeters per year) over a several-year period at locations in the Gulf of Mexico.
Sea surface temperature and Growth
what is the response variable ?
coral reef growth
what letter represets correlation ?
r
what does r represent ?
indicator of how closely the values fall to a straight line.
what does correlation measure ?
linear relationships only (that is, it measures how close the individual points in a scatterplot are to a straight line)
what are some features of correlations ?
- Correlation of +1 indicates a perfect linear relationship between the two variables; as one increases, so does the other. All individuals fall on the same straight line (a deterministic linear relationship).
- Correlation of –1 also indicates a perfect linear relationship between the two variables; however, as one increases, the other decreases.
- Correlation of zero could indicate no linear relationship between the two variables, or that the best straight line through the data on a scatterplot is exactly horizontal.
- A positive correlation indicates that the variables increase together.
- A negative correlation indicates that as one variable increases, the other decreases.
- Correlations are unaffected if the units of measurement are changed. For example, the correlation between weight and height remains the same regardless of whether height is expressed in inches, feet or millimeters (as long as it isn’t rounded off).
if 0.5 less than or equal|r| less than 0.8 what does this mean ?
moderate
if |r| less than or equal to 0.5, what does this this mean ?
weak
if |r| greater than or equal to 0.8, what does this mean ?
strong
Choose the option that best answers the following question.
If the correlation (r) between two variables is close to 0, you can conclude that a scatterplot would show :
a) a strong straight-line pattern
b) a cloud of points with no visible pattern.
c) no straight-line pattern, but there may be a strong pattern of another form.
c) no straight-line pattern, but there may be a strong pattern of another form.
when specifying linear relationships with regression? what is the goal ?
find a straight line that comes as close to possible to the point in a scatterplot