Midterm Review Flashcards
Area under density curve
Always = 1
How to tell if something is an outlier
Q1 - 1.5(IQR)
Q3 - 1.5(IQR)
Outside of this range is an outlier
Density Curve
A smooth curve which approximates the shape of a histogram and describes the overall pattern of a distribution
Residual
The difference between an expected value and the actual value (y-yhat)
Complementary events
When two events together make up the entire sample space (even and odd numbers)
Ogive graph
Graph of percentile, relative cumulative frequency distribution
Variance calculation
Σ(x-xbar) squared / (n-1)
Independent event
Choice of selecting one object does not affect ways of selecting other objects
Dependent events
Selecting an object does affect selecting other objects
Circular permutations
N objects in a circle, then (n-1)! permutations of the objects
Calculating correlation
Sum (standardized x)(standardized y) / (n-1)
Least squares regression
Method of predicting response given explanatory
Line of “response” on “explanatory”
LinReg(a+bx)
Yhat= a + bx
LSRL “b”
Slope:
r(sy/sx)
LSRL “a”
Intercept:
Ybar- b(xbar)
Coefficient of determination
R^2,
Percent variation that can be explained by the lsrl
Influential point
When removed, dramatically changes slope of lsrl, often x outlier
Power law model
Y=ax^p
Log y = log a + p log x
Both variables are transformed
Exponential growth model
Y=ab^x
Log y = log a + x log b
Only response is transformed
Common response
A lurking variable- both x and y are acted on by another z force
Confounding
A lurking variable which also affects the response, making it unclear how much effect the explanatory actually has
Conditional distribution
Table cell/row or column total
Marginal distribution
Sum or row total/table total
Important in experiment design
Control (lurking variables)
Random (SRS)
Replication of experiment
Observational study
No treatment/experiment