Midterm Flashcards
Disjoint (mutually exclusive)
- events cannot happen at the same time
ex) rolling a die
Independent
- one event does not affect the probability of the other
ex) tossing 2 coins
Disjoint addition rule
P(AUB) = P(A) + P(B)
proving independence
if P(A)*P(B) = P(A and B), A and B are independent
conditional probability
P(A|B) = P(A and B) / P(B)
Multiplication Rule
P(A and B) = P(A) * P(B|A)
Bad samples
- convenience samples
- introduces bias
- over/under estimation
- x reflect population - bias
- systematically favoring certain outcome
- under/over estimation - voluntary response
- people who choose to answer a general appeal
- people with strong emotion
Good samples
- simple random sample
- ensures everyone has an equal chance of selection
- preferred when there are smaller data sets. - stratified random sample
- ensures all subgroups are represented
- more precise - cluster
- create cluster by location
- save money and time - systematic random sample
- randomly select k individual and count every kth individual
- preferred when population is ordered
- easier to conduct
What can go wrong?
- undercoverage
- members of the population have less of a chance of being chosen or left out - nonresponse
- chosen individuals cannot be contacted or refuse to participate –> big issue - response bias
- individuals lie or answer a question they don’t know - question wording bias
- they way a question is worded or asked influences the response from an individual
Observational study vs. experiment
observational study
- x treatment
experiment
- treatment (need experiment to know causation)
confounding variable
other possible variables other than explanatory variable that affects response variable
4 principles of experimental design
- random assignment
- replication
- control
- comparison
placebo effect
dummy treatment
purpose of control group
provides a baseline for comparison
purpose of single and double blind experiments
reduce placebo and favoring
purpose of random assignment
- creates roughly equivalent groups
- helps control confounding variables
purpose of replication
use enough subjects/experimental units so the outcome of the experiment can have meaning
statistically significant
- the results most likely did not happen by chance.
- convincing evidence with simulation
blocking
- stratified sampling in experiments
- blocks should be homogeneous
- use a confounding variable
purpose of blocking
- controls confounding variables
- increases chance of finding convincing evidence if the effect is real
matched pairs design
blocks only include 2 experimental units
1. 2 similar units
2. 1 unit that gets both treatment
(randomly assign treatments or randomly assign the order of treatment)
interference
using information from our sample/experiment to draw conclusions about the population
sampling variability
different samples from the same population will give different results
categorical graphs
side by side bar graph
segmented bar graph
mosaic plot
pie chart