Exam 2 Study Guide Flashcards
(47 cards)
What is the definition of probability?
- The likelihood of a certain event occuring
- p=specified outcome/total outcomes
What is the role of probability in inferential statistics?
- Probability is used to obtain the likelihood of obtaining a specific sample from a given population
- If the probability of getting a specific sample is small, we can say that the sample probably came from a different population
What are the requirements of random sampling?
Requires that each individual in the population has an equal chance of being selected
What is the difference between sampling with replacement and sampling without replacement?
- Sampling with replacement requires each individual to be returned back to the population before making the next selection to keep probability constant
- Sampling without replacement does not require each individual to be returned back to the population, does not have requirement of constant probability
Percentile vs Percentile rank
Percentile is a score, percentile rank is a percentage
What makes a distribution of sample means normal?
- If the population the sample means are obtained from is normal
- If n is 30 or greater
sampling error
the natural discrepency (amount of error) between a population parameter and the corresponding sample statistic
The expected value of M
The mean of the DOSM is equal to the population mean (mu)
Distribution of sample means
Distribution of Sample Means - the set of sample means for all possible random samples of specific size (n) that can be selected from a
population.
* This distribution has well-defined and predictable characteristics that are specified in the Central
Limit Theorem
Sampling distribution
A distribution of statistics obtained by selecting all the possible samples of a specific size from a population
Sampling distributions characteristics
- Sample means should pile up around the population mean; most sample means should be relatively close to the population mean
- Pile of samples should tend to form a normal shaped distribution (frequencies should taper off as the distance between M and mu increases)
- The larger the sample size, the closer the sample means is to mu (more representative). Thus, the sample means obtained with a large sample size should cluster relatively close to the population mean; the means obtained from small samples should be more widely scattered
Central Limit Theorem (Shape, Central Tendency, Variability)
- Mean of the theoretical distribution of sample means is called the Expected value of M (always equal to the population mean)
- The standard deviation of the theoretical distribution of sample means is called the Standard error of M and is computed by oM (0/Square root of n)
- The shape of the distribution of sample means is typically normal
- Distribution of sample means approaches a normal distribution as n approaches infinity
Law of large numbers
The larger the sample size (n) in a specific sample, the more probable that M is close to mu (larger sample, smaller standard deviation)
Standard Error of the Mean (how to calculate, and what it measures)
- Provides a measurement of the average expected distance between the M and u (like standard deviation of DOSM)
- Describes the distribution of sample means (variability)
1. Describes the distribution of sample means; provides a measure of how much difference is expected from one sample to another
2. Measures how well an individual sample mean represents the entire distribution
Formula: Om= O / n squared
Decreases when sample size increases
What is the critical value of z for a two tailed significance test w a=.05?
+ or - 1.96
What are the goals of hypothesis testing?
- to use sample
data to evaluate a hypothesis about a population - to rule out chance (sampling error) as a plausible explanation for the results from a research study (although it does not actually do this)
Null hypothesis (How to state using symbols and words)
for two-tailed tests
- When using two-tailed test, the null hypothesis symbol is that we predict M=mu
- means the observed findings are due to random chance (there does not appear to be a real effect)
- Ex. H0: Thinking Cap is not related to IQ. (μ = 100)
Alternative hypotheses (how to state w symbols and words)
for two-tailed tests
- Symbol: mu=/ to M
- Means the observed findings cannot be explained by sampling error (there does appear to be a real effect).
– Ex. H1: Thinking Cap is related to IQ. (μ ≠ 100).
Process of hypothesis testing (4-step method)
- State your hypothesis about the population
- Define the probability at which we think the results indicate that there must be a true effect (Critical value: Defined by associations that are very unlikely to obtain(typically less than 5% chance) if no effect exists.)
- Obtain and test a sample from the population
- Compare data w the hypothesis predictions and make a decision to reject or fail to reject null
Test statistic
- (ex. z-score) forms a ratio comparing the difference between the M and μ versus the amount of difference we would expect without any treatment
effect (σM).
What factors influence a hypothesis test?
The size of the treatment effect and the size of the sample… even a very small effect can be statistically significant if observed in a very large sample (would have a very small standard error)
a (alpha) level
establishes a criterion or cut off for deciding if the null hypothesis is correct (typically a=.05)
* outcome of a hypothesis test can be influenced by alpha level because the smaller the alpha level, the less likely it is that values fall in the critical region (so you are less likely to reject the null hypothesis)
Type 1 Error
- Occurs when sample data indicate an effect when no effect actually exists
- Rejecting the null hypothesis when the null is true.
- Caused by unusual, unrepresentative samples, falling in the critical region without any true effect.
- Hypothesis tests are structured to make Type I errors unlikely.