Introduction to Inferential Statistics Flashcards

(37 cards)

1
Q

How can we infer something about the population based on what we find in the sample?

A

Normal distribution
Empirical rule
Central limit theorem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Are t-values and standard errors used to estimate population parameters?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are inferential statistics used for?

A

To draw conclusions and make inferences about population parameters by analyzing data collected in a sample
To infer something about the population parameter using sample statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Will parameter estimates based on a sample exactly equal the population parameter?

A

No, because they vary from sample-to-sample
So, when reporting a statistic, we also typically report an interval (e.g., 95% confidence interval) which we believe includes the population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a confidence interval?

A

A range of values within which a population parameter (e.g., mean or proportion) is likely to fall, based on a sample from that population
If you repeated a study of the same size many times, 95% of the resulting confidence intervals would cover (include) the true population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Do statistics estimate the sample?

A

Yes
And parameters estimate the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Since samples describe the population, do statistics describe the parameters?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What notations are used for samples?

A

Number of people: n
Mean: X (with a bar on top)
Variance: s^2
Standard deviation: S or SD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What notations are used for the population?

A

Number of people: N
Mean: mu
Variance: sigma^2
Standard deviation: sigma

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the primary characteristics of a normal distribution?

A

Symmetric and unimodal (one peak)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why is a normal distribution the most important distribution in inferential statistics?

A

It’s characteristics form the foundational assumptions underlying many interferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Can you apply the empirical rule to any normal distribution?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the empirical rule?

A

68% of the observations fall within 1 standard deviation of the mean
95.4% of the observations fall within 2 standard deviations of the mean
99.7% of the observations fall within 3 standard deviations of the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

If something has a normal distribution, can you use the central limit theorem?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the three distributions?

A

Sample distributions (distribution of the sample)
Population distribution (distribution of the population)
Sampling distribution (distribution of a statistic over a set of of theoretical samples; distribution of sample means; plotted means of various samples)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Is the sampling distribution of the sample means approximately normal?

A

Yes
The sampling distribution of the sample means becomes “more normal” as n (or number of samples) increases
The mean of the sampling distribution of sample means will be the same as the mean of the population

17
Q

Since the distribution of the sample means is approximately a normal distribution, can the empirical rule be applied?

18
Q

Do we know that 95% of all sample means are within 2 SDs of the population mean?

A

Yes, based on the empirical rule
This theoretical assumption is the basis for inferential statistics

19
Q

What is the standard error of the mean?

A

Analogous to the SD of the population data
SEM = SD of the sample/square root of the sample size

20
Q

How is the standard deviation related to the sample error of the mean?

A

Just as the standard deviation increments the distance of a raw score from the population mean, the SEM increments the distance of the sample mean from the population mean

21
Q

What will make the SEM change?

A

The SD of the sample
The size of the sample

22
Q

Why are the standard error (SD) increments of the distribution of the sample means so much narrower?

A

Because we are just plotting means, not raw scores
Outliers are not applied

23
Q

What are z-scores?

A

Indicate how far a score is from the mean in a population context
Raw scores incremented by standard deviation units
A z-score indicates how far a raw score is from the population mean

24
Q

What are t-scores?

A

Indicate how far a score is from the mean in a sample context
Sample means incremented by standard error units
A t-score indicates how far a sample mean is from another mean
Sample means are distributed following the t-distribution

25
Do t-scores differ depending on sample size?
Yes
26
Is there a different t-distribution for each discrete number of degrees of freedom?
Yes aka sample sizes
27
As the sample size increases, does the t-value get closer to the z-score?
Yes
28
What is a confidence interval?
The interval (range of scores) which we believe includes the population parameter We can be 95% confident that ______ on average between...
29
Since we rarely know the population mean, what do we use instead?
the mean score of our sample the standard error of the mean the assumption that the sampling means are normally distributed (so we can apply the Central Limit Theorem)
30
What is a critical t value?
The “critical value” is the cutoff point for 5% of the distribution A test statistic value (t) exceeding the cut-off point is statistically significant, p<0.05
31
What do narrower confidence intervals indicate?
The more precisely we have estimated the population parameter Generally, the width of the confidence interval is inversely related to the size of the sample
32
What are the four sources of variability?
Differences attributable to group membership (between group) Sampling error (within group) Measurement error (within group) Individual differences (within group)
33
What is important to determine when comparing groups?
Does one group constitute their own population If 95% CI do not overlap between groups, the difference is significant (p<0.05)
34
If the 95% CI do overlap, is the difference significant?
No, p>0.05
35
How do you calculate the critical t score when you are comparing groups?
Based on the total number of subjects minus 2 *used when calculating the 95% CI around the mean difference for two groups
36
What are the two approaches for comparing the 95% CI of two groups?
Do CI for each group separately and then see if they overlap 95% CI around the mean difference (CI for the difference in mean between groups; if the interval crosses zero, they are not significantly different)
37